A method for merging contextual web pages

A method for merging contextual web pages, applied in fields such as special-purpose data processing applications, instruments, and electrical digital data processing. It addresses the lack of semantic analysis of web pages in existing approaches, which therefore cannot fully meet web page processing requirements, and it yields a clearer context relationship together with improved merging efficiency and quality.

Active Publication Date: 2014-10-29
HYLANDA INFORMATION TECH

AI Technical Summary

Problems solved by technology

[0006] However, the existing technologies represented by the above invention patents generally lack semantic analysis of web pages, and therefore cannot fully meet the processing requirements of web pages with dynamic and semi-structured characteristics.

Examples

Embodiment Construction

[0030] Compared with the prior art, a notable feature of the present invention is that, in the process of merging context web pages, the content of a web page is analyzed, the context link information is extracted and the linked content is downloaded, the context is automatically expanded according to the downloaded content, the expanded context content is deduplicated, and the result is recombined in sequence into a new single web page. This is described in detail below.
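The sequence described in [0030] (follow context links, expand, deduplicate, recombine in order) can be illustrated with a minimal sketch. It is not the patented implementation: the `Page` model, the `fetch` callback, the `max_pages` limit, and the toy `SITE` dictionary are all assumptions introduced for illustration, and deduplication here is a simple exact match on whitespace-normalised text blocks rather than any semantic analysis.

```python
from typing import Callable, Dict, List, Optional, Tuple

# Hypothetical page model: the page's text blocks plus the URL of the next page (or None).
Page = Tuple[List[str], Optional[str]]


def merge_context_pages(
    start_url: str,
    fetch: Callable[[str], Page],
    max_pages: int = 50,
) -> List[str]:
    """Follow next-page links from start_url and return deduplicated blocks in reading order."""
    merged: List[str] = []
    seen_blocks = set()
    seen_urls = set()
    url: Optional[str] = start_url
    # Stop at the last page, on a link cycle, or when the page limit is reached.
    while url is not None and url not in seen_urls and len(seen_urls) < max_pages:
        seen_urls.add(url)
        blocks, next_url = fetch(url)
        for block in blocks:
            key = " ".join(block.split())  # normalise whitespace before comparing
            if key and key not in seen_blocks:
                seen_blocks.add(key)
                merged.append(block)
        url = next_url
    return merged


# Toy in-memory "site" standing in for real downloads (assumed data).
SITE: Dict[str, Page] = {
    "/story?p=1": (["Site header", "Part one."], "/story?p=2"),
    "/story?p=2": (["Site header", "Part two."], "/story?p=3"),
    "/story?p=3": (["Site header", "Part three."], None),
}

merged_blocks = merge_context_pages("/story?p=1", SITE.__getitem__)
print(merged_blocks)
```

The repeated "Site header" block appears only once in the merged result, while the page bodies keep their original order. Tracking visited URLs keeps the loop from running forever if a "next" link points back to an earlier page.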

[0031] As shown in Figure 1, the original data processed by the present invention is one web page among multiple web pages with a contextual relationship. For this web page, it is first necessary to ensure that it has been fully downloaded and that a DOM (Document Object Model) tree has been generated after the page is fully displayed. This specifically includes the following:

[0032] IFrame, Frame, and similar embedded content have been downloaded

[0033] IFrame refers to a frame embedded in a web page, and Frame refers to a frame in ...
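One way to check the precondition in [0032] is to list the frame sources a page references and compare them with what has already been fetched. The sketch below uses Python's standard `html.parser`; the `FrameCollector` class, the `missing_frames` helper, and the idea of tracking downloads as a set of URLs are assumptions for illustration, not details taken from the patent.

```python
from html.parser import HTMLParser
from typing import List, Set


class FrameCollector(HTMLParser):
    """Collect the src attribute of every <iframe> and <frame> element."""

    def __init__(self) -> None:
        super().__init__()
        self.sources: List[str] = []

    def handle_starttag(self, tag, attrs):
        # html.parser passes tag names in lower case.
        if tag in ("iframe", "frame"):
            src = dict(attrs).get("src")
            if src:
                self.sources.append(src)


def missing_frames(html: str, downloaded: Set[str]) -> List[str]:
    """Return frame sources referenced by the page that are not yet downloaded."""
    collector = FrameCollector()
    collector.feed(html)
    return [src for src in collector.sources if src not in downloaded]


page = (
    '<html><body><iframe src="a.html"></iframe>'
    '<frameset><frame src="b.html"></frameset></body></html>'
)
pending = missing_frames(page, {"a.html"})
print(pending)
```

An empty result would indicate that every referenced frame is available, so the page can be treated as fully downloaded under this simplified check.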

Abstract

The invention discloses a method for combining context web pages. The method comprises the following steps: firstly analyzing the content of a certain web page among a plurality of web pages with context relation; extracting context link information from the web page, and downloading the corresponding content; expanding contexts according to the downloaded content; eliminating the duplicated content of the expanded contexts; and combining according to the sequence to obtain a new single web page. In the method provided by the invention, a semantic analysis technology for web pages is creatively introduced so as to obtain clearer context relation of the web pages, thereby greatly improving the combination efficiency and quality of the web pages.
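The step "extracting context link information from the web page" can be approximated, in the simplest case, by looking for anchors whose visible text marks a next-page link. The sketch below is only a keyword heuristic and does not reproduce the semantic analysis the invention describes; the `NEXT_LABELS` tuple, the `AnchorCollector` class, and the `find_next_link` function are hypothetical names chosen for this example.

```python
from html.parser import HTMLParser
from typing import List, Optional, Tuple

# Assumed anchor labels that signal a next-page link.
NEXT_LABELS = ("next", "next page", "下一页")


class AnchorCollector(HTMLParser):
    """Collect (href, visible text) pairs for every <a> element with an href."""

    def __init__(self) -> None:
        super().__init__()
        self.anchors: List[Tuple[str, str]] = []
        self._href: Optional[str] = None
        self._text: List[str] = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        # Only keep text that appears inside an open anchor.
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            self.anchors.append((self._href, "".join(self._text).strip()))
            self._href = None


def find_next_link(html: str) -> Optional[str]:
    """Return the href of the first anchor whose text matches a next-page label."""
    collector = AnchorCollector()
    collector.feed(html)
    for href, text in collector.anchors:
        if text.lower() in NEXT_LABELS:
            return href
    return None


snippet = '<p><a href="/p/1">Previous</a> <a href="/p/3">Next page</a></p>'
print(find_next_link(snippet))
```

A real page would need more than label matching (for example, numbered pagination or links identified by position), which is where an analysis of page content such as the one described in the abstract would come in.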

Description

Technical field

[0001] The invention relates to a method for merging multiple web pages that have a contextual relationship, and belongs to the technical field of web page production.

Background technique

[0002] With the rapid development of the Internet, the Web has become the largest source of information in the world. Its development has brought great convenience to daily life: people can share large amounts of information across boundaries of time and space. However, the Web is composed of countless web pages, and their massive scale, diversity, dynamic nature, and semi-structured form make automatic processing of their content difficult.

[0003] Currently, people commonly use mobile communication terminals such as mobile phones and tablet computers to access the Web. When reading a web page with a contextual relationship, it is necessary to click the next page link after reading the content of each page to see the content...

Application Information

Patent Type & Authority: Patents (China)
IPC(8): G06F17/30
Inventor: 王东胜
Owner: HYLANDA INFORMATION TECH