Unlock instant, AI-driven research and patent intelligence for your innovation.

Method and device for obtaining web page information

A web page information and acquisition method technology, applied in the Internet field, can solve problems such as inability to guarantee accuracy, low error tolerance, and content that is not expected, and achieve the effects of improving user experience, enhancing anti-interference, and avoiding analysis failures

Active Publication Date: 2019-11-15
ADVANCED NEW TECH CO LTD
View PDF7 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

This needs to obtain the DOM tree template of each webpage separately in advance, which is a very heavy workload, and, with the adjustment of the webpage structure of each website, if the template of the changed webpage cannot be updated in time, the extracted content will not be expected content
In addition, there will be some promotional or promotional content on the webpage, which may include the same content as the content to be parsed. At this time, the accuracy cannot be guaranteed
[0004] At present, it is also possible to provide users with the required information by dividing the content of each webpage into multiple content blocks, and extracting the content blocks corresponding to the content types required by the user according to the type corresponding to each content block, but it is still impossible Accurately obtain web page content, and the stability is relatively low, and even extract wrong information to the user. For some information that requires accurate data and its corresponding relationship, the error tolerance is low. Once an error occurs, it will Bring great inconvenience to users, and even serious economic losses

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and device for obtaining web page information
  • Method and device for obtaining web page information
  • Method and device for obtaining web page information

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0023] Embodiments of the present application are described in detail below, examples of which are shown in the drawings, wherein the same or similar reference numerals denote the same or similar elements or elements having the same or similar functions throughout. The embodiments described below by referring to the figures are exemplary, and are only for explaining the present application, and should not be construed as limiting the present application.

[0024] In the description of the present application, it should be understood that the terms "center", "longitudinal", "transverse", "upper", "lower", "front", "rear", "left", "right", " The orientation or positional relationship indicated by "vertical", "horizontal", "top", "bottom", "inner", "outer", etc. is based on the orientation or positional relationship shown in the drawings, and is only for the convenience of describing the present application and The description is simplified, rather than indicating or implying tha...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a webpage information acquisition method and device. The webpage information acquisition method comprises the following steps: obtaining a to-be-analyzed webpage; extracting to-be-analyzed keywords from the to-be-analyzed webpage; obtaining the positions of the to-be-analyzed keywords in the to-be-analyzed webpage; and obtaining contents corresponding to the to-be-analyzed keywords from the to-be-analyzed webpage according to the relationship among the positions. According to the webpage information acquisition method, the analysis failure caused by the interference of the to-be-analyzed keywords contained in the non-essential contents in the to-be-analyzed webpage can be avoided, and the anti-interference performance of the webpage information obtaining can be strengthened, so that the success rate of the webpage information obtaining is improved and the correctness of the webpage information obtaining is improved. Moreover, the webpage information acquisition information method and device are capable of filtering the useless information from the webpages to a great extent and accurately extracting the information required by the users, so that the user experience is enhanced.

Description

technical field [0001] The present application relates to the technical field of the Internet, in particular to a method and device for acquiring web page information. Background technique [0002] With the development of Internet technology, network resources are increasingly abundant, and users can browse different web content through the Internet. In order to reduce the browsing cost of the user, webpage information in the Internet can be extracted, so that the extracted information that may be needed by the user and useful to the user can be provided to the user. [0003] The traditional method for extracting webpage information is to obtain the DOM (Document Object Model, Document Object Model) tree template of each webpage in advance, and then according to the position of the information to be extracted in the DOM tree template corresponding to the webpage, determine the information to be extracted in the parsed node in the DOM tree of the web page, and extract the co...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/958
Inventor 陈俊文
Owner ADVANCED NEW TECH CO LTD