Eureka AIR delivers breakthrough ideas for toughest innovation challenges, trusted by R&D personnel around the world.

Webpage data obtaining method and system and data matching and pushing method

A webpage data and acquisition method technology, applied in the field of network information, can solve the problem of low data acquisition efficiency and achieve the effect of improving efficiency

Active Publication Date: 2018-03-30
FIFTH ELECTRONICS RES INST OF MINIST OF IND & INFORMATION TECH +2
View PDF3 Cites 8 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] Based on this, it is necessary to provide a webpage data acquisition method, system and data matching push method for the above-mentioned technical problem of low data acquisition efficiency

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Webpage data obtaining method and system and data matching and pushing method
  • Webpage data obtaining method and system and data matching and pushing method
  • Webpage data obtaining method and system and data matching and pushing method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0037] The technical solution of the present invention will be described in detail below in combination with specific embodiments and accompanying drawings, so as to make it more clear.

[0038] like figure 1 As shown, the present invention provides a method for obtaining web page data, which may include the following steps:

[0039] S10. Comparing the URL of the target webpage with the reference URL to determine the type of the URL;

[0040] S20. Determine a webpage search strategy according to the type of the URL;

[0041] S30. Collect webpage data of the target webpage according to the webpage search strategy.

[0042] In practical applications, the web page data may be relevant data of enterprises in the testing and certification service industry. For the convenience of description, the webpage data is the relevant data of enterprises in the testing and certification service industry as an example for illustration.

[0043] In this method, the webpage search strategy ca...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a webpage data obtaining method and system, a data matching and pushing method, a computer storage medium and a device. The webpage data obtaining method comprises the steps that a uniform resource locator of a target webpage is compared with a reference uniform resource locator to determine the type of the uniform resource locator; according to the type of the uniform resource locator, a webpage search strategy is determined; according to the webpage search strategy, webpage data of the target webpage is acquired. According to the scheme, the uniform resource locatorof the target webpage is compared with the preset reference uniform resource locator to determine the type of the uniform resource locator, the webpage search strategy is determined according to the type of the uniform resource locator; the webpage data of the target webpage is acquired by adopting the webpage search strategy, and accordingly webpage data obtaining efficiency is improved.

Description

technical field [0001] The present invention relates to the field of network information technology, in particular to a method and system for acquiring webpage data and a data matching and pushing method. Background technique [0002] With the rapid development of the Internet and the explosive growth of various network data, how to quickly obtain web page data from massive network information has become a major problem. [0003] The traditional web page data acquisition method is realized by web crawler technology, that is, starting from one or several initial URLs (Uniform Resource Locator, Uniform Resource Locator), and obtaining information in the web page through a set crawling order or method. , and then extract a new URL address from the webpage as the next-hop address, or perform appropriate address splicing on the basis of the original address to form a new next-hop analysis address, until the stop condition set by the system is met. [0004] However, there is a te...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F17/30
CPCG06F16/9535G06F16/955G06F16/958
Inventor 杨晓明刘业政赵国祥刘小茵贺菲菲李尧钱洋李玲菲姜元春孙见山孙春华
Owner FIFTH ELECTRONICS RES INST OF MINIST OF IND & INFORMATION TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Eureka Blog
Learn More
PatSnap group products