Vertical intelligent crawler data collecting method based on webpage data capture
A webpage data capture and collection technology, applied in the field of data collection, that addresses the problems of high maintenance cost, inconvenient maintenance and function expansion, and efficiency, and makes the crawler convenient to extend.
Embodiment Construction
[0015] As shown in Figure 1, the vertical intelligent crawler data collection method based on webpage data capture includes the following steps. In step ①, the initial entry address of the crawler is configured into the start-up module through the start-stop entry configuration module. In step ②, the crawler control system runs a depth-first algorithm to traverse and crawl webpages according to the configured crawling rules and crawling process. In step ③, the crawler parses and extracts the page data using the rule sequence pairs of the rule configuration system, and stores the extracted data as two-dimensional structured data.
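The three steps above can be sketched in Python as follows. This is only an illustrative sketch, not the patented implementation: the function names `crawl_depth_first` and `extract_row`, the `max_depth` limit, the representation of a rule sequence pair as a (column name, regular expression) tuple, and the in-memory `PAGES` dictionary that stands in for real HTTP fetching are all assumptions made for the example.

```python
import re
from typing import Callable, Dict, List, Tuple

# Assumed shape of a rule pair: (column name, regex with one capture group).
Rule = Tuple[str, str]

# Simplified link extraction; a real crawler would use an HTML parser.
LINK_RE = re.compile(r'href="([^"]+)"')


def extract_row(html: str, rules: List[Rule]) -> Dict[str, str]:
    """Apply each (name, pattern) rule pair in order and build one table row."""
    row = {}
    for name, pattern in rules:
        match = re.search(pattern, html)
        row[name] = match.group(1) if match else ""
    return row


def crawl_depth_first(
    entry_urls: List[str],
    fetch: Callable[[str], str],
    rules: List[Rule],
    max_depth: int = 3,
) -> List[Dict[str, str]]:
    """Traverse pages depth-first from the entry URLs and collect extracted rows."""
    visited = set()
    rows = []
    # Explicit stack of (url, depth); reversed so the first entry URL is crawled first.
    stack = [(url, 0) for url in reversed(entry_urls)]
    while stack:
        url, depth = stack.pop()
        if url in visited or depth > max_depth:
            continue
        visited.add(url)
        html = fetch(url)
        row = extract_row(html, rules)
        row["url"] = url
        rows.append(row)  # each row is one record of the two-dimensional result
        # Push child links in reverse so they are visited in document order.
        for link in reversed(LINK_RE.findall(html)):
            if link not in visited:
                stack.append((link, depth + 1))
    return rows


# In-memory stand-in for the network, so the sketch runs offline.
PAGES = {
    "a": '<h1>A</h1><a href="b"></a><a href="c"></a>',
    "b": '<h1>B</h1><a href="d"></a>',
    "c": '<h1>C</h1>',
    "d": '<h1>D</h1><a href="a"></a>',
}

rows = crawl_depth_first(["a"], lambda u: PAGES.get(u, ""), [("title", r"<h1>(.*?)</h1>")])
print([r["url"] for r in rows])
```

Using an explicit stack instead of recursion keeps the traversal depth-first while avoiding recursion limits on deep sites; the `visited` set prevents the cycle from page `d` back to page `a` from looping forever.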
[0016] In a preferred embodiment of the present invention, to facilitate subsequent configuration and use, the configuration module and the start-up module are located on the server, and the initial entry address of the crawler is statically imported through the specified crawler URL list file, or, through the c...
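The static import of entry addresses from a URL list file might look like the sketch below. The file layout is an assumption, since the source does not specify it: one URL per line, with blank lines and lines starting with `#` ignored. The function name `load_entry_urls` and the file name `crawler_urls.txt` are hypothetical.

```python
import os
import tempfile
from pathlib import Path
from typing import List


def load_entry_urls(path: str) -> List[str]:
    """Read one URL per line, skipping blanks, '#' comments, and duplicates."""
    urls = []
    seen = set()
    for line in Path(path).read_text(encoding="utf-8").splitlines():
        url = line.strip()
        if not url or url.startswith("#"):
            continue
        if url not in seen:  # keep first occurrence, preserve file order
            seen.add(url)
            urls.append(url)
    return urls


# Demonstration with a temporary list file (assumed format, see above).
with tempfile.TemporaryDirectory() as tmp:
    list_file = os.path.join(tmp, "crawler_urls.txt")
    Path(list_file).write_text(
        "# seed pages\n"
        "http://example.com/a\n"
        "\n"
        "http://example.com/b\n"
        "http://example.com/a\n",
        encoding="utf-8",
    )
    entry_urls = load_entry_urls(list_file)

print(entry_urls)
```

The resulting list can then be handed to the start-up module as the crawler's initial entry addresses.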