Internet data acquisition method with high matching degree
A data collection, Internet technology, applied in network data indexing, network data retrieval, other database retrieval and other directions, can solve the problems of poor matching of captured data, data duplication, etc., to avoid repeated capture, meet user needs, Wide range of effects
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Examples
Embodiment Construction
[0011] A method for collecting Internet data with a high degree of matching, the implementation process is as follows: first crawl the url list, provide the website url addresses that need to extract data for web crawlers, and store the website urls that need to extract data into the crawl url list; The crawler obtains the url information of the website that needs to extract data from the crawled url list; the web crawler obtains the corresponding page content from the corresponding url page and extracts the keyword information required by the user; the web crawler writes the extracted data into the database Middle; design the data analysis and comparison module, and process the data in the database through the data analysis and comparison module.
[0012] The web crawler performs data collection work according to the rules configured in advance by the user, and the configured rules include web page download rules, web page parsing rules, and content extraction rules.
[0013]...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com