Method and device for collecting web page data of directional sites based on the Internet
The invention relates to web page data and Internet technology and is applied in electrical digital data processing, special data processing applications, instruments, etc. It addresses problems such as the inability to guarantee data collection from the collection site, and achieves effective data collection.
Embodiment Construction
[0015] To solve the problem that prior-art collection systems cannot guarantee timely and effective data collection from the collection site, the embodiment of the present invention provides a method for collecting web page data from directional sites on the Internet. The method concerns in particular URL priority and the priority management of the collection queue (that is, the queue of URLs to be accessed in the collection system). Specifically, a collection task is configured, including the starting URL and the collection depth. Web page data is collected beginning from the specified starting URL; each new URL parsed from a collected page (i.e. a URL to be collected) is assigned a priority according to the URL classification mechanism and inserted into the corresponding priority queue. In this embodiment, a URL to be collected refers to a URL that is to be collected and has been added to the queue of URLs to be accessed.
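The flow described in paragraph [0015] can be illustrated with a minimal Python sketch. It is not the patented implementation: the names `CollectionTask`, `classify`, `PriorityFrontier`, and `collect` are hypothetical, the three-level classification rule is an assumption made only for illustration (the excerpt does not state how URLs are classified), and page downloading and parsing are replaced by a caller-supplied `fetch_links` function.

```python
from collections import deque
from dataclasses import dataclass
from urllib.parse import urlparse


@dataclass
class CollectionTask:
    """Collection task configuration: starting URL and collection depth."""
    start_url: str
    max_depth: int


def classify(url: str) -> int:
    """Hypothetical URL classification rule; 0 is the highest priority.

    Assumption for illustration only: index/list pages first,
    ordinary HTML pages second, everything else last.
    """
    path = urlparse(url).path.lower()
    if path in ("", "/") or path.endswith(("/index.html", "/list.html")):
        return 0
    if path.endswith((".html", ".htm")):
        return 1
    return 2


class PriorityFrontier:
    """Queue of URLs to be accessed, split into one FIFO queue per priority."""

    def __init__(self, levels: int = 3):
        self.queues = [deque() for _ in range(levels)]
        self.seen = set()

    def add(self, url: str, depth: int) -> bool:
        """Insert a URL into the queue matching its priority, once only."""
        if url in self.seen:
            return False
        self.seen.add(url)
        self.queues[classify(url)].append((url, depth))
        return True

    def next(self):
        """Return the next (url, depth) from the highest non-empty queue."""
        for queue in self.queues:
            if queue:
                return queue.popleft()
        return None


def collect(task: CollectionTask, fetch_links):
    """Visit URLs in priority order, expanding links up to the task depth.

    fetch_links(url) stands in for downloading a page and parsing its links.
    """
    frontier = PriorityFrontier()
    frontier.add(task.start_url, 0)
    visited = []
    while (item := frontier.next()) is not None:
        url, depth = item
        visited.append(url)
        if depth < task.max_depth:
            for link in fetch_links(url):
                frontier.add(link, depth + 1)
    return visited
```

Keeping one FIFO queue per priority level means URLs of equal priority are served in discovery order, while a newly discovered high-priority URL is always returned before any lower-priority URL that was queued earlier.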
[0016] When the web page download module requests an available URL fr...