Method, device and system for collecting effective information web pages in website information
A technology of effective information and website information, applied in the network field, can solve the problems of unstable crawling results and large resource consumption of the web crawler system, and achieve the effect of solving the problem of resource consumption, reducing interference, and improving utilization
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 2
[0099] The specific implementation of the device for collecting valid information webpages in the website information described in the second embodiment can be executed with reference to the above content. ) for background operations on such websites can refer to the method mentioned above, and will not be described in detail here.
[0100] Such as image 3 As shown, it is a system for collecting valid information webpages in website information according to the third embodiment of the present invention, including: a content management device (CMS, Content Management System) 301, a link library (URLDB) 302 and a web page collection device (Crawler ) 303; among them,
[0101] The content management device 301 is coupled with the link library 302, and is used for pre-configured list page URL link templates and product page URL link templates, wherein the pre-configured product page URL link templates include Product attribute information.
[0102] Wherein, in the URL link tem...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com