Method and device for capturing webpage content
A webpage content and webpage technology, applied in the field of webpage content crawling, can solve the problems of low efficiency of webpage content crawling and high complexity of webpage content crawling, and achieve the effect of reducing complexity and improving efficiency
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Example Embodiment
[0031] In order to solve the problems of high complexity of crawling web content and low efficiency of crawling web content in the current process of crawling different types of web content. In the embodiment of the present invention, when a webpage to be crawled is detected, the URL of the webpage to be crawled is searched from a preset crawling rule library, and when there is no crawling rule corresponding to the URL in the crawling rule library , To analyze the content of the webpage to be crawled, and generate crawling rules for the webpages to be crawled that meet the conditions. By adopting the technical scheme of the present invention, the content of the webpage to be crawled is analyzed, and the crawling rules corresponding to the webpage to be crawled are automatically generated according to the analysis result. There is no need to manually set the crawling rules, which effectively reduces the complexity of crawling web content and improves Improve the efficiency of we...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap