Method for realizing web crawler tasks
A web crawler task technique, applied in the field of web crawlers, that ensures crawling speed, shortens the development cycle, and reduces development difficulty.
Example
[0032] Embodiment: A method for establishing web crawler tasks for a physician database system. Specifically, it is a method for quickly building a fast, stable crawler for each target website when different websites must be crawled. The implementation is as follows:
[0033] A. In step S11, write a template for storing page link addresses. This step creates, for each page that needs to be crawled, a template that stores the page's link address information. The template works like a blank address record book: it saves the link address of each crawled page together with the depth of that page. For example, the link to the detail page of a Wanfang paper is:
[0034] (http://d.wanfangdata.com.cn/Periodical_ahzylczz201203001.aspx), and its page depth is 3, so the content stored in the template is this link address and the depth value 3.
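The link-address template of step S11 can be pictured as a small record type plus a container. The following is a minimal sketch, not the patent's implementation: the names PageLinkRecord and PageLinkTemplate are hypothetical, and skipping addresses that were already saved is an added assumption that the text does not state.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass(frozen=True)
class PageLinkRecord:
    """One entry in the 'record book': a page's link address and its depth."""
    url: str
    depth: int


class PageLinkTemplate:
    """Hypothetical container that keeps link records in insertion order."""

    def __init__(self) -> None:
        self._records: list[PageLinkRecord] = []
        self._seen: set[str] = set()

    def save(self, url: str, depth: int) -> bool:
        """Store the address and depth; return False if the address is already stored.

        Deduplication is an assumption of this sketch, not something the source states.
        """
        if url in self._seen:
            return False
        self._seen.add(url)
        self._records.append(PageLinkRecord(url, depth))
        return True

    def records(self) -> list[PageLinkRecord]:
        """Return a copy of the stored records."""
        return list(self._records)


# The example from the text: a Wanfang paper detail page at depth 3.
template = PageLinkTemplate()
template.save("http://d.wanfangdata.com.cn/Periodical_ahzylczz201203001.aspx", 3)
```

Keeping the depth next to the address lets later steps decide how far to follow links without re-deriving a page's position in the site.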
[0035] B. In step S12, write a link resolver. First, establish a regular expression, anal...
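The source text for step S12 is cut off, so the resolver's actual regular expression and behavior are not known. As a generic illustration of regex-based link parsing only, the sketch below extracts href values from anchor tags and pairs each with a depth. The pattern HREF_PATTERN, the function resolve_links, and the rule that a child link is one level deeper than its parent page are all assumptions of this sketch.

```python
from __future__ import annotations

import re
from urllib.parse import urljoin

# Assumed pattern: captures the quoted href value of an anchor tag.
HREF_PATTERN = re.compile(
    r'<a\s[^>]*?href\s*=\s*["\']([^"\']+)["\']',
    re.IGNORECASE,
)


def resolve_links(html: str, base_url: str, parent_depth: int) -> list[tuple[str, int]]:
    """Return (absolute_url, depth) pairs for every anchor href found in html.

    Assumption: each extracted link is one level deeper than the page it came from.
    """
    links = []
    for href in HREF_PATTERN.findall(html):
        absolute = urljoin(base_url, href)  # resolve relative links against the page URL
        links.append((absolute, parent_depth + 1))
    return links
```

A regular expression is enough for pages with a regular link layout; irregular or malformed HTML is usually better handled by an HTML parser.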