A Focused Crawler Method Based on Link Analysis
A technology focusing on crawlers and link analysis, applied in network data navigation, special data processing applications, instruments, etc., can solve the problems of low accuracy and efficiency of web pages, improve efficiency and accuracy, simplify the processing process, and improve the accuracy. Effect
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0048] The present invention will be further described below in conjunction with the accompanying drawings and specific embodiments.
[0049] A focused crawler method based on link analysis, comprising the following steps:
[0050] (1) Grab the webpage, compare the structure of the webpage and the target sample webpage, determine the target webpage, start from the website entrance link, record each link path of the crawler to the target webpage, and establish a target webpage link tree.
[0051] The specific steps for establishing the link tree of the target web page are as follows:
[0052] (11) select a target webpage as the target sample webpage, for comparing the webpage structure to be downloaded;
[0053] (12) Initialize the link tree, that is, the link tree is set to an empty tree;
[0054](13) Initialize the link queue, add the entry link of the website to the end of the link queue. The link queue is a storage structure used to store the links extracted from the webp...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com