Internet topics file searching method, reptile system and search engine
A search method and crawler system technology, applied in special data processing applications, instruments, electrical digital data processing, etc., can solve the problems of incomplete collection and low efficiency of searching Internet subject files, and achieve the effect of improving search efficiency
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0062] Referring to FIG. 3 , it is a schematic structural diagram of the crawler system 1 provided by the present invention. It includes: a webpage and file download module 11 , a webpage parsing module 12 , a URL filtering module 13 , a collection control module 14 and a URL queue storage module 15 .
[0063] The functions of each module are described in detail below.
[0064] Web page and file download module 11: use HTTP, FTP protocol to download web page or file, and submit the downloaded web page to web page analysis module 12, submit the downloaded file to the indexing system of search engine to set up index database;
[0065] When the crawler system 1 just starts running, some seed URLs are set and put into the highest priority URL queue of the URL queue storage module 15 (its corresponding URL subject is divided into a default initial value), such as some common directory navigation webpages, such as www. hao123.com, the webpage and file download module 11 obtains the...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 