Distributed search method and system
A distributed and indexing technology, applied in the field of computer communication, can solve the problems of slow index file speed and limited number of saved index files, and achieve the effect of improving retrieval speed and efficiency
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0042] The core of the present invention is to adopt a distributed computing framework, which can call the CPU resources of the cluster in parallel to realize the construction and query of the distributed index. Further, in the technical solution of the embodiment of the present invention, a method of crawling webpages in a step-by-step manner is also adopted to increase the speed of webpage crawling.
[0043] The technical solutions of the embodiments of the present invention will be described in detail below with reference to the accompanying drawings. figure 1 The shown distributed retrieval system includes: a collection node cluster, an index node cluster, and a retrieval node 105.
[0044] The collection node cluster includes multiple collection nodes 101, and each collection node 101 has a web crawler module, which is used to structure the crawled web pages after web pages are crawled, such as extracting web page time, title, content, The host and other information generate a...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 

