Distributed search method and system
A distributed indexing technology, applied in the field of computer communication, that addresses slow index file processing and the limited number of index files that can be saved, thereby improving retrieval speed and efficiency.
Embodiment Construction
[0042] The core of the present invention is a distributed computing framework that can use the CPU resources of a cluster in parallel to build and query a distributed index. In addition, the technical solution of the embodiment of the present invention adopts a step-by-step webpage crawling method to improve webpage crawling speed.
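The parallel construction and query of a distributed index described in [0042] can be illustrated with a minimal sketch. It is not the patented implementation: the function names (`build_shard_index`, `build_distributed_index`, `search`), the round-robin partitioning, and the whitespace tokenizer are all assumptions made for illustration, and a local thread pool stands in for the cluster nodes (in CPython this shows the fan-out structure rather than real CPU parallelism).

```python
from collections import defaultdict
from concurrent.futures import ThreadPoolExecutor


def build_shard_index(docs):
    """Build an inverted index (term -> set of doc ids) for one shard."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for term in text.lower().split():  # assumed tokenizer: whitespace split
            index[term].add(doc_id)
    return dict(index)


def build_distributed_index(docs, num_shards):
    """Partition documents round-robin (an assumption) and index shards in parallel.

    num_shards must be at least 1.
    """
    shards = [{} for _ in range(num_shards)]
    for i, (doc_id, text) in enumerate(sorted(docs.items())):
        shards[i % num_shards][doc_id] = text
    # Each worker plays the role of one index node.
    with ThreadPoolExecutor(max_workers=num_shards) as pool:
        return list(pool.map(build_shard_index, shards))


def search(shard_indexes, term):
    """Send the query to every shard in parallel and merge the partial hits."""
    term = term.lower()
    with ThreadPoolExecutor(max_workers=len(shard_indexes)) as pool:
        partials = pool.map(lambda idx: idx.get(term, set()), shard_indexes)
        return sorted(set().union(*partials))


if __name__ == "__main__":
    docs = {
        "d1": "distributed search index",
        "d2": "web crawler",
        "d3": "search engine",
    }
    indexes = build_distributed_index(docs, num_shards=2)
    print(search(indexes, "search"))
```

In this sketch the merge step corresponds to the role a retrieval node would play: it only combines per-shard results and never holds the full index itself.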
[0043] The technical solutions of the embodiments of the present invention are described in detail below with reference to the accompanying drawings. The distributed retrieval system shown in Figure 1 includes a collection node cluster, an index node cluster, and a retrieval node 105.
[0044] The collection node cluster includes a plurality of collection nodes 101. Each collection node 101 has a web crawler module, which fetches webpages and then performs structural processing on them, such as extracting the time, title, content, host and other informat...
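The structural processing step in [0044] can be sketched as turning raw HTML into a record with time, title, content and host fields. This is a hedged illustration using only the Python standard library: the function name `structure_page`, the record layout, and the choice to take the time from a caller-supplied `fetched_at` value are assumptions, since the source does not say how each field is obtained.

```python
from html.parser import HTMLParser
from urllib.parse import urlparse


class _PageParser(HTMLParser):
    """Collects the <title> text and the visible body text of a page."""

    def __init__(self):
        super().__init__()
        self.title_parts = []
        self.text_parts = []
        self._in_title = False
        self._skip_depth = 0  # > 0 while inside <script> or <style>

    def handle_starttag(self, tag, attrs):
        if tag == "title":
            self._in_title = True
        elif tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False
        elif tag in ("script", "style") and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        if self._in_title:
            self.title_parts.append(data)
        elif self._skip_depth == 0 and data.strip():
            self.text_parts.append(data.strip())


def structure_page(url, html, fetched_at):
    """Return a structured record for one fetched page (hypothetical layout)."""
    parser = _PageParser()
    parser.feed(html)
    parser.close()
    return {
        "time": fetched_at,  # assumption: timestamp supplied by the crawler
        "title": "".join(parser.title_parts).strip(),
        "content": " ".join(parser.text_parts),
        "host": urlparse(url).hostname,
    }


if __name__ == "__main__":
    page = "<html><head><title>Demo</title></head><body><p>Hello</p></body></html>"
    print(structure_page("http://example.com/a", page, "2024-01-01T00:00:00Z"))
```

Records of this shape are what a collection node could hand to the index node cluster; any fields beyond the four named in the text are left out because the source is truncated at that point.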