Hierarchical structure, distributed search engine system and implementation method thereof

A hierarchical structure and search engine technology, applied in special data processing applications, instruments, electrical digital data processing, etc., that addresses problems such as poor timeliness, invalid links, and the inability to index isolated "information island" resources on the network.

Status: Inactive
Publication Date: 2011-01-19
SOUTH CHINA UNIV OF TECH +1

AI Technical Summary

Problems solved by technology

The geographically dispersed, heterogeneous digital information on the network contains a large number of valuable resources, and users urgently need to find the information they want within it. At this scale, the processing power of a centralized search engine is limited, so multiple search engines need to cooperate to retrieve such a massive amount of information.
With the explosive growth of Internet information, traditional search engines have begun to show several limitations:
[0003] (1) Insufficient search depth: traditional search engines can only reach surface resources that are linked to each other on the Internet. Deep resources, such as pages that require access permissions, internal pages of organizations, and isolated "information island" resources, cannot be indexed.
[0004] (2) Poor timeliness: if the server update cycle is too long, a large number of invalid links accumulate. Crawling the Internet with web spiders cannot avoid invalid links entirely; dead links can only be minimized by shortening the update cycle.
[0005] (3) High cost: maintaining the index for a massive volume of resources requires very large server capacity.



Examples


Embodiment

[0072] As shown in Figure 1, the layered structure applied to a distributed search engine according to the present invention comprises, from bottom to top, a physical layer, an abstraction layer, an application layer and a presentation layer, wherein:

[0073] The physical layer provides the data source for the entire structure and is its foundation. It receives retrieval requests from the abstraction layer and carries out the data retrieval tasks of each search engine. It also passes the system's standard heartbeat information up through the abstraction layer, which encapsulates it and sends it to the application layer, where the physical layer's state information is updated. (Heartbeat information refers to the data packets that the physical layer periodically exchanges over the network.) The physical layer is a comprehensive distributed search engine composed of several homogeneous or heterogeneous uni...
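
The heartbeat path described in [0073] can be illustrated with a minimal Python sketch. All names here (`Heartbeat`, `AbstractionLayer.encapsulate`, `ApplicationLayer.on_heartbeat`, the envelope fields and the `timeout` parameter) are hypothetical; the text shown here does not specify the message format or how the application layer decides that a node is still alive.

```python
from dataclasses import dataclass


@dataclass
class Heartbeat:
    """Periodic status packet from one unit search engine (assumed fields)."""
    node_id: str
    load: float
    sent_at: float


class AbstractionLayer:
    """Wraps engine-specific heartbeats in one standard envelope."""

    def encapsulate(self, hb):
        return {"type": "heartbeat", "node": hb.node_id,
                "load": hb.load, "ts": hb.sent_at}


class ApplicationLayer:
    """Keeps the latest known state of every physical-layer node."""

    def __init__(self, timeout=30.0):
        self.timeout = timeout  # assumed liveness window, in seconds
        self.state = {}

    def on_heartbeat(self, envelope):
        # Overwrite the node's state with the most recent report.
        self.state[envelope["node"]] = {"load": envelope["load"],
                                        "last_seen": envelope["ts"]}

    def alive_nodes(self, now):
        # A node counts as alive if it reported within the timeout window.
        return sorted(node for node, s in self.state.items()
                      if now - s["last_seen"] <= self.timeout)


def demo_heartbeat():
    abstraction = AbstractionLayer()
    application = ApplicationLayer(timeout=30.0)
    t0 = 1000.0
    # engine-a reported just now; engine-b last reported 60 seconds ago.
    for hb in (Heartbeat("engine-a", 0.2, t0),
               Heartbeat("engine-b", 0.7, t0 - 60)):
        application.on_heartbeat(abstraction.encapsulate(hb))
    return application.alive_nodes(now=t0)
```

In this sketch the abstraction layer does nothing more than normalize the packet, which is enough to show why the application layer can treat homogeneous and heterogeneous engines the same way when it updates their state.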


Abstract

The invention discloses a hierarchical structure applied to a distributed search engine. The hierarchical structure comprises a physical layer, an abstraction layer, an application layer and a presentation layer. The invention also discloses a distributed search engine system, which comprises a Web server, an agent node, a query agent pool, an abstract adapter and a plurality of working nodes, wherein the query agent pool consists of a plurality of query nodes. The invention also discloses an implementation method for the distributed search engine system, which comprises the following steps: S1, registration of a query node; S2, registration of working nodes; S3, state update of the nodes; and S4, distribution and retrieval of a query request. The hierarchical structure has the advantages of good performance, high reliability, diversification, specialization, strong applicability and the like.
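
Steps S1 to S4 can be sketched as a small in-process simulation. This is only an illustration under assumptions: the class and method names (`AgentNode`, `register_query_node`, `register_working_node`, `update_state`, `dispatch`), the round-robin choice of query node, and the fan-out-and-merge retrieval are all hypothetical, and local method calls stand in for whatever remote transport the real system uses.

```python
from itertools import cycle


class WorkingNode:
    """Stand-in for one unit search engine holding a tiny local index."""

    def __init__(self, name, docs):
        self.name, self.docs, self.alive = name, docs, True

    def search(self, term):
        return [d for d in self.docs if term in d]


class QueryNode:
    """Member of the query agent pool; fans a query out to working nodes."""

    def __init__(self, name):
        self.name = name

    def run(self, term, workers):
        hits = []
        for w in workers:
            hits.extend(w.search(term))
        return sorted(set(hits))  # merge and de-duplicate results


class AgentNode:
    """Tracks registered nodes and hands each query to one query node."""

    def __init__(self):
        self.query_nodes, self.workers = [], {}
        self._next = None

    def register_query_node(self, qn):      # S1
        self.query_nodes.append(qn)
        self._next = cycle(self.query_nodes)  # restart round-robin (assumed policy)

    def register_working_node(self, wn):    # S2
        self.workers[wn.name] = wn

    def update_state(self, name, alive):    # S3
        self.workers[name].alive = alive

    def dispatch(self, term):               # S4
        qn = next(self._next)
        live = [w for w in self.workers.values() if w.alive]
        return qn.name, qn.run(term, live)


def demo_dispatch():
    agent = AgentNode()
    agent.register_query_node(QueryNode("q1"))
    agent.register_query_node(QueryNode("q2"))
    agent.register_working_node(WorkingNode("w1", ["grid search", "web crawler"]))
    agent.register_working_node(WorkingNode("w2", ["search engine", "heartbeat"]))
    first = agent.dispatch("search")
    agent.update_state("w2", alive=False)   # w2 is reported as down
    second = agent.dispatch("search")
    return first, second
```

After `w2` is marked as down in step S3, the second query is served by a different query node and only consults `w1`, which is the behavior the state update step is meant to enable.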

Description

Technical field
[0001] The present invention relates to the design and implementation of a distributed search engine system, and specifically to a distributed search engine service in which a plurality of unit search engines deployed on multiple clusters provide a unified search service based on Web Services and RMI (Remote Method Invocation) technologies.
Background technique
[0002] The rapid development of next-generation networks, the maturing of new-generation information technologies such as Web 2.0, and the distribution of information resources pose new challenges for the architecture design of search engines. On the one hand, geographically dispersed heterogeneous digital information contains a large number of valuable resources, and users urgently need to find the desired information within it. At such a scale, to retrieve such a massive amount of information, the processing power of the centralized search engine is li...


Application Information

Patent Type & Authority: Applications (China)
IPC: IPC(8): G06F17/30
Inventor: 董守斌, 李粤, 张凌, 李浩, 李嘉林, 袁华
Owner: SOUTH CHINA UNIV OF TECH