
Distributed search engine system and ID mapping table expanding method

A search engine and ID mapping table technology, applied in the field of search engines. It addresses the problem that new segmentation units cannot be dynamically indexed at any time, which affects search speed and search quality. Its stated effects are to ensure consistency, improve search speed, and improve search quality.

Active Publication Date: 2009-06-24
SHENZHEN SHI JI GUANG SU INFORMATION TECH

AI Technical Summary

Problems solved by technology

[0003] However, during operation the search engine system often encounters many new segmentation units (that is, segmentation units that are not in the ID mapping table), and these new segmentation units cannot be dynamically indexed at any time.
Since the size of the ID mapping table determines how much information the system can retrieve, an ID mapping table that cannot be dynamically updated during the search process degrades search speed and search quality.



Examples


Detailed Description of Embodiments

[0037] Referring to Figure 1, the distributed search engine system in the embodiment of the present invention includes text segmentation subsystems 1 to N (N ≥ 1). Each text segmentation subsystem includes: a segmentation module 10, an ID mapping module 20, an ID mapping table storage module 30, a new-word statistics table storage module 40, and a segmentation subsystem transceiver module 50.

[0038] The segmentation module 10 extracts segmentation units from the source text that the search engine system, which faces massive data, has crawled from the Internet, and sends the extracted segmentation units to the ID mapping module 20;

[0039] The ID mapping table storage module 30 stores the ID mapping table, which holds the one-to-one correspondence between segmentation units and text IDs, for the ID mapping module to search;
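The one-to-one correspondence described in [0039] could be sketched roughly as follows. This is only an illustration, not the patent's implementation: the class name `IdMappingTable`, its methods, and the use of integer text IDs are all assumptions made for the example.

```python
class IdMappingTable:
    """Hypothetical one-to-one table between segmentation units and text IDs."""

    def __init__(self):
        self._unit_to_id = {}
        self._id_to_unit = {}

    def add(self, unit, text_id):
        # Reject anything that would break the one-to-one correspondence.
        if unit in self._unit_to_id or text_id in self._id_to_unit:
            raise ValueError("unit or text ID is already mapped")
        self._unit_to_id[unit] = text_id
        self._id_to_unit[text_id] = unit

    def lookup(self, unit):
        # Returns None when the unit is not in the table (a "new" unit).
        return self._unit_to_id.get(unit)

    def unit_for(self, text_id):
        # Reverse direction: which segmentation unit a text ID stands for.
        return self._id_to_unit.get(text_id)


table = IdMappingTable()
table.add("search", 1)
table.add("engine", 2)
```

Keeping both directions in separate dictionaries makes it cheap to check that neither a unit nor an ID is ever mapped twice.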

[0040] The ID mapping module 20 receives the segmentation units output by the segmentation module 10 and is responsible for searching the text ID corr...


Abstract

The invention discloses a distributed search engine system and an ID mapping table expansion method. The system comprises at least one text segmentation subsystem, which includes a segmentation module, an ID mapping module, and an ID mapping table storage module. The segmentation module sends the segmentation units extracted from the source text to the ID mapping module, and the ID mapping module searches the ID mapping table for the text ID corresponding to each segmentation unit and fills it into the segmentation unit information. The subsystem further includes a new-word statistics table storage module, which saves the new-word statistics table formed from the segmentation units not found in the ID mapping table and their numbers of occurrences, and a segmentation subsystem transceiver module, which obtains the new-word statistics table, updates the ID mapping table, and clears from the new-word statistics table the segmentation units (and their occurrence counts) whose ID mapping relationships have been updated in the ID mapping table. The ID mapping module also records in the new-word statistics table each segmentation unit not found in the ID mapping table and its number of occurrences.
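The flow in the abstract (look up each unit, count the misses, then update the table and clear the counted entries) might look like the minimal sketch below. It rests on several assumptions: the class and method names are invented for illustration, and the abstract does not say who assigns IDs to new units or by what rule, so `assign_ids` and its `min_count` threshold are purely hypothetical stand-ins.

```python
from collections import Counter


class TextSegmentationSubsystem:
    """Hypothetical stand-in for one text segmentation subsystem."""

    def __init__(self, id_table):
        self.id_table = dict(id_table)  # ID mapping table storage module
        self.new_words = Counter()      # new-word statistics table storage module

    def map_units(self, units):
        # ID mapping module: fill in the text ID, or count the unit as new.
        infos = []
        for unit in units:
            text_id = self.id_table.get(unit)
            if text_id is None:
                self.new_words[unit] += 1
            infos.append({"unit": unit, "text_id": text_id})
        return infos

    def get_new_word_table(self):
        # Transceiver module: hand out a copy of the new-word statistics table.
        return dict(self.new_words)

    def apply_update(self, new_mappings):
        # Transceiver module: update the ID mapping table, then clear the
        # statistics entries for the units that were just mapped.
        for unit, text_id in new_mappings.items():
            self.id_table[unit] = text_id
            self.new_words.pop(unit, None)


def assign_ids(stats, next_id, min_count=2):
    # Hypothetical policy (not from the source): give an ID to every new
    # unit seen at least min_count times, in alphabetical order.
    new_mappings = {}
    for unit, count in sorted(stats.items()):
        if count >= min_count:
            new_mappings[unit] = next_id
            next_id += 1
    return new_mappings


sub = TextSegmentationSubsystem({"search": 1, "engine": 2})
infos = sub.map_units(["search", "blog", "engine", "blog", "wiki"])
stats = sub.get_new_word_table()
update = assign_ids(stats, next_id=3)
sub.apply_update(update)
```

In this run, "blog" is seen twice and receives an ID, while "wiki" stays in the new-word statistics table until it is counted again or a later update maps it.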

Description

Technical field

[0001] The invention relates to search engine technology, in particular to a distributed search engine system and an ID (identifier) mapping table expansion method.

Background

[0002] In a distributed search engine system facing massive data, the input text information must be analyzed to extract the segmentation units. To improve the retrieval performance of the search engine system, each segmentation unit, once extracted, is mapped to a text ID. In subsequent internal processing, this text ID serves as the unique identifier of the segmentation unit.

[0003] However, during operation the search engine system often encounters many new segmentation units (that is, segmentation units not in the ID mapping table), and these new segmentation units cannot be dynamically indexed at any time. Since the size of the ID mapping table will determine how much information the system can ...


Application Information

Patent Type & Authority: Patents (China)
IPC: IPC(8): G06F17/30
Inventor: 杨海松
Owner: SHENZHEN SHI JI GUANG SU INFORMATION TECH