Unlock instant, AI-driven research and patent intelligence for your innovation.

Distributed search engine system and ID mapping table expanding method

A search engine and mapping table technology, applied in the field of search engines, can solve problems such as affecting search speed and search quality, and new segmentation units cannot be dynamically indexed at any time, so as to ensure consistency, improve search speed, increase The effect of reliability

Active Publication Date: 2007-11-14
SHENZHEN SHI JI GUANG SU INFORMATION TECH
View PDF0 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] However, often the search engine system will find many new segmentation units (that is, segmentation units that are not in the ID mapping table) during operation, and these new segmentation units cannot be dynamically indexed at any time
Since the size of the ID mapping table will determine how much information the system can retrieve, this ID mapping table that cannot be dynamically updated during the search process will affect the search speed and search quality

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Distributed search engine system and ID mapping table expanding method
  • Distributed search engine system and ID mapping table expanding method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0036] Referring to Fig. 1, the distributed search engine system of the embodiment of the present invention includes: text segmentation sub-systems 1 to N, N≥1, wherein each text segmentation sub-system includes: segmentation module 10, ID mapping module 20, ID mapping table storage module 30 , new word statistics table storage module 40 , and cutting subsystem transceiver module 50 .

[0037] Wherein, the segmentation module 10 is used to extract the segmentation unit from the source text grabbed from the Internet in the search engine system facing massive data, and send the extracted segmentation unit to the ID mapping module 20;

[0038] ID mapping table storage module 30, for preserving the ID mapping table of the one-to-one correspondence between the segmentation unit and the text ID, for the ID mapping module to search;

[0039] ID mapping module 20, for receiving the segmentation unit that segmentation module 10 outputs, is responsible for searching the text ID corresp...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a system and the distributed search engine ID mapping table expansion method. Including: at least one version of cutting elements, all the text elements include: segmentation module, module ID mapping and ID mapping table memory modules segmentation module will be extracted from the source text in the segmentation module to the mapping module ID, ID mapping module in ID Mapping Table View and separation unit corresponding text ID, and fill in all the information in the unit. Including: Statistics new words table memory modules for use in the preservation of the ID mapping from the table to find the cut in the number of units and a new word tables; cut molecular system transceiver modules. The term used for the acquisition of new tables and updated ID mapping table, new words in the clearance tables in the ID mapping table updated ID mapping between the unit and cut frequency; ID mapping modules for use in the new term in the statistical tables in the form of ID mapping view to the segmentation unit and the frequency.

Description

technical field [0001] The invention relates to search engine technology, in particular to a distributed search engine system and an ID (Identifier, identifier) ​​mapping table expansion method. Background technique [0002] In a distributed search engine system facing massive data, it is necessary to analyze the input text information to extract the segmentation units. In order to improve the retrieval performance of the search engine system, after each segmentation unit is proposed, a mapping relationship is established with a text ID. In the subsequent internal processing, this text ID is used as the unique identification of the segmentation unit. [0003] However, often the search engine system will find many new segmentation units (that is, segmentation units not in the ID mapping table) during operation, and these new segmentation units cannot be dynamically indexed at any time. Since the size of the ID mapping table will determine how much information the system can ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F17/30
Inventor 杨海松
Owner SHENZHEN SHI JI GUANG SU INFORMATION TECH