Character string retrieval method and system

A technology of character strings and characters, which is applied in the field of string retrieval methods and systems, can solve the problems of reduced retrieval performance, increased computation, and increased resource occupancy, and achieves the goals of improving retrieval performance, reducing computation, and ensuring fault tolerance Effect

Active Publication Date: 2013-11-06
HEFEI IFLYTEK TOYCLOUD TECH
View PDF5 Cites 20 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Compared with accurate retrieval, fault-tolerant recognition will increase the amount of calculation due to the expansion of unknown results, which will reduce the retrieval performance and greatly increase the resource occupancy rate.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Character string retrieval method and system
  • Character string retrieval method and system
  • Character string retrieval method and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0074] In order to enable those skilled in the art to better understand the solutions of the embodiments of the present invention, the embodiments of the present invention will be further described in detail below in conjunction with the drawings and implementations.

[0075] The character string retrieval method and system of the embodiment of the present invention adopts a multi-fork prefix tree to save the character string data set, saves the character string information on the termination node of the multi-fork prefix tree, and uses pronunciation instead of pronunciation between the parent node and the child node Characters are related to each other. Since there are many words with the same pronunciation in Chinese characters, compared with using Chinese character multi-tree, the memory usage is smaller.

[0076] When searching, continue or terminate a search path through the activation and inactivation status of the node path on the multi-fork tree. When the termination no...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a character string retrieval method and system. The method comprises the following steps: retrieval information input by a user is received; characters of a character string in the retrieval information are converted into Pinyin one by one, and the similar pronunciation collection of the characters is determined; the Pinyin of the characters and similar pronunciation in the similar pronunciation collection are input into a multi-way prefix tree for retrieval; when a node matched with the Pinyin or the similar pronunciation is retrieved, the node is recorded in the state of activation, the activation track is recorded, and after the Pinyin and the similar pronunciation of the next character enter the multi-way prefix tree, all the nodes in the state of activation are continuously retrieved till the terminal node is retrieved; keyword information stored by the terminal node on the activation track is acquired; the keyword information serves as the retrieval result to be shown to the user. Through utilizing the method, the retrieval performance can be improved under the fault-tolerant capability, and the calculation and resource occupation rate are reduced.

Description

technical field [0001] The invention relates to the technical field of information retrieval, in particular to a character string retrieval method and system. Background technique [0002] In the current Internet era, information retrieval is used in almost all industries, and different service providers, including search manufacturers, telecom manufacturers, etc., are no longer satisfied with simply providing users with retrieval information, but more with providing While retrieving information, provide services that users need to improve product experience. This requires the search information to have a high accuracy. Therefore, higher requirements are placed on the accuracy of the user input information and the accuracy of the search results. [0003] At present, the mainstream information input methods mainly include: pinyin input method, handwriting input method, Wubi input method and intelligent voice input method, among which voice input method is widely used in some...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
Inventor 石峰吴维昊郏全史峰路雪玲张磊聂小林
Owner HEFEI IFLYTEK TOYCLOUD TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products