Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Thesaurus fuzzy enquiry method and thesaurus fuzzy enquiry system

A technology of fuzzy query and thesaurus, applied in the field of thesaurus query, can solve the problem of slow fuzzy query speed, and achieve the effect of fast query speed, reduced storage space, and comprehensive query

Active Publication Date: 2009-04-01
ALIBABA GRP HLDG LTD
View PDF1 Cites 7 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0020] The purpose of the present invention is to provide a kind of thesaurus fuzzy query method and thesaurus fuzzy query system, to solve the technical problem that existing fuzzy query speed is slow

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Thesaurus fuzzy enquiry method and thesaurus fuzzy enquiry system
  • Thesaurus fuzzy enquiry method and thesaurus fuzzy enquiry system
  • Thesaurus fuzzy enquiry method and thesaurus fuzzy enquiry system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0061] The present invention will be described in detail below in conjunction with the accompanying drawings.

[0062] Since the present inventor has absorbed the essence of the double-array Trie for the invention and creation, before introducing the fuzzy query of the thesaurus of the present invention in detail, first introduce the double-array Trie.

[0063] If you want to query the double-array Trie, you first need to construct the double-array Trie, and determine the base value array and the corresponding check value array.

[0064] Assume that there are only words "Ah, Argentina, Ejiao, Arabia, Arabs, Egypt" in the thesaurus.

[0065] First, encode all 10 Chinese characters that appear in the lexicon: Ah-1, Ah-2, Ai-3, Gen-4, Jiao-5, La-6, and-7, Ting-8, Bo-9 , person-10. This encoding is not unique, it only needs to encode all the characters in the thesaurus one by one, which can be sequential encoding, or the corresponding encoding of each Chinese character that alre...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a lexicon fuzzy searching method, comprising the steps as follows: (1) an entry data structure is established: (1-1) all entries in the lexicon are sequentially memorized in the entry memory unit of the entry data structure; (1-2) a positive-direction entry searching structure of the entry date structure is established; all words of all entries are coded correspondingly; subsequently, double-array Trie is established to determine a basic value array and a corresponding calibration value array; subsequently, the memory address information of all words which begin with each affix in the entry memory unit is memorized; the affix is corresponding to the array unit of the basic value array and the calibration value array; (2) when a query sentence is received, codes of all words in the query sentence are gained and the double-array Trie is used to find the basic value array unit where the query sentence is; subsequently, the memory address information of all words which begin with the affix in the entry memory unit corresponding to the basic value array unit is found so as to find all corresponding words in the entry memory unit. The method has extremely fast query speed.

Description

technical field [0001] The invention relates to a thesaurus query technology, in particular to a thesaurus query method and a thesaurus query system. Background technique [0002] At present, information retrieval has developed to the stage of networking and intelligence. The object of information retrieval has expanded from relatively closed, stable and consistent information content that is centrally managed by an independent database to open, dynamic, faster-updating, widely distributed, and loosely managed Web content; the users of information retrieval have also expanded from the original intelligence professionals From the general public, including business people, managers, teachers and students, professionals, etc., they put forward higher and more diverse requirements for information retrieval from results to methods. Adapting to the needs of networking, intelligence and personalization is a new trend in the development of information retrieval technology. [0003...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F17/30
Inventor 孙海涛施行向
Owner ALIBABA GRP HLDG LTD
Features
  • Generate Ideas
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More