Method for constructing personalized dictionary applicable to mobile search

A construction method and mobile search technology, applied in network data retrieval, network data indexing, other database retrieval, etc., can solve the problem of the large query range of the second word, the long length of the second word hash table, and the difficulty of constructing the second word hash table, etc. problem, to achieve the effect of improving query efficiency

Active Publication Date: 2014-03-26
XIAN UNIV OF POSTS & TELECOMM
View PDF2 Cites 7 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] At present, the commonly used dictionary mechanism is mainly based on the three methods of whole word dichotomy, TRIE index tree method and word-for-word dichotomy. Since these methods realize the search for subwords through dichotomy, as the number of entries in the word segmentation dictionary grows, It will cause the query range of subwords to be too large, and the degree of efficiency improvement is very limited
There are also double-word or multi-word hashing mechanisms derived on this basis, but these methods will make the length of the sub-word hash table too long or make it difficult to construct the sub-word hash table, resulting in a complex storage structure of the dictionary that is difficult to manage
At the same time, in view of the fact that the current word segmentation dictionary based on the conventional word segmentation cannot obtain the interests of the user's query content after the word segmentation, so it cannot meet the high-precision and personalized query requirements in mobile search.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for constructing personalized dictionary applicable to mobile search
  • Method for constructing personalized dictionary applicable to mobile search
  • Method for constructing personalized dictionary applicable to mobile search

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0020] The present invention will be further described in detail below in conjunction with the accompanying drawings.

[0021] figure 1 It is a schematic diagram of the logical structure of the personalized dictionary proposed by the present invention, which is divided into four levels, which are respectively the hash index table of the first word, the segmented hash index table of the location code of the second word, the index table of the second word and the dictionary text.

[0022] The first word hash index table is composed of the first word of the word and related attribute information and the pointer to the lower unit. Its data structure in memory is as follows: figure 2 shown. Among them, isWord, frequency and coding are related attribute information of the first word, respectively indicating whether it is a word, frequency of occurrence and classification coding information; s_hash stores the first address of the subordinate unit corresponding to the current word a...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a method for constructing a personalized dictionary applicable to mobile search. A secondary word zone bit code segment hash mechanism introduced in a dictionary structure is used for dividing entries with same first words into a plurality of subzones according to secondary word zone bit codes, so that secondary words can be fast searched in a small range by using a dichotomy, and the dictionary search efficiency is effectively improved. At the same time, relative information comprising classification and use frequency is introduced into each entry structure of the personalized dictionary, classification information of content searched by a user can be directly acquired after segmenting words, so that the mobile search personalized requirement is met; relative treatment of query expansion and query suggestion is performed by a system conveniently.

Description

technical field [0001] The invention relates to the technical field of Chinese information processing in mobile search, and specifically relates to a method for constructing a personalized dictionary in mobile search. Background technique [0002] A word is the smallest unit with certain semantics. In order to realize the machine's understanding of Chinese sentences, it is first necessary to perform word segmentation to determine each word in the sentence. The so-called word segmentation is to segment a sentence according to the meaning of the words in it. Automatic word segmentation is the basic link of Chinese information processing. The dictionary mechanism and processing efficiency referred to by word segmentation directly affect the system processing efficiency and the information that can be provided after word segmentation. [0003] At present, the commonly used dictionary mechanism is mainly based on the three methods of whole word dichotomy, TRIE index tree method ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F16/951G06F40/242
Inventor 王忠民齐静娜贺炎邓万宇梁琛王文浪
Owner XIAN UNIV OF POSTS & TELECOMM
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products