Unlock instant, AI-driven research and patent intelligence for your innovation.

Prime number replacing character string search technology

A retrieval technology, string technology, applied in digital data information retrieval, unstructured text data retrieval, electronic digital data processing and other directions, can solve the problems of inefficiency, low efficiency, inability to effectively improve fuzzy retrieval efficiency, etc. To achieve the effect of improving efficiency and reducing the amount of calculation

Inactive Publication Date: 2005-03-02
徐文新
View PDF0 Cites 10 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0002] Complicated Chinese characters cannot be input with common font codes such as Wubi fonts. Quanpin input method or internal code input is usually used, which is inefficient and no effective solution has been found so far.
[0003] On the other hand, the character string fuzzy search in the database is carried out by bit-by-bit comparison. For example, to judge whether the character string bdopfqew contains the character f, the computer uses f to compare the character string starting from b, which is not efficient. Indexing on string fields cannot effectively improve the efficiency of fuzzy retrieval

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0024] 400 to 600 prime numbers N represent the basic radicals and strokes of Chinese characters, such as 2 for "big", 3 for "ding", 5 for "mouth", 7 for "wood", and 11 for "亻". The F value of the product of the prime numbers of the basic radicals is assigned to all Chinese characters, such as "Ke" is composed of "D" and "Kou", and 3*5=15, then the F value of "Ke" is 15. Similarly, the F value of "Qi" is 2*5*7=30, the F value of "Ke" is 7*3*5=105, and the F value of "He" is 11*3*5=165. Thus, a database containing all Chinese characters and their corresponding values ​​F can be constructed.

[0025] After assigning values ​​to all Chinese characters in this way, if a certain Chinese character contains a certain radical, then the F value of this Chinese character must be divisible by the prime number N of a certain radical. If "Ke" contains "口", the F value of "Ke" must be divisible by the prime number N"5" of "Ke"; on the other hand, that is, all Chinese characters whose F val...

Embodiment 2

[0030] Use 2, 3, 5, 7, 11, 13, 17, 19, 23, 29, 31, 37, 41, 43, 47, 53, 59, 61, 67, 71, 73, 79, 83, 89, 97 ,101,103,107,109,113,127,131,137,139,149,151,157,163,167,173,179,181,191,193,197,199,211,223,227,229 56 prime numbers such as , 233, 239, 241, 251, 257, 263 represent 56 English letters such as cluster a-z and A-Z, then English words can be represented by the product F of these prime numbers. If the value of able is 2*3*37*11=2442, it is possible to establish a database of English words and their corresponding F values. In the data, all words with the suffix "able" must be divisible by 2442, so you can use whether it is divisible by 2442 to find all the words with the suffix "able".

[0031] Searching with this method will retrieve non-target words such as bale, which is not an accurate query, but it can effectively narrow the scope of the search, and then use the character comparison method to perform a secondary search to achieve an accurate query.

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

Several prime numbers, N1, N2 ... are used in replacing several master characters, P1, P2 ..., the product of several prime numbers, N1*N2..., named F value, is used in replacing the character string H comprising these master characters, so are F1, F2, F3, F4 ... for the character strings H1, H2, H3, H4 ..., and thus, a character string information base is established. When Fn may be divided exactly by N1*N2..., the corresponding character string Hn contains the master characters P1, P2 ... corresponding to N1, N2 ..., and thus the fuzzy search of character string in digital base is realized. The similar search method may be also performed to long integral data. Similarly, when the Chinese character radicals P1, P2 ... are assigned with prime numbers, N1, N2 ..., Chinese characters H1, H2, H3, H4 ... may have corresponding radical products F1, F2, F3, F4 ... and may be searched via their radical combinations.

Description

technical field [0001] The present invention represents a basic character with a prime number, represents a character string with a prime number product value, and performs a division operation on the prime number product value with a prime number or the product of several prime numbers. It is a database retrieval technology that the character string represented by these prime numbers contains a certain character or several characters corresponding to these prime numbers. The main purpose is to achieve "retrieval of any Chinese character with any combination of radicals at any level", and it can also be used to improve the fuzzy retrieval of character strings in general databases. Background technique [0002] Complicated and difficult Chinese characters can not be imported with common font codes such as Wubi fonts, and usually adopt the Quanpin input method or internal code input, which is inefficient, and no effective solution has been seen so far. [0003] On the other h...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/22G06F17/30G06F40/00
CPCG06F17/2217G06F17/30634G06F16/33G06F40/126
Inventor 徐文新
Owner 徐文新