Pinyin error correction method and system

An error correction method and pinyin technology, applied in the field of data processing, can solve the problems of reducing the performance and applicability of the error correction system, increasing the difficulty of user selection, and low similarity of syllable strings, achieving high accuracy and saving search and traversal. Number of times, the effect of fast error correction

Active Publication Date: 2014-12-31
NEW SINGULARITY INT TECHN DEV
View PDF5 Cites 18 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Since there are 410 Chinese syllable strings and 3,500 commonly used Chinese characters, no matter in applications such as pinyin retrieval or pinyin input method word selection, if the error correction range is large, the error correction result set will increase accordingly. The smaller the similarity between the string to be input and the string actually input by the user, the lower the ranking in the suggested result set, which not only increases the difficulty of user selection, but also reduces the performance and applicability of the error correction system, for example : When the error correction range of the search string "xiamin" is determined to be single-character error correction (the pinyin string input by the user is added, deleted, or modified by one character), the result set is "xiami", "xiaming", "xiamen", " ximin", "xiaomin", "xiemin", etc., but if the error correction range is large, "xiaoming" or even "xiangming" will appear in the result set, so the similarity of the syllable strings that the user wants to retrieve is low, no doubt Reduce user experience

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Pinyin error correction method and system
  • Pinyin error correction method and system
  • Pinyin error correction method and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0052] Embodiments of the present invention are described in detail below, examples of which are shown in the drawings, wherein the same or similar reference numerals designate the same or similar elements or elements having the same or similar functions throughout. The embodiments described below by referring to the figures are exemplary only for explaining the present invention and should not be construed as limiting the present invention.

[0053] For the convenience of the present invention is correctly understood, at first several terms are defined as follows:

[0054] Syllable string: the syllable string in the present invention is a pinyin string composed of one or several phonemes according to the rules of grammar, and all syllable strings have Chinese characters corresponding to them, for example, the syllable string "chong" Corresponding to the Chinese character "重", the syllable string does not contain tones. For example, "chong" and "qing" are both a syllable strin...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a pinyin error correction method and system. The pinyin error correction method comprises the following steps: building a word bank, organizing all pinyin strings and reverse strings which can be retrieved by a user respectively, and storing the pinyin strings and the reverse strings in positive and negative ternary search trees; analyzing and detecting a retrieval string input by the user, judging whether the retrieval string can be split into a plurality of syllable strings or not, and acquiring the error type of the retrieval string, wherein the error type is classified into a legal error and an illegal error; performing legal error correction on a legal error retrieval string, performing illegal error correction on an illegal error retrieval string, and acquiring an error correction result by adopting inquiry operation of the positive and negative ternary search trees; computing the similarity of all pinyin strings and the retrieval string input by the user in the error correction result, acquiring K pinyin strings with highest similarity, and outputting the K pinyin strings. The pinyin error correction method provided by the invention has the advantages of simple and efficient design, high error correction speed and high accuracy.

Description

technical field [0001] The invention relates to the technical field of data processing, in particular to a pinyin error correction method and system with high speed and high accuracy. Background technique [0002] Pinyin error correction technology refers to detecting the pinyin character string input by the user, and modifying and optimizing the pinyin characters that are entered incorrectly or unreasonably, so as to ensure the output result. In search applications, the pinyin retrieval technology can effectively avoid the input method, realize the original pinyin retrieval, and change the search behavior to a certain extent; while in the input method application, the pinyin error correction technology can automatically identify and modify the wrong characters entered by the user string to ensure the correct output of Chinese characters and improve the fault tolerance of the input method. Pinyin error correction technology can effectively enhance application scalability an...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F16/322G06F16/374
Inventor 熊小鹏刘磊
Owner NEW SINGULARITY INT TECHN DEV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products