Chinese term automatic correction method in input process

An automatic error correction technique for Chinese terms entered during the input process. It addresses the problem that existing approaches do not exploit the working characteristics of the input method software.

Inactive Publication Date: 2010-02-03
XI AN JIAOTONG UNIV
Cites: 0; Cited by: 30

AI Technical Summary

Problems solved by technology

[0016] 3. Existing patents correct errors in units of individual Chinese characters and do not exploit the working characteristics of the input method software the user is using. This method incorporates those characteristics into error correction.


Embodiment Construction

[0144] For a clearer understanding, the present invention is described in further detail below with reference to the accompanying drawings.

[0145] 5.1 Chinese term auto-correction mechanism combining completion and error correction

[0146] The present invention automatically corrects Chinese terms input by users through a mechanism that combines term completion with term error correction; the principle is shown in Figure 1. The idea of this mechanism is to "correct" both legal and illegal user input and to prompt the user with a list of suggestions. The specific working mechanism is as follows:

[0147] I. The user's input is treated as a "word bag". Each element of the word bag is called a "word", which corresponds to "Term" in Figure 1. A term takes one of three forms: a string of Chinese characters, a string of letters, or a mixed string of letters and Chinese characters. It is the basic unit of error correc...
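The three term forms described above can be illustrated with a minimal sketch. It assumes that terms are separated by whitespace and that "Chinese character" means a code point in the CJK Unified Ideographs block; neither assumption comes from the patent text, and the function names `classify_term` and `to_word_bag` are hypothetical.

```python
import re

# Assumption: Chinese characters are those in the basic CJK Unified Ideographs block.
HAN = re.compile(r"[\u4e00-\u9fff]")
LETTER = re.compile(r"[A-Za-z]")


def classify_term(term: str) -> str:
    """Label a term as 'hanzi', 'letters', 'mixed', or 'other'."""
    has_han = bool(HAN.search(term))
    has_letter = bool(LETTER.search(term))
    if has_han and has_letter:
        return "mixed"
    if has_han:
        return "hanzi"
    if has_letter:
        return "letters"
    return "other"


def to_word_bag(user_input: str) -> list[tuple[str, str]]:
    """Split the input on whitespace (an assumption) and label each term."""
    return [(term, classify_term(term)) for term in user_input.split()]


bag = to_word_bag("计算机 jisuanji 计suan机")
```

Here `bag` pairs each term with its form, so a later stage can decide per term how to handle it.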


Abstract

The invention discloses a method for automatically correcting Chinese terms during input, comprising the following steps: (1) each element input by the user is treated as a Term, which is a Chinese character string, a letter string, or a mixed string of Chinese characters and letters; (2) a Bloom filter is used to judge quickly whether the Term is a valid term; if it is, term completion is performed on it; otherwise, term correction is performed on it. Term completion offers completion prompts for a correct but short term; term correction corrects an erroneous input to obtain valid-term prompts close to the user's intention. Both term completion and term correction use term clustering and a Trie construction and search mechanism. For erroneous input, the original input is restored based on the operating characteristics of the various Pinyin input method software, and correction prompts are generated by an error correction mechanism designed for that original input.
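The dispatch described in the abstract (Bloom filter check, then completion or correction) can be sketched roughly as follows. This is an illustration under stated assumptions, not the patented implementation: the vocabulary `VOCAB`, the filter parameters, and all class and function names are hypothetical; term clustering and the restoration of the original Pinyin input are not modeled; and the correction branch uses Python's `difflib.get_close_matches` purely as a stand-in.

```python
import difflib
import hashlib

END = ""  # sentinel key marking the end of a term; never equal to a real character


class BloomFilter:
    """Small Bloom filter: may report false positives, never false negatives."""

    def __init__(self, size_bits: int = 1 << 16, num_hashes: int = 3):
        self.size = size_bits
        self.k = num_hashes
        self.bits = bytearray(size_bits // 8)

    def _positions(self, item: str):
        for i in range(self.k):
            digest = hashlib.sha256(f"{i}:{item}".encode("utf-8")).digest()
            yield int.from_bytes(digest[:8], "big") % self.size

    def add(self, item: str) -> None:
        for p in self._positions(item):
            self.bits[p // 8] |= 1 << (p % 8)

    def __contains__(self, item: str) -> bool:
        return all(self.bits[p // 8] & (1 << (p % 8)) for p in self._positions(item))


class Trie:
    """Character-level prefix tree built from nested dicts."""

    def __init__(self):
        self.root = {}

    def insert(self, term: str) -> None:
        node = self.root
        for ch in term:
            node = node.setdefault(ch, {})
        node[END] = True

    def complete(self, prefix: str) -> list[str]:
        node = self.root
        for ch in prefix:
            if ch not in node:
                return []
            node = node[ch]
        found, stack = [], [(node, prefix)]
        while stack:  # depth-first walk below the prefix node
            cur, text = stack.pop()
            for key, child in cur.items():
                if key == END:
                    found.append(text)
                else:
                    stack.append((child, text + key))
        return found


VOCAB = ["计算", "计算机", "计算机网络", "数据库"]  # hypothetical valid terms
bloom, trie = BloomFilter(), Trie()
for word in VOCAB:
    bloom.add(word)
    trie.insert(word)


def suggest(term: str) -> tuple[str, list[str]]:
    """Valid term: offer longer completions. Otherwise: offer close valid terms."""
    if term in bloom:
        return "completion", [t for t in trie.complete(term) if t != term]
    # Stand-in for the patent's correction mechanism (assumption).
    return "correction", difflib.get_close_matches(term, VOCAB, n=3, cutoff=0.6)
```

Calling `suggest("计算")` takes the completion branch and `suggest("计算极")` takes the correction branch. Because a Bloom filter can return false positives, a production design would need to tolerate an occasional invalid term reaching the completion branch; how the patent handles this is not stated in the text shown here.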

Description

[0001] Field

[0002] The invention relates to the technical field of automatic error correction of computer text, and in particular to an automatic error correction technology for Chinese terms during user input.

Background technique

[0003] With the increasingly widespread application of Internet technology, online learning has become one of the main means by which people acquire knowledge. This process requires users to input information through computers. For Chinese users, the input is generally a character string that is converted into a string of Chinese characters by Chinese input method software. Errors may arise both in what the user types and in the processing by the input method software, so automatic error correction technology is needed to deal with them. After a novelty search, the applicant retrieved three patents related to the present invention and belonged to the field of aut...


Application Information

Patent Type & Authority: Applications (China)
IPC: IPC(8): G06F17/28, G06F17/30, G06F3/023
Inventor: 刘均, 郑庆华, 沙莎, 郑成, 周正
Owner: XI AN JIAOTONG UNIV