Text error correction method and device and terminal

A technique for correcting wrongly written words in text, applicable to special data processing applications, instruments, electrical digital data processing and similar fields. It addresses the high cost, inconvenience, and heavy time and labor involved in building confusion sets, and aims to improve correction accuracy.

Active Publication Date: 2017-03-22
SHANGHAI XIAOI ROBOT TECH CO LTD

AI Technical Summary

Problems solved by technology

Error detection and correction usually rely on a confusion set, but building and manually maintaining a confusion set takes a lot of time, which makes the approach costly and inconvenient to use.

Method used

The corpus to be corrected is segmented into single-character strings and word strings. At least some of the single-character strings are combined into wrong-word candidate words, the candidates and word strings that share the same pinyin are grouped into wrong-word candidate classes, and a recommended word is selected in each class according to word-formation probability.

Examples

Embodiment Construction

[0035] As noted in the background section, the prior art requires error detection and error correction for substitution errors between similar-sounding words. Detection and correction are usually based on a confusion set, but building and manually maintaining a confusion set takes a lot of time, which makes the approach costly and inconvenient to use.

[0036] When a text contains a substitution error between similar-sounding words, word segmentation splits the wrong word into several single characters. The technical solution of the present invention therefore combines at least some of the single-character strings obtained by word segmentation into multiple wrong-word candidate words. Candidates and word strings with the same pinyin can then be grouped into wrong-word candidate classes, and a recommended word is selected in each class based on word-formation probability. Furthermore, it is easy and effective to automatically fi...
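The combination step in paragraph [0036] can be sketched roughly as follows. This is a minimal illustration, not the patent's implementation: the function name `candidate_words`, the `max_len` limit, and the rule of combining only adjacent single characters are all assumptions made for the example.

```python
from typing import List


def candidate_words(tokens: List[str], max_len: int = 3) -> List[str]:
    """Combine runs of adjacent single-character tokens into candidate words.

    Assumption: a wrong word shows up as consecutive one-character tokens,
    so every contiguous combination of 2..max_len characters inside such a
    run is emitted as a candidate.
    """
    candidates: List[str] = []
    run: List[str] = []
    # The empty-string sentinel flushes a run that ends the sentence.
    for tok in tokens + [""]:
        if len(tok) == 1:
            run.append(tok)
            continue
        for size in range(2, max_len + 1):
            for start in range(len(run) - size + 1):
                candidates.append("".join(run[start:start + size]))
        run = []
    return candidates


# Hypothetical segmentation of a sentence containing the wrong word "拖运".
print(candidate_words(["我", "要", "拖", "运", "行李"]))
```

Capping the candidate length keeps the number of combinations manageable; the excerpt does not say which limit, if any, the patent applies.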

Abstract

The invention discloses a text error correction method, device and terminal. The text error correction method comprises: performing word segmentation on the corpus to be corrected to obtain single-character strings and word strings; combining at least some of the single-character strings to obtain a plurality of wrong-word candidate words; classifying the wrong-word candidate words and word strings that have the same pinyin into the same wrong-word candidate class; and, in each wrong-word candidate class, selecting a recommended word according to the word-formation probability of each wrong-word candidate word and each word string, thereby performing text error correction. The technical scheme makes correcting similar-sounding word errors in text more convenient and more effective.
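The grouping and selection steps of the abstract might look like the sketch below. Everything concrete in it is an assumption for illustration: the tiny `PINYIN` table, the made-up `WORD_PROB` values, and the helper names `pinyin_key`, `build_classes` and `recommend`. The excerpt does not state how word-formation probability is computed, so it is passed in as a plain lookup table.

```python
from collections import defaultdict
from typing import Dict, List, Tuple

# Toy character-to-pinyin table (tones omitted); a real system would use a
# full pronunciation dictionary.
PINYIN = {"拖": "tuo", "托": "tuo", "运": "yun", "行": "xing", "李": "li"}

# Made-up word-formation probabilities, for illustration only.
WORD_PROB = {"托运": 0.9, "拖运": 0.1}


def pinyin_key(word: str) -> Tuple[str, ...]:
    """Pinyin of a word as a tuple; unknown characters map to themselves."""
    return tuple(PINYIN.get(ch, ch) for ch in word)


def build_classes(strings: List[str]) -> Dict[Tuple[str, ...], List[str]]:
    """Group candidate words and word strings that share the same pinyin."""
    classes: Dict[Tuple[str, ...], List[str]] = defaultdict(list)
    for s in strings:
        key = pinyin_key(s)
        if s not in classes[key]:
            classes[key].append(s)
    return dict(classes)


def recommend(classes: Dict[Tuple[str, ...], List[str]],
              word_prob: Dict[str, float]) -> Dict[Tuple[str, ...], str]:
    """Pick the member with the highest word-formation probability per class."""
    return {key: max(members, key=lambda w: word_prob.get(w, 0.0))
            for key, members in classes.items()}


classes = build_classes(["拖运", "托运", "行李"])
print(recommend(classes, WORD_PROB)[("tuo", "yun")])
```

Every other member of a class would then be rewritten to the recommended word, which is how a class takes the place of a hand-built confusion set in this sketch.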

Description

Technical field

[0001] The present invention relates to the field of natural language processing, and in particular to a text error correction method, device and terminal.

Background technology

[0002] Text error correction is one of the hard problems in natural language processing. Chinese text errors mainly include substitution errors, extra-character errors and missing-character errors. With the widespread use of pinyin input methods, substitution errors between similar-sounding words are common in text data; for example, "checked luggage" is mistakenly written as "dragged luggage". A wrong word usually leads directly to a word segmentation error, and segmentation errors garble the semantics of the text and make it harder to process.

[0003] In the prior art, substitution errors between similar-sounding words require error detection and error correction. Usually, error detection and correction are performed ...
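The claim in paragraph [0002] that a wrong word leads to a segmentation error can be illustrated with a small dictionary-based segmenter. The sketch uses forward maximum matching as a stand-in; the excerpt does not say which segmenter the patent uses. The Chinese strings 托运行李 and 拖运行李 are assumed originals of the translated "checked luggage" and "dragged luggage" example, and the vocabulary is a toy one.

```python
from typing import List, Set


def fmm_segment(text: str, vocab: Set[str], max_word_len: int = 4) -> List[str]:
    """Forward maximum matching: take the longest vocabulary word at each
    position, falling back to a single character when nothing matches."""
    tokens: List[str] = []
    i = 0
    while i < len(text):
        for size in range(min(max_word_len, len(text) - i), 0, -1):
            piece = text[i:i + size]
            if size == 1 or piece in vocab:
                tokens.append(piece)
                i += size
                break
    return tokens


vocab = {"托运", "行李"}  # toy vocabulary, assumed for the example
print(fmm_segment("托运行李", vocab))  # correct word stays whole
print(fmm_segment("拖运行李", vocab))  # wrong word falls apart into characters
```

With this vocabulary the wrongly written word is not found, so it is emitted as single characters while the correctly written word survives as one token.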

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/27
CPC: G06F40/284, G06F40/30
Inventors: 谢瑜, 张昊, 朱频频
Owner: SHANGHAI XIAOI ROBOT TECH CO LTD