Method and device for error correction model training and text error correction

a text error correction and model training technology, applied in the field of information processing, can solve the problems of low error recall rate of text error correction program based on language rules, difficult to summarize language rules, error character strings, etc., and achieve the effect of improving the accuracy and comprehensiveness of existing text-processing methods

Inactive Publication Date: 2014-07-31
TENCENT TECH (SHENZHEN) CO LTD
View PDF8 Cites 54 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Benefits of technology

[0007]The present application provides a text-processing method and apparatus based on context information of a word in a sentence to improve upon the accuracy and comprehensiveness of existing text-processing methods.

Problems solved by technology

There are often error character strings, such as wrongly written or mispronounced characters and mis-spelled words, in the text used in daily work and life.
But due to the complex structure of language itself, it is not easy to summarize language rules, and there are often conflicts between different summarized language rules.
Therefore, the error recall rate of text error correction program based on language rules is low and the accuracy of error correction is also low.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and device for error correction model training and text error correction
  • Method and device for error correction model training and text error correction
  • Method and device for error correction model training and text error correction

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0021]Reference will now be made in detail to embodiments, examples of which are illustrated in the accompanying drawings. In the following detailed description, numerous specific details are set forth in order to provide a thorough understanding of the subject matter presented herein. But it will be apparent to one skilled in the art that the subject matter may be practiced without these specific details. In other instances, well-known methods, procedures, components, and circuits have not been described in detail so as not to unnecessarily obscure aspects of the embodiments.

[0022]In accordance with some embodiments of the present application, a text-processing program conducts the error correction processing according to the context information of a character string. Specifically, the program recognizes the error character strings appearing in some contexts by the similarity analysis of correct character strings and character strings to be processed with the same context informati...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

A computer-implemented method is performed at a device having one or more processors and memory storing programs executed by the one or more processors. The method comprises: selecting a target word in a target sentence; from the target sentence, acquiring a first sequence of words that precede the target word and a second sequence of words that succeed the target word; from a sentence database, searching and acquiring a group of words, each of which separates the first sequence of words from the second sequence of words in a sentence; creating a candidate sentence for each of the candidate words by replacing the target word in the target sentence with each of the candidate words; determining the fittest sentence among the candidate sentences according to a linguistic model; and suggesting the candidate word within the fittest sentence as a correction.

Description

RELATED APPLICATIONS[0001]This application is a continuation application of PCT Patent Application No. PCT / CN2013 / 086152, entitled “Method and Device for Error Correction Model Training and Text Error Correction” filed on Oct. 29, 2013, which claims priority to Chinese Patent Application No. 201310033697.8, “Method and Device for Error Correction Model Training and Text Error Correction”, filed on Jan. 29, 2013, both of which are hereby incorporated by reference in their entirety.FIELD OF THE INVENTION[0002]The present application relates to the technical field of information processing, especially relates to a method and device for error correction model training and text error correction.BACKGROUND OF THE INVENTION[0003]There are often error character strings, such as wrongly written or mispronounced characters and mis-spelled words, in the text used in daily work and life. How to recognize and correct the error character strings in the text by a computer is a technical problem to...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(United States)
IPC IPC(8): G06F17/21
CPCG06F17/21G06F40/232
Inventor LI, LOUCHENG, QIANGRAO, FENGLU, LIZHANG, XIANGYUE, SHUAICHEN, BO
Owner TENCENT TECH (SHENZHEN) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products