
Chinese error correction method, device and equipment

A Chinese error correction technology, applied in special data processing applications, instruments, electrical digital data processing, etc., that addresses the low accuracy of existing Chinese error correction and improves that accuracy.

Pending Publication Date: 2019-09-24
SHANGHAI INST OF TECH

AI Technical Summary

Problems solved by technology

Chinese has more types of spelling mistakes than other languages, because many Chinese characters have similar shapes or the same pronunciation, and some are similar in both shape and pronunciation. As a result, the accuracy of Chinese error correction in existing solutions is low.



Examples


Detailed Description of the Embodiments

[0041] In order to make the above objects, features and advantages of the present invention more comprehensible, the present invention will be further described in detail below in conjunction with the accompanying drawings and specific embodiments.

[0042] As shown in Figure 1, the present invention provides a method for Chinese error correction, including:

[0043] Step 1: Obtain the target short sentence to be corrected.

[0044] Step 2: Perform word segmentation on the target short sentence to obtain a word segmentation sequence of several words. Each word in the sequence is then treated, in turn, as the target word to be corrected.

[0045] Here, in one embodiment of the present invention, Python's jieba word segmenter is used. It first matches the character string to be segmented against its own dictionary and returns any word found there; it then uses an HMM model to perform an algorit...
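The dictionary-matching stage described in [0045] can be illustrated with a small sketch. This is not jieba's actual algorithm and not the patent's implementation: it uses plain forward maximum matching as a stand-in, and the names `segment`, `TOY_DICT` and `max_len` are hypothetical. The HMM stage for words missing from the dictionary is left out; here unknown characters are simply emitted one at a time.

```python
def segment(text, dictionary, max_len=4):
    """Forward maximum matching: at each position take the longest dictionary word.

    Characters that start no dictionary word are emitted as single-character
    tokens (a real segmenter would hand these to a model such as an HMM).
    """
    words = []
    i = 0
    while i < len(text):
        # Try the longest window first and shrink until a dictionary word matches.
        for size in range(min(max_len, len(text) - i), 0, -1):
            piece = text[i:i + size]
            if size == 1 or piece in dictionary:
                words.append(piece)
                i += size
                break
    return words


# Toy dictionary, invented for illustration only.
TOY_DICT = {"我们", "喜欢", "学习", "中文"}
tokens = segment("我们喜欢学习中文", TOY_DICT)
```

With the jieba package installed, the equivalent step would likely be a single call such as `jieba.lcut(text, HMM=True)`, which returns the segmented words as a list.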


Abstract

The invention provides a Chinese error correction method, device and equipment. The method includes: obtaining a target short sentence to be corrected; performing word segmentation on the target short sentence to obtain the corresponding original word sequence; inputting the words of the word sequence into a grammar language model to obtain a score for each word; replacing each low-scoring word, one by one, with entries from a confusion set to generate an error correction candidate set; computing a score for each error correction word sequence in the candidate set using bigram and trigram grammars; and taking the error correction word sequence with the maximum score as the error correction result for the target short sentence, thereby improving the accuracy of Chinese error correction.
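The pipeline in the abstract can be sketched end to end on toy data. Everything below is an assumption made for illustration: the corpus, the confusion set, the add-one smoothing, the use of a unigram probability as the per-word score, the `min_prob` threshold, and all function names. Candidates are scored by summing log-probabilities over two-word and three-word n-grams, which is one plausible reading of the scoring step, not the patent's confirmed formula.

```python
import math
from collections import Counter
from itertools import product

# Toy corpus of already-segmented sentences (hypothetical data).
CORPUS = [
    ["我们", "喜欢", "学习", "中文"],
    ["我们", "学习", "中文"],
    ["他们", "喜欢", "中文"],
]
# Toy confusion set: misspelled word -> similar-looking or similar-sounding words.
CONFUSION = {"中问": ["中文"], "喜换": ["喜欢"]}


def ngrams(words, n):
    """All n-grams of a sentence, padded with start and end markers."""
    padded = ["<s>"] * (n - 1) + list(words) + ["</s>"]
    return [tuple(padded[i:i + n]) for i in range(len(padded) - n + 1)]


COUNTS = {n: Counter(g for s in CORPUS for g in ngrams(s, n)) for n in (1, 2, 3)}
VOCAB = len(COUNTS[1])


def log_prob(gram):
    """Add-one smoothed log P(last word | preceding words)."""
    n = len(gram)
    context = sum(c for g, c in COUNTS[n].items() if g[:-1] == gram[:-1])
    return math.log((COUNTS[n][gram] + 1) / (context + VOCAB))


def sequence_score(words):
    """Sum of 2-gram and 3-gram log-probabilities over the whole sequence."""
    return sum(log_prob(g) for n in (2, 3) for g in ngrams(words, n))


def correct(words, min_prob=0.08):
    """Replace low-scoring words from the confusion set and keep the best sequence."""
    options = []
    for w in words:
        # Per-word score: smoothed unigram probability (an assumption of this sketch).
        low = math.exp(log_prob((w,))) < min_prob
        options.append([w] + CONFUSION.get(w, []) if low else [w])
    candidates = [list(c) for c in product(*options)]
    return max(candidates, key=sequence_score)


fixed = correct(["我们", "喜欢", "学习", "中问"])
```

The original sequence is always kept among the candidates, so a sentence with no low-scoring words is returned unchanged. The threshold `min_prob` only makes sense for this tiny corpus and would need tuning on real data.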

Description

Technical field

[0001] The invention relates to a method, device and equipment for Chinese error correction.

Background

[0002] With the rapid development of artificial intelligence, language recognition, and optical character recognition, Chinese automatic proofreading technology has also been greatly developed. Pinyin error correction technology has played an important role in promoting the development of Chinese automatic proofreading technology. Spell checking is a common task in every written language, and it can be used to automatically detect and correct human spelling mistakes. Automatic spelling correction has been around since the 1960s. A spell checking system should have both error detection and error correction capabilities. Error detection refers to detecting various types of misspellings in text, and error correction refers to further correcting detected wrong characters. In recent decades, Chinese as a foreign language has been flourishing. In...


Application Information

IPC(8): G06F17/27
CPC: G06F40/216, G06F40/253, G06F40/289
Inventor: 华容俞涛
Owner: SHANGHAI INST OF TECH