Text error correction method, electronic equipment and computer readable storage medium

A text error correction and target technology, applied in the computer field, can solve the problems of wrong error correction results, low accuracy of final error correction results, and high cost of text error correction

Pending Publication Date: 2021-12-03
上海携宁计算机科技股份有限公司
View PDF1 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] However, the inventor found that in general text error correction, such as the technical solution disclosed in the patent application number "CN202010164805.5", most of the texts to be processed are first segmented according to the granularity of words, and then the results after segmentation Perform error correction according to word granularity. The error correction result of this method depends on the segmentation result, and the segmentation result depends on the segmentation method. If the segmentation method is not suitable, it will lead to errors in the final error correction result; and , segmenting according to word granularity depends on the corresponding vocabulary dictionary, so the vocabulary dictionary needs to be maintained for a long time, which leads to high cost of text error correction; at the same time, traditional scoring rules usually use edit distance for calculation, and edit distance Generally, the cost of replacement, addition, and deletion is set to 1, and it is impossible to distinguish approximately wrong inputs, which leads to a relatively low accuracy of the final error correction result

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text error correction method, electronic equipment and computer readable storage medium
  • Text error correction method, electronic equipment and computer readable storage medium
  • Text error correction method, electronic equipment and computer readable storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0028] In order to make the purpose, technical solutions and advantages of the embodiments of the present invention more clear, various implementation modes of the present invention will be described in detail below in conjunction with the accompanying drawings. However, those of ordinary skill in the art can understand that, in each implementation manner of the present invention, many technical details are provided for readers to better understand the present application. However, even without these technical details and various changes and modifications based on the following implementation modes, the technical solution claimed in this application can also be realized. The division of the following embodiments is for the convenience of description, and should not constitute any limitation to the specific implementation of the present invention, and the various embodiments can be combined and referred to each other on the premise of no contradiction.

[0029] In the technical...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The embodiment of the invention relates to the technical field of computers, and discloses a text error correction method, electronic equipment and a computer readable storage medium. The method includes segmenting a vocabulary to be corrected according to the word granularity to obtain a plurality of retrieval segments, with the types of the retrieval fragments being a single letter or Chinese character pinyin; in a preset index lexical element set, determining a target index lexical element consistent with the retrieval fragment, with the types of the index lexical elements in the index lexical element set comprising single letters and Chinese character pinyin; retrieving in a preset index according to the target index lexical element to obtain a plurality of proper nouns consistent with the target index lexical element in sequence as candidate words; calculating an editing distance according to the word frequency of the vocabulary to be corrected and the word frequency of the candidate words, and scoring the candidate words to obtain scores corresponding to the candidate words; utilizing the candidate word with the highest score as an error correction result to replace the vocabulary to be corrected. The cost of text error correction can be remarkably reduced, the accuracy of text error correction is greatly improved, and meanwhile, the precision of text error correction is improved.

Description

technical field [0001] The embodiments of the present application relate to the field of computer technology, and in particular to a text error correction method, electronic equipment, and a computer-readable storage medium. Background technique [0002] Natural language processing is an important direction in the field of computer science and artificial intelligence. Natural language processing is a science that integrates linguistics, computer science, and mathematics. It can realize effective communication between humans and computers using natural language. Natural language processing technology is mainly used in machine translation, public opinion monitoring, automatic summarization, opinion extraction, text recognition, text semantic comparison, text error correction, Chinese optical character recognition (Optical Character Recognition, referred to as: OCR), etc., among which, Text error correction refers to the process of correcting erroneous content in the text. Text...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F40/232G06F40/289G06F40/242G06F16/33G06F16/31
CPCG06F40/232G06F40/289G06F40/242G06F16/334G06F16/319
Inventor 张浩波
Owner 上海携宁计算机科技股份有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products