Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Text error correction method, device and equipment and storage medium

A text error correction and text technology, applied in the field of text processing, can solve the problems of long time consumption, large search space, and inability to guarantee the correct rate of distinction, so as to improve the accuracy of distinction and reduce the amount of calculation

Inactive Publication Date: 2020-09-04
TENCENT TECH (SHENZHEN) CO LTD
View PDF6 Cites 10 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0002] In the process of text recognition, the candidate set for text error correction is generated by the full dictionary. When searching for candidate words, the full search will cause the search space to be too large and take a long time; In this case, the word vectors of words composed of different forms and near characters may be relatively close, and the accuracy of the distinction cannot be guaranteed

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text error correction method, device and equipment and storage medium
  • Text error correction method, device and equipment and storage medium
  • Text error correction method, device and equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0031] In order to make the purpose, technical solutions and advantages of the application clearer, the application will be further described in detail below in conjunction with the accompanying drawings. All other embodiments obtained under the premise of creative labor belong to the scope of protection of this application.

[0032] In the following description, references to "some embodiments" describe a subset of all possible embodiments, but it is understood that "some embodiments" may be the same subset or a different subset of all possible embodiments, and Can be combined with each other without conflict.

[0033] In the following description, the term "first\second\third" is only used to distinguish similar objects, and does not represent a specific ordering of objects. Understandably, "first\second\third" Where permitted, the specific order or sequencing may be interchanged such that the embodiments of the application described herein can be practiced in sequences oth...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The embodiment of the invention provides a text error correction method and device, equipment and a storage medium. The method comprises the steps of replacing at least one obfuscated character in a to-be-corrected text by adopting a preset obfuscated character library to obtain a first text set; in the first text set, candidate texts meeting preset conditions are determined; replacing at least one obfuscated character in the candidate text by adopting the preset obfuscated character library to obtain a second text set; according to the second text set, traversing a domain lexicon storing at least two words which are the same as the domain to which the text to be corrected belongs to obtain a target text matched with the second text; therefore, the text to be corrected is corrected by adopting the confused word stock and the domain word bank, and the domain proper nouns can be corrected, so that the accuracy of correcting the text is improved.

Description

technical field [0001] The present application relates to the technical field of text processing, in particular to a text error correction method, device, equipment and storage medium. Background technique [0002] In the process of text recognition, the candidate set for text error correction is generated by the full dictionary. When searching for candidate words, the full search will cause the search space to be too large and take a long time; In this case, the word vectors of words composed of different forms and near characters may be relatively close, and the accuracy of the distinction cannot be guaranteed. Contents of the invention [0003] The embodiment of the present application provides a text error correction method, device, equipment, and storage medium. By using the confusing word library and the domain thesaurus to correct the text to be corrected, it is possible to correct the domain proper nouns, thereby improving the accuracy of the text. The accuracy wi...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F40/232G06F40/242
CPCG06F40/232G06F40/242
Inventor 洪科元李斌章秦苏晨
Owner TENCENT TECH (SHENZHEN) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products