Text information processing method and device

An information processing method and text technology, applied in the field of text information processing methods and devices, can solve problems such as low accuracy, and achieve the effect of improving accuracy
CN110765996AActive Publication Date: 2020-02-07BEIJING BAIDU NETCOM SCI & TECH CO LTD

Patent Information

Authority / Receiving Office
CN · China
Current Assignee / Owner
BEIJING BAIDU NETCOM SCI & TECH CO LTD
Publication Date
2020-02-07

Smart Images

  • Figure 1
    Figure 1
  • Figure 2
    Figure 2
  • Figure 3
    Figure 3
Patent Text Reader

Abstract

The embodiment of the invention discloses a text information processing method and device, and relates to the field of cloud computing. A specific embodiment of the method comprises the steps of identifying a to-be-processed text from an image comprising the to-be-processed text; inputting the to-be-processed text into a pre-trained recurrent neural network language model, and identifying wronglywritten characters in the to-be-processed text; inputting the wrongly written characters in the to-be-processed text into a pre-trained text error correction model to obtain similar characters corresponding to the wrongly written characters; and determining correct characters corresponding to the wrongly written characters in the similar characters by utilizing the text error correction model based on the coherence of the to-be-processed text, and replacing the wrongly written characters with the correct characters to obtain an error correction text of the to-be-processed text. The wrongly written characters are recognized through the pre-trained recurrent neural network language model, and the correct characters of the wrongly written characters are obtained through the pre-trained text error correction model, so that the error correction text is obtained, and the accuracy of a recognition result is improved.
Need to check novelty before this filing date? Find Prior Art

Description

technical field

[0001] The embodiments of the present application relate to the field of computer technology, and in particular to a text information processing method and device. Background technique

[0002] With the development of computer technology, OCR (Optical Character Recognition, Optical Character Recognition) character recognition technology is widely used in various fields. OCR text recognition technology can convert image information into text information, and then the machine performs semantic analysis and intention recognition on the text through natural language processing technology.

[0003] At present, the OCR character recognition technology is very mature for printed text recognition, and the accuracy can reach more than 90%. However, for the recognition of handwritten text, the existing OCR character recognition technology has low accuracy.

[0004] In the prior art, the correction of the recognition result obtained by recognizing the handwritten text ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More