Text recognition method based on tightly coupled optical character recognition and error correction processing

A text recognition technique in the field of OCR that addresses the low recognition accuracy of existing methods and their difficulty in handling complex recognition scenarios. It offers a wide application range, a mature model, and accurate recognition results.

Pending Publication Date: 2020-04-24
厦门商集网络科技有限责任公司 +1

AI Technical Summary

Problems solved by technology

[0003] In the prior art, the two stages of deep-learning-based OCR, recognition and error correction, are loosely coupled. As shown in Figure 2, the OCR recognition module outputs a text string, and the text error correction module takes that string as input and corrects possible recognition errors. Apart from this hand-off of the output string, there is no other connection between the two modules. This loose coupling limits recognition accuracy and makes the approach ill-suited to complex recognition scenarios.
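The difference between the two couplings can be sketched as a difference in what the correction stage is allowed to see. This is only an illustrative sketch: the names `ProbMatrix`, `Recognizer`, `loosely_coupled`, and `tightly_coupled` are hypothetical and do not come from the patent, and the recognizer and corrector are passed in as plain callables rather than real models.

```python
from typing import Callable, List, Tuple

# Hypothetical type aliases for illustration only.
ProbMatrix = List[List[float]]  # one row of character probabilities per time step
Recognizer = Callable[[bytes], Tuple[str, ProbMatrix]]


def loosely_coupled(image: bytes, recognize: Recognizer,
                    correct: Callable[[str], str]) -> str:
    """Loose coupling: the corrector sees only the recognized string."""
    text, _probs = recognize(image)  # the probability matrix is discarded
    return correct(text)


def tightly_coupled(image: bytes, recognize: Recognizer,
                    correct: Callable[[str, ProbMatrix], str]) -> str:
    """Tight coupling: the corrector also receives the recognizer's
    internal character probabilities."""
    text, probs = recognize(image)
    return correct(text, probs)
```

In the loosely coupled form, any uncertainty the recognizer had about a character is lost before correction begins; in the tightly coupled form it remains available to the corrector.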


Examples


Embodiment 1

[0069] As shown in Figure 1, the text recognition method based on tightly coupled optical character recognition and error correction processing includes the following steps:

[0070] S1: Input the text image to be recognized.

[0071] S2: Receive the text image, perform optical character recognition on it with a neural network recognition model, and output the recognized text together with a character probability matrix. The character probability matrix records the probability of each character at each time step; it is auxiliary information generated while the neural network recognition model recognizes the text.

[0072] The neural network model is a CRNN character recognition model. The character probability matrix it outputs gives, for each position as the model recognizes characters in sequence, the predicted probability of each character that may appear at that position, that is, the character pred...
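To make the character probability matrix from step S2 concrete, the following minimal sketch shows a toy matrix (rows are time steps, columns are characters) and how the recognized text can be read off it. It assumes CTC-style output with a blank symbol, which is common for CRNN models but is not stated in the text above; the alphabet, the probabilities, and the function name `greedy_decode` are invented for illustration.

```python
BLANK = "-"                          # assumed CTC blank symbol
ALPHABET = [BLANK, "a", "c", "t"]    # toy alphabet, illustration only

# Toy character probability matrix: one row per time step,
# one column per entry in ALPHABET.
PROBS = [
    [0.1, 0.1, 0.7, 0.1],  # most likely "c"
    [0.1, 0.1, 0.7, 0.1],  # "c" again (a repeat, collapsed below)
    [0.6, 0.2, 0.1, 0.1],  # blank
    [0.1, 0.6, 0.1, 0.2],  # most likely "a"
    [0.1, 0.1, 0.1, 0.7],  # most likely "t"
]


def greedy_decode(probs, alphabet):
    """Take the most likely character at each time step, then collapse
    consecutive repeats and drop blanks (CTC best-path decoding)."""
    out = []
    prev = None
    for row in probs:
        idx = max(range(len(row)), key=row.__getitem__)
        if idx != prev and alphabet[idx] != BLANK:
            out.append(alphabet[idx])
        prev = idx
    return "".join(out)


print(greedy_decode(PROBS, ALPHABET))  # prints "cat"
```

Greedy decoding keeps only the single best character per step. The remaining entries in each row are the auxiliary information that a tightly coupled correction stage can still draw on.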


Abstract

The invention relates to a text recognition method based on tightly coupled optical character recognition and error correction processing. In this method, a text image is recognized by a neural network model; using the internal information generated during recognition, an optimal candidate text sentence is selected through beam search transcription and lexicon selection; error correction is then performed by a neural network model, and a more accurate text recognition result is output. Because optical character recognition and text error correction are tightly coupled, the method improves error correction performance and text recognition accuracy compared with existing loosely coupled text recognition methods.
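The candidate selection step in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented procedure: it assumes CTC-style output with a blank at index 0, uses a standard CTC prefix beam search to produce several candidate transcriptions from the probability matrix, and models "lexicon selection" as simply preferring the best-scoring candidate found in a word list. The names `ctc_beam_search` and `select_with_lexicon`, the alphabet, the probabilities, and the lexicon are all hypothetical. The subsequent neural error correction step is not shown.

```python
from collections import defaultdict

BLANK = 0  # assumed index of the CTC blank symbol


def ctc_beam_search(probs, alphabet, beam_width=3):
    """Return up to beam_width (text, probability) candidates, best first.
    Each prefix tracks (prob. ending in blank, prob. ending in non-blank)."""
    beams = {(): (1.0, 0.0)}
    for row in probs:
        nxt = defaultdict(lambda: [0.0, 0.0])
        for prefix, (p_b, p_nb) in beams.items():
            for c, p in enumerate(row):
                if c == BLANK:
                    nxt[prefix][0] += (p_b + p_nb) * p
                elif prefix and prefix[-1] == c:
                    nxt[prefix][1] += p_nb * p          # repeat collapses
                    nxt[prefix + (c,)][1] += p_b * p    # new char after blank
                else:
                    nxt[prefix + (c,)][1] += (p_b + p_nb) * p
        ranked = sorted(nxt.items(), key=lambda kv: sum(kv[1]), reverse=True)
        beams = {k: tuple(v) for k, v in ranked[:beam_width]}
    return [("".join(alphabet[i] for i in prefix), p_b + p_nb)
            for prefix, (p_b, p_nb) in beams.items()]


def select_with_lexicon(candidates, lexicon):
    """Prefer the best-scoring candidate that is in the lexicon;
    fall back to the top candidate if none matches."""
    for text, _score in candidates:
        if text in lexicon:
            return text
    return candidates[0][0]


alphabet = ["-", "a", "c", "o", "t"]
probs = [
    [0.05, 0.00, 0.95, 0.00, 0.00],  # "c"
    [0.00, 0.45, 0.00, 0.55, 0.00],  # "o" slightly ahead of "a"
    [0.00, 0.00, 0.00, 0.00, 1.00],  # "t"
]
candidates = ctc_beam_search(probs, alphabet)
best = select_with_lexicon(candidates, lexicon={"cat", "car"})
print(candidates[0][0], "->", best)  # prints "cot -> cat"
```

Here the top beam-search candidate is "cot", but the runner-up "cat" is in the toy lexicon and is selected instead. A correction stage that only received the string "cot" would not know how close "cat" was.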

Description

Technical field

[0001] The invention relates to a text recognition method based on tightly coupled optical character recognition and error correction processing, and belongs to the field of OCR recognition.

Background

[0002] With advances in information processing technology in recent years, the performance of deep-learning-based optical character recognition (OCR) systems for text localization and text recognition has improved greatly. In some fields, text recognition accuracy approaches that of manual recognition, which enables applications in many scenarios, such as ID card recognition and license plate recognition. OCR technology also plays an important role in some commercial applications, such as bill reimbursement and bank transactions. OCR requires error correction of the recognition results to ensure their correctness, and automatic machine text error correction is an important way to do this. At p...


Application Information

Patent Type & Authority: Applications (China)
IPC (8): G06K9/20, G06K9/34, G06F40/289, G06F40/30, G06K9/62, G06N3/04, G06N3/08
CPC: G06N3/08, G06V10/22, G06V10/267, G06N3/044, G06N3/045, G06F18/24
Inventor: 韦建, 周异, 陈凯, 何建华
Owner: 厦门商集网络科技有限责任公司