Character recognition method and system based on attention mechanism

A text recognition, attention technology, applied in character recognition, character and pattern recognition, computer parts and other directions, can solve the problem of forming noise area, limited attention area, attention drift and so on

Pending Publication Date: 2020-10-16
厦门商集网络科技有限责任公司 +1
View PDF0 Cites 10 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] However, the existing deep learning model based on the attention mechanism has two defects: (1) Due to the limited focus area of ​​the attention on the feature map, the area that has not been paid attention to during the training phase will form a noise area in the feature map
The attention generated by the attention module is easily disturbed by the noise area, and cannot be well focused on the area where the text is located, re

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Character recognition method and system based on attention mechanism
  • Character recognition method and system based on attention mechanism
  • Character recognition method and system based on attention mechanism

Examples

Experimental program
Comparison scheme
Effect test

Example Embodiment

[0069] Example 1

[0070] like figure 1 As shown, a text recognition method based on attention mechanism includes the following steps:

[0071] S1: Build a text recognition model for recognizing text in an image; the text recognition model consists of the following modules:

[0072] Convolutional neural network for extracting feature maps of input images;

[0073] An attention mechanism module, including a sequence encoder, a forward sequence decoder and a reverse sequence decoder, for encoding and decoding the feature map, and outputting the feature vector of the predicted character;

[0074] a character decoding layer, for compiling the feature vector of the predicted character into a text recognition result, and compiling the feature map into a feature map character probability vector;

[0075] S2: construct a training sample set, the training sample set includes a training image and an image annotation corresponding to the training image, wherein the image annotation is...

Example Embodiment

[0083] Embodiment 2

[0084] like figure 1 As shown, a text recognition method based on attention mechanism includes the following steps:

[0085] S1: Build a text recognition model for recognizing text in an image; the text recognition model is composed of a convolutional neural network, an attention mechanism module, and a character decoding layer, wherein the attention mechanism module includes a sequence encoder, a positive Forward Sequence Decoder and Reverse Sequence Decoder.

[0086] In the step S1, the convolutional neural network includes a multi-layer convolution filter bank and a pooling sub-module, the convolution filter bank adopts a residual structure, and the character decoding layer is fully connected by a multi-layer neural network. The multi-layer convolution filter bank extracts image features, the pooling sub-module changes the feature map resolution, and the output of the convolutional neural network is a feature map with a certain number of channels.

...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a character recognition method and system based on an attention mechanism, and relates to a deep learning and image processing technology. According to the method, a convolutional neural network and a linguistic module based on an attention mechanism are used as the backbone of a deep learning model, a customized loss function is used to reinforce feature map extraction, the model is guided to learn to distinguish a foreground and a background during training, and forward and reverse bidirectional decoders are introduced to perform bidirectional decoding on characters.The method and the system are high in anti-interference capability, attention drifting can be reduced, and meanwhile, the situation that final recognition fails due to the fact that the first character is difficult to recognize during the forward decoding of the model can be avoided.

Description

technical field [0001] The present invention relates to deep learning and image processing technology, in particular to a text recognition method and system based on an attention mechanism. Background technique [0002] There are many existing text recognition technologies, including traditional OCR recognition methods and methods based on deep learning. The method based on deep learning inputs a large number of manually labeled image and text samples into the designed neural network, so that the parameters in the neural network can be trained to fit the mapping relationship between the image and the text, and then complete the recognition task. The methods of deep learning are mainly divided into methods based on attention mechanism and methods based on CTC. Among them, the attention mechanism in deep learning (https: / / blog.csdn.net / hpul fc / article / details / 80448570) is essentially similar to the selective visual attention mechanism of human beings. Select the information ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06K9/00G06K9/46G06K9/62G06N3/04
CPCG06V30/40G06V10/40G06V30/10G06N3/048G06N3/045G06F18/214
Inventor 顾澄宇王士林陈凯周异何建华
Owner 厦门商集网络科技有限责任公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products