Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

A Novel Text Recognition Method Based on Counting Focusing Model

A text recognition and model technology, applied in the field of optical character recognition, can solve the problem of complex module design for focus weight calculation, and achieve the effects of simplified design and low code implementation requirements

Active Publication Date: 2021-11-02
SUN YAT SEN UNIV
View PDF6 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0009] The present invention solves the technical defect that the prior art does not assume the relative position of successive focus positions, and needs to allow the model to learn to focus from left to right or from top to bottom during the training process, resulting in a complicated training process, and to calculate the focus weight The design of the module is too complicated for technical defects, and a new text recognition method based on focus weight is provided.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A Novel Text Recognition Method Based on Counting Focusing Model
  • A Novel Text Recognition Method Based on Counting Focusing Model

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0034] The overall framework of the counting focus model is the same as the previous focus model, and consists of two parts: the encoder (decoder) based on convolutional neural network (CNN) extracts high-level features from the input image to obtain a high-level feature map (featuremap); The decoder of the long short-term memory network (LSTM) and the attention mechanism (Attention Mechanism) decodes the characters from left to right in sequence from the high-level feature map. Specific as figure 1 shown.

[0035] The encoder uses a common CNN, and the process of extracting high-level features to obtain a high-level feature map has no improvement compared with the prior art. The main improvement of the recognition method provided by the present invention lies in the calculation process of the decoder, such as figure 2 As shown, the calculation process of the decoder is as follows:

[0036] S21. Segment the high-level feature map from left to right along the horizontal dime...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The present invention relates to a novel text recognition method based on a counting focus model, the counting focus model includes an encoder and a decoder, and the recognition method includes the following steps: S1. Using an encoder based on a convolutional neural network to process an input image The high-level features are extracted to obtain the high-level feature map; S2. The decoder based on the long-term short-term memory network and the focusing mechanism decodes the characters from left to right in sequence from the high-level feature map.

Description

technical field [0001] The invention belongs to the field of optical character recognition, and more specifically relates to a novel text recognition method based on a counting focus model. Background technique [0002] OCR single-line text recognition is the process of recognizing the text content of an input image containing a single-line text. One of the mainstream models currently used on this task is the attention / focus model (Attention Model), and its recognition process is: [0003] 1) First use the convolutional neural network (CNN) to extract the high-level feature map (feature map) of the input image; [0004] 2) Use the long-term short-term memory network (LSTM) to "attend" the high-level feature map multiple times, and calculate the attention weights (attention weights); [0005] 3) Use the focus weight to perform weighted average of the high-level feature maps, and predict the text characters that need to be output at the current step (step) according to the o...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06K9/20G06K9/34
CPCG06V10/22G06V30/153
Inventor 郑华滨潘嵘
Owner SUN YAT SEN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products