Character recognition system and method based on combination of neural network and attention mechanism

A neural network and text recognition technology, applied in the field of text and image recognition, can solve problems such as poor long-sequence text recognition, complex character backgrounds in natural scenes, and poor text and image recognition, and achieve long-term dependence. The effect of adaptability and good stability

Active Publication Date: 2019-02-26
CHONGQING UNIV OF POSTS & TELECOMM
View PDF8 Cites 74 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, due to many factors such as the position, deformation, and illumination of the characters in the natural scene, and the background of the characters in the natural scene is also quite complicated, there are many technical difficulties that need to be overcome in the recognition
[0004] At present, many research methods are mostly based on top-down algorithm models. Jaderberg et al. desig

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Character recognition system and method based on combination of neural network and attention mechanism
  • Character recognition system and method based on combination of neural network and attention mechanism
  • Character recognition system and method based on combination of neural network and attention mechanism

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0036] The technical solutions in the embodiments of the present invention will be described clearly and in detail below with reference to the drawings in the embodiments of the present invention. The described embodiments are only some of the embodiments of the invention.

[0037] The technical scheme that the present invention solves the problems of the technologies described above is:

[0038] The algorithm model mainly includes the following steps:

[0039] Step S1, the convolutional neural network feature extraction module is used for the spatial feature of the text image;

[0040] Step S2, inputting the spatial features extracted by the convolutional neural network into the bidirectional long-term short-term memory network module, and the bidirectional long-term short-term memory network can extract the sequential features of the text;

[0041] Step S3, perform semantic encoding on the extracted feature vectors, and then assign the attention weights of the feature vect...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention claims to protect a character recognition system and method based on the combination of a neural network and an attention mechanism, the system comprising: a convolution neural network feature extraction module, which is used for spatial feature of character image; The spatial features extracted by the convolution neural network are input to the bi-directional long-short memory network module, and the bi-directional long-short memory network can extract the sequence features of characters. The extracted feature vectors are semantically encoded, and then the attention weights of feature vectors are assigned through the attention mechanism, so that the attention is focused on the feature vectors with higher weights. In the decoding part of the model, the features extracted fromattention and the prediction information of the previous time are used as the inputs of the nested long-short memory network. The purpose of using the long-short memory network is to keep the temporal characteristics of the eigenvectors and make the attention points of the model constantly change with time. In the decoding part, the features extracted from attention and the prediction informationof the previous time are used as the inputs of the nested long-short memory network. The invention can more accurately detect the text area in the natural scene, and has good detection effect on thesmall target text and the text with small tilt angle.

Description

technical field [0001] The invention belongs to character and image recognition in natural scenes, and relates to a correlation algorithm combining a convolutional neural network, a long-term short-term memory network and an attention mechanism. Background technique [0002] Natural scenes are the living environment we live in. Natural scene images contain a variety of visual information, such as text, cars, landscapes, organisms, and architectural landscapes. These elemental information constitute the main content of natural scenes. Element. [0003] Digital recognition in natural scenes belongs to the category of text recognition in natural scenes. The research on text recognition in natural scenes began in the 1990s, but it is still an unsolved problem until now. Generally speaking, the text recognition task in natural scenes consists of two parts: text region detection and text recognition. Text recognition is based on detection, and the detected text box is used as re...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06K9/00G06K9/46G06K9/62G06N3/04G06N3/08
CPCG06N3/08G06V30/40G06V10/464G06N3/045G06F18/214
Inventor 杨宏志庞宇王慧倩
Owner CHONGQING UNIV OF POSTS & TELECOMM
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products