Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Scene character recognition method and system based on semantic enhancement encoder decoder framework

A text recognition and decoder technology, applied in character recognition, character and pattern recognition, instruments, etc., can solve the problems of lack of use of global information, lack of effective help in the recognition process, and ineffective learning of global information , to achieve the effect of strong flexibility and generalization

Pending Publication Date: 2020-10-09
INST OF INFORMATION ENG CAS
View PDF1 Cites 15 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] 3. The existing methods lack certain supervision on the use of global information, resulting in the inability to effectively learn the global information, thus not being very effective in the recognition process

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Scene character recognition method and system based on semantic enhancement encoder decoder framework
  • Scene character recognition method and system based on semantic enhancement encoder decoder framework
  • Scene character recognition method and system based on semantic enhancement encoder decoder framework

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0033] In order to make the technical solution of the present invention more comprehensible, specific embodiments and accompanying drawings are described in detail as follows.

[0034] This embodiment proposes a scene character recognition method (SE-ASTER for short) based on the semantically enhanced encoder-decoder framework, based on the semantically enhanced encoder-decoder structure (SEED), such as figure 1 As shown, in the existing framework, SEED uses the visual information output by the encoder to predict a global semantic information, and introduces the word vector commonly used in the field of natural language processing as a supervision of it, and then uses this global semantic information information to guide the subsequent decoding process. At the same time, a new mainstream method ASTER is combined with the proposed framework, and a new scene image text recognition method SE-ASTER is proposed. Such as figure 2 As shown, SE-ASTER is implemented by a scene text ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a scene character recognition method and system based on a semantic enhancement encoder decoder framework, and the method comprises the steps of correcting a text in any shape on a target image into a horizontal text, and obtaining a corrected image; inputting the corrected image into a convolutional neural network to extract visual features, and extracting sequence information from the visual features by using a recurrent neural network; predicting global semantic information according to the sequence information; and initializing the state of a GRU (Gateway Recirculation Unit) based on an attention mechanism by utilizing the global semantic information, calculating an attention weight according to the visual feature and the hidden state of each decoding time of theGRU, weighting the visual feature according to the attention weight, and predicting each character on the image. According to the invention, global information can be effectively utilized to overcomethe defect that local information is used in an existing method, and meanwhile, a gap between visual information and semantic information is reduced, so that the model can better process a low-quality image.

Description

technical field [0001] The invention relates to the field of computer image character recognition, in particular to a scene character recognition method and system based on a semantically enhanced encoder-decoder framework. Background technique [0002] Text detection and recognition of scene images is a research hotspot in recent years. Text recognition is the core part of the whole process. Its task is to transcribe the text in the picture into a text format that can be directly edited by the computer. With the development of deep learning, this field has made rapid progress. Inspired by the field of machine translation, the current mainstream methods are based on the encoder-decoder structure. The encoder extracts rich visual features through convolutional neural networks and cyclic neural networks, and the decoder obtains the required features through the attention mechanism. According to the text The order of the sequence predicts each character in the sequence. [00...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06K9/32G06K9/40G06N3/04
CPCG06V30/1478G06V10/30G06V30/10G06N3/045
Inventor 王伟平乔峙周宇杨东宝周玉灿
Owner INST OF INFORMATION ENG CAS
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products