Natural scene text recognition method based on cross attention mechanism

A technology for natural scene and text recognition, applied in the field of natural scene text recognition, can solve the problems of difficulty in natural scene text recognition, increased difficulty in recognizing text, affecting recognition results, etc., to save labeling costs, improve recognition performance, and recognize accuracy. high effect

Pending Publication Date: 2019-11-05
SOUTH CHINA UNIV OF TECH
View PDF7 Cites 12 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Due to the rich and varied background of the text in the natural scene, and often due to some artistic effects, the arrangement of the fonts is irregular, such as a curved shape, which greatly increases the difficulty for the computer to recognize the text from the picture. Factors make natural scene text recognition difficult, so there is an urgent need for a more effective method for recognizing natural scene text
[0004] The research progress of deep neural network just provides us with tools. Recently, researchers have proposed a variety of methods for using deep neural network to recognize text i

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Natural scene text recognition method based on cross attention mechanism
  • Natural scene text recognition method based on cross attention mechanism
  • Natural scene text recognition method based on cross attention mechanism

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0057] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0058] In order to make the above objects, features and advantages of the present invention more comprehensible, the present invention will be further described in detail below in conjunction with the accompanying drawings and specific embodiments.

[0059] refer to Figure 1-5 As shown, the present invention provides a kind of natural scene text recognition method based on cross attention mechanism, comprises the steps:

[0060] S1. Data acquisition: downloa...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a natural scene text recognition method based on a cross attention mechanism, and the method comprises the steps: data obtaining: downloading a sample picture in a natural scene, and synthesizing the picture into a training set through employing a public code; wherein stretching operation is conducted on the sizes of all the training sample pictures, the size of the processed sample pictures is 32 * 100, the height-width ratio is kept consistent with that of an original picture, and the insufficient parts are filled with black edges; label manufacturing: a supervised method is adopted to train an identification model, so that each row of text pictures has corresponding text information; training a network: inputting the prepared training picture data and labels intoa cross attention network for training, wherein the cross attention network is composed of a vertical attention network and a horizontal attention network; inputting test data into the trained network, and finally obtaining an identification result and predicting the confidence coefficient of each character. The method is high in recognition accuracy and strong in robustness, and has good recognition performance for texts with irregular shapes.

Description

technical field [0001] The invention belongs to the technical field of pattern recognition and artificial intelligence, in particular to a natural scene text recognition method based on a cross-attention mechanism. Background technique [0002] With the rapid development of computer technology, artificial intelligence technology is gradually changing our lives, making our lives more convenient and efficient. The recent rapid development of GPU and other hardware technologies has also made the practical application of deep neural networks possible. [0003] In real life, we cannot do without text. Most of the information that humans obtain visually is carried by words. Whether in the past or in the future, human beings will rely heavily on obtaining information from text, and the crucial step in obtaining text information is to correctly recognize text. It is easy for humans to recognize text from a picture, but it is not an easy task for computers. If a computer is neede...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06K9/20G06N3/04G06N3/08
CPCG06N3/08G06V10/22G06N3/045Y02T10/40
Inventor 黄云龙金连文罗灿杰林庆祥周伟英
Owner SOUTH CHINA UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products