Scene text recognition method based on multi-scale features

A multi-scale feature and text recognition technology, applied in the field of scene text recognition, can solve the problems affecting the performance of the recognizer and the inability to adapt to different character sizes, and achieve the effects of fast recognition speed, improved robustness, and high recognition accuracy

Pending Publication Date: 2020-11-20
SOUTH CHINA UNIV OF TECH +1
View PDF3 Cites 8 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Since the current scene text recognition technology only retains the output of the last layer of the deep convolutional neural network, its fixed and too large receptive field cannot adapt to the situation where characters of different sizes appear in the text at the same time, and in the case of generally small characters Under normal circumstances, more than half of its too large receptive field will fall in the background area, which seriously affects the performance of the recognizer.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Scene text recognition method based on multi-scale features
  • Scene text recognition method based on multi-scale features
  • Scene text recognition method based on multi-scale features

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0030] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0031] In order to make the above objects, features and advantages of the present invention more comprehensible, the present invention will be further described in detail below in conjunction with the accompanying drawings and specific embodiments.

[0032] refer to figure 1 As shown, this embodiment provides a method for scene text recognition based on multi-scale features, including the following steps:

[0033] S1, data acquisition: acquire the scene text ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a scene text recognition method based on multi-scale features, and the method comprises the steps: obtaining a scene text image data set, and carrying out the size adjustment of image data in the scene text image data set; constructing a text recognition model, and training the text recognition model by using the scene text image data set after size adjustment, wherein thetext recognition model comprises a multi-scale feature coding module and a multi-layer attention mechanism decoding module, the multi-scale feature coding module is used for acquiring character features of a plurality of scales, and the multi-layer attention mechanism decoding module combines a two-dimensional attention mechanism and a one-dimensional attention mechanism to fuse the character features of the plurality of scales to obtain a character recognition result in a scene text; and collecting a scene text image to be recognized, adjusting the scene text image to a uniform size, and inputting the scene text image into the trained text recognition model to complete recognition of the multi-scale characters in the scene text. According to the method, the multi-scale characters in the scene text can be quickly and accurately recognized.

Description

technical field [0001] The invention relates to the technical field of scene text recognition, in particular to a scene text recognition method based on multi-scale features. Background technique [0002] In recent years, with the rapid development of deep neural networks, the innovative application of artificial intelligence technology has been greatly promoted. Scene text recognition, as a part of artificial intelligence technology, has received extensive attention. Unlike optical character recognition in scanned documents, scene text recognition is very challenging due to various text fonts, low resolution, and susceptibility to light and shadow changes. The current mainstream scheme is summarized as using deep convolutional neural network to extract high-order features of images, using recurrent neural network to associate horizontal feature vectors, and finally using CTC (Connectionist Temporal Classification, connectionist temporal classification) or attention mechani...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/32G06K9/34G06K9/62G06N3/04G06N3/08
CPCG06N3/084G06V20/63G06V30/153G06N3/045G06F18/214G06F18/25
Inventor 张家鑫金连文罗灿杰王天玮李子彦周伟英
Owner SOUTH CHINA UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products