Text recognition method under natural scene on the basis of spatial transformation

A text recognition and space transformation technology, applied in the field of computer vision, can solve problems such as being easily affected by factors such as shooting angles and lighting, and text distribution is irregular and complex

Active Publication Date: 2016-07-06
HUAZHONG UNIV OF SCI & TECH
View PDF4 Cites 43 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The image background of natural scenes is usually complex, the distribution of text is irregular, and it is easily affected by factors such as shooting angle and lighting. Therefore, text analysis in natural scenes has always been one of the difficulties in the field of computer vision technology.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text recognition method under natural scene on the basis of spatial transformation
  • Text recognition method under natural scene on the basis of spatial transformation
  • Text recognition method under natural scene on the basis of spatial transformation

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0050] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention. In addition, the technical features involved in the various embodiments of the present invention described below can be combined with each other as long as they do not constitute a conflict with each other.

[0051] Below at first explain and illustrate with regard to the technical terms of the present invention:

[0052] Convolutional Neural Network (CNN): A neural network that can be used for tasks such as image classification and regression. Networks usually consist of convolutional layers, downsampling layers, and fully connected layers. The convolutional layer and the downsampling layer are re...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a text recognition method under a natural scene on the basis of spatial transformation. The method comprises the following steps: firstly, obtaining the text contents of a text image in a training image set, and training network models including a datum point positioning network, an image preprocessing network, an image encoder network, a feature encoder network and the like; then, utilizing the network models obtained by training to carry out spatial transformation on an image in a set of images to be identified to obtain the transformed image to be identified; and then, calculating the feature vector and a prediction probability sequence of the transformed image to be identified, and finally obtaining an image recognition result. The method is high in text recognition accuracy and can overcome the influence of adverse factors including irregular text arrangement and the like.

Description

technical field [0001] The invention belongs to the technical field of computer vision, and more specifically relates to a text recognition method in natural scenes based on spatial transformation. Background technique [0002] In production and life, people will come into contact with a large number of texts in natural scenes, such as road signs, billboards, product packaging, etc. These texts contain rich information. Automatically obtaining text information in images can help people more effectively Understanding images and extracting these text information has very important practical value, and has strong practicality for unmanned driving, automatic acquisition of geographic location, and automatic translation based on images. The image background of natural scenes is usually complex, the distribution of text is irregular, and it is easily affected by factors such as shooting angle and lighting. Therefore, text analysis in natural scenes has always been one of the diffi...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62
CPCG06F18/24G06F18/214
Inventor 白翔石葆光张新浩
Owner HUAZHONG UNIV OF SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products