End-to-end identification method for scene text with random shape

A technology of arbitrary shape and recognition method, applied in the field of computer vision, which can solve problems such as inability to train

Active Publication Date: 2018-09-18
HUAZHONG UNIV OF SCI & TECH
View PDF4 Cites 191 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, both of these approaches have two major drawbacks, first, neither of them can be trained fully in an end-to-end fashion
Second, these methods can only recognize horizontal or oriented text, but the shape of the text in actual scene pictures may change significantly from horizontal or oriented to curved forms

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • End-to-end identification method for scene text with random shape
  • End-to-end identification method for scene text with random shape
  • End-to-end identification method for scene text with random shape

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0055] In order to make the object, technical solution and advantages of the present invention more clear, the present invention will be further described in detail below in conjunction with the accompanying drawings and examples. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention. In addition, the technical features involved in the various embodiments of the present invention described below can be combined with each other as long as they do not constitute a conflict with each other.

[0056] Below at first explain and illustrate with regard to the technical terms of the present invention:

[0057] ResNet-50: A neural network that can be used for classification. The network is mainly composed of 50 layers of convolutional layers, pooling layers, and shortcut connection layers. The convolutional layer is used to extract image features; the function of the pooling layer is to r...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an end-to-end identification method for a scene text with a random shape. The method comprises the steps of extracting a text characteristic through a characteristic pyramid network for generating a candidate text box by an area extracting network; adjusting the position of the candidate text box through quick area classification regression branch for obtaining more accurate position of a text bounding box; inputting the position information of the bounding box into a dividing branch, obtaining a predicated character sequence through a pixel voting algorithm; and finally processing the predicated character sequence through a weighted editing distance algorithm, finding out a most matched word of the predicated character sequence in a given dictionary, thereby obtaining a final text identification result. According to the method of the invention, the scene texts with the random shape can be simultaneously detected and identified, wherein the scene texts comprisehorizontal text, multidirectional text and curved text. Furthermore end-to-end training can be completely performed. Compared with prior art, the identification method according to the invention has advantages of obtaining advantageous effects in accuracy and versatility, and realizing high application value.

Description

technical field [0001] The invention belongs to the technical field of computer vision, and more specifically relates to an end-to-end recognition method for arbitrary-shaped scene text. Background technique [0002] In the field of computer vision, scene text detection and recognition is a very active and challenging research direction, and many real-life applications are closely related to it, such as image-based geolocation, real-time translation, and assistance for the blind. [0003] The goal of scene text detection and recognition methods is to simultaneously detect and recognize text from natural scenes, which is divided into two tasks: detection and recognition. In most of the past studies, text detection and recognition are handled separately, that is, the first step uses the trained detector to detect the text area in the natural scene picture, and the second step uses the text area detected in the first step Input the identification module to identify and obtain ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/34G06K9/32G06K9/62G06N3/04
CPCG06V30/153G06V10/267G06V10/25G06N3/045G06F18/214G06N3/084G06V20/63G06V10/454G06V10/82G06N3/044
Inventor 白翔吕鹏原廖明辉姚聪储佳佳
Owner HUAZHONG UNIV OF SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products