Scene text end-to-end identification method based on boundary point detection

A recognition method and boundary point technology, applied in the field of computer vision, can solve the problem that the recognition network cannot model the text sequence information.

Active Publication Date: 2020-02-25
HUAZHONG UNIV OF SCI & TECH
View PDF2 Cites 27 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, such methods require character-level annotation, and

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Scene text end-to-end identification method based on boundary point detection
  • Scene text end-to-end identification method based on boundary point detection
  • Scene text end-to-end identification method based on boundary point detection

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0061] In order to make the purpose, technical solution and advantages of the present invention more clear, the present invention will be further described in detail below in conjunction with the accompanying drawings and examples. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention. In addition, the technical features involved in the various embodiments of the present invention described below can be combined with each other as long as they do not constitute a conflict with each other.

[0062] Below at first explain and illustrate with regard to the technical terms of the present invention:

[0063] ResNet-50: A neural network that can be used for classification. The network is mainly composed of 50 layers of convolutional layers, pooling layers, and shortcut connection layers. The convolutional layer is used to extract image features; the function of the pooling layer is to ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a scene text end-to-end recognition method based on boundary point detection, and the method comprises the steps: extracting text features through a feature pyramid network, and generating candidate textboxes through a region extraction network; detecting a more accurate multi-directional bounding box of the text instance through a multi-directional rectangular detection network; secondly, detecting an upper boundary point sequence and a lower boundary point sequence of the text in the multi-directional bounding box; and finally, converting the text in any shape into ahorizontal text by utilizing the detected boundary point sequence for the subsequent attention mechanism-based sequence recognition network to performing recognizing, and finally, finding out the mostmatched word of the prediction sequence in the given dictionary by utilizing a cluster search algorithm to obtain a final text recognition result. According to the method, the scene text in any shapein the natural image can be detected and recognized at the same time under the condition that character-level labeling is not needed, the scene text comprises the horizontal text, the multi-directiontext and the curved text, and end-to-end training can be completely carried out.

Description

technical field [0001] The invention belongs to the technical field of computer vision, and more specifically relates to an end-to-end recognition method for scene text based on boundary point detection. Background technique [0002] In the field of computer vision, scene text detection and recognition is a very active and challenging research direction, and many practical applications are highly related to it, such as network information security monitoring systems, intelligent transportation systems, and assistance for the blind. [0003] In most of the past studies, scene text detection and recognition techniques are regarded as two separate processes, that is, the first step uses a trained detector to detect text regions in natural scene pictures, and the second step uses the first step The detected text area is input into the recognition module for recognition, and the text content is obtained. Since detection and recognition tasks are highly related and complementary ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06K9/34G06K9/46G06K9/62
CPCG06V30/153G06V10/44G06V30/10G06F18/241G06F18/214
Inventor 刘文予白翔许永超王豪卢普张辉杨明锟何梦超王永攀
Owner HUAZHONG UNIV OF SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products