Scene text detection method based on stroke width transformation and convolution neural network

A technology of stroke width transformation and convolutional neural network, applied in the field of scene text detection based on stroke width transformation and convolutional neural network, to achieve the effects of rich quantity and quality, fast detection speed and fast detection rate

Active Publication Date: 2019-01-01
XIAMEN KING LONG UNITED AUTOMOTIVE IND CO LTD
View PDF4 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] The present invention provides a scene text detection method based on stroke width transformation and convolutional neural network, the main purpose of which is to solve the above-mentioned problems existing in the detection of scene text by existing detection methods

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Scene text detection method based on stroke width transformation and convolution neural network
  • Scene text detection method based on stroke width transformation and convolution neural network
  • Scene text detection method based on stroke width transformation and convolution neural network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0024] The specific embodiments of the present invention will be described below with reference to the drawings.

[0025] See figure 1 with figure 2 , The implementation of the embodiment of the present invention includes the following steps:

[0026] A. Prepare the training data set and train the text binary classifier based on the Bootstrap strategy through the convolutional neural network. Among them, the training data set contains a positive sample image set of text images and a negative sample image set that does not contain text images; all the samples in the positive sample image set and the negative sample image set are cropped images of 48×48 pixels.

[0027] The positive sample images and part of the negative sample images were collected from the training data set of the STV2k database collected from the streets of China and other existing databases. Use the annotation data of these text databases for image cropping. Randomly sample rectangular windows in the entire imag...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a scene text detection method based on stroke width conversion and convolution neural network, which relates to the field of scene text detection. The method comprises the following steps: preparing a training data set; training a text binary classifier based on a Bootstrap strategy by the convolution neural network; training a text binary classifier based on a Bootstrap strategy by the convolution neural network. Obtaining candidate text regions from the image by using the maximum stable extremum region algorithm; classifying the candidate text region using the text binary classifier; in the candidate text region, the candidate characters are obtained based on the stroke width transformation algorithm, and the candidate characters are filtered through the geometricconstraints. The method has the advantages that the training based on the Bootstrap strategy enriches the quantity and quality of sample images; Stroke width transformation algorithm is used to improve the detection performance based on the candidate text region, and the detection level is determined to be character level. The region-based algorithm and a large number of Chinese training samplesenable the method to detect Chinese text effectively.

Description

Technical field [0001] The invention relates to the field of scene text detection, in particular to a scene text detection method based on stroke width transformation and convolutional neural network. Background technique [0002] In unmanned driving technology, it is an important task to perform three-dimensional environment modeling through perception technology. There are many related modeling data in real road scenes, such as text information in traffic signs, license plates, road signs, and billboards. The text detection and recognition in natural scene images can be used to automatically extract text information, which is one of the important research directions in computer vision. In recent years, research scholars have made certain breakthroughs in their research and built a series of evaluation databases. However, due to factors such as changeable image scenes and diverse texts, there are still many challenges in text detection and recognition in scene images. [0003] ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06K9/32G06K9/34
CPCG06V20/62G06V30/158G06V30/10
Inventor 肖苹苹柯志达林春敏彭振文苏亮陈卫强周方明
Owner XIAMEN KING LONG UNITED AUTOMOTIVE IND CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products