Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Natural scene image text detection method based on cascaded convolutional neural network

A convolutional neural network, natural scene image technology, applied in instruments, computing, character and pattern recognition, etc., can solve problems such as failure, achieve the effect of solving imbalance, speeding up detection, and improving detection performance

Inactive Publication Date: 2018-03-06
WUHAN UNIV +1
View PDF5 Cites 15 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The problem with the penalty factor at the algorithm level is how to set the penalty factor and how to determine the size of the penalty factor, and when the sample categories are extremely unbalanced, this method will fail

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Natural scene image text detection method based on cascaded convolutional neural network
  • Natural scene image text detection method based on cascaded convolutional neural network
  • Natural scene image text detection method based on cascaded convolutional neural network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0031] In order to facilitate those of ordinary skill in the art to understand and implement the present invention, the present invention will be described in further detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the implementation examples described here are only used to illustrate and explain the present invention, and are not intended to limit this invention.

[0032] Such as figure 1 , the technical solution adopted in the embodiment of the present invention is a method for detecting text in natural scene images based on a cascaded convolutional neural network, comprising the following steps:

[0033] (1) Using the Maximally Stable Extremal Regions (MSERs) method to extract candidate characters in the input image, there are unreal characters in the extraction results, and the number of unreal characters is much greater than the number of real characters.

[0034] The specific implementation of MSER feature extract...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a natural scene image text detection method based on a cascaded convolutional neural network, which comprises the steps of firstly performing candidate character extraction onan input image by using a maximally stable extremal region method; secondly, connecting a pre-trained 16-net convolutional neural network and 32-net convolutional neural network in series to form a cascaded convolutional neural network, classifying the candidate characters by using the cascaded convolutional neural network, eliminating inauthentic characters in the candidate characters to obtain afinal authentic character detection result; then pairing every two adjacent characters in the character detection result, combining the characters having a shared character until all of the characters are combined, and obtaining a text line; and finally dividing the text line into a plurality of words or phrases according to the character spacing, and acquiring a final text detection result. Thenatural scene image text detection method effectively solves a problem that training samples are unbalance in category, and effectively improves the text detection efficiency and performance.

Description

technical field [0001] The invention belongs to the technical field of natural scene text detection, and in particular relates to a natural scene image text detection method based on a cascaded convolutional neural network. Background technique [0002] Natural scene text detection generally includes four steps: candidate character extraction, character and non-character classification, text line construction and text line segmentation. Candidate character extraction refers to extracting regions with such features from natural scene images as candidate characters by studying the structure and color features of characters; then using machine learning algorithms or deep learning algorithms to train a binary classifier, and using this The classifier classifies the candidate characters and divides them into characters and non-characters; then uses the positional relationship between adjacent characters to group the characters into text lines, that is, the text line structure; fi...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06K9/32G06K9/34G06K9/62
CPCG06V20/62G06V10/267G06F18/241
Inventor 刘菊华吴伟顾龙
Owner WUHAN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products