Rapid image text detection method based on multi-channel and multi-dimensional cascade filter

A multi-scale, multi-channel technique applied in the field of image processing. It addresses character-string breakage and loss, low text detection recall, and poor detection performance. Its effects are eliminating false detections, improving the recall rate, and a simple, clear structure.

Inactive Publication Date: 2017-02-08
XIDIAN UNIV
0 Cites, 30 Cited by
AI Technical Summary

Problems solved by technology

[0007] 1) Detection is slow.
[0008] 2) Detection performance is poor for large characters, dot-matrix fonts, translucent text, non-uniform lighting, and similar cases.
[0009] 3) Character strings are easily broken or lost when candidates are combined into strings, so the recall rate of text detection is low.

Examples

Embodiment Construction

[0029] Referring to Figure 1, the implementation steps of the present invention are as follows:

[0030] Step 1: Extract maximally stable extremal regions (MSERs) from different channels and scales of the input image as character candidate regions.

[0031] 1a) Reduce the length and width of the input image I to 0.125 times the original to obtain the reduced image I_1;
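Step 1a can be sketched in a few lines. The patent extract does not say which interpolation is used, so nearest-neighbour sampling is an assumption here, and the function name `downscale_nearest` and the list-of-rows image layout are hypothetical choices made for illustration.

```python
def downscale_nearest(image, factor=0.125):
    """Shrink a 2-D image (list of rows) by nearest-neighbour sampling.

    Assumption: the interpolation method is not given in the source text;
    nearest-neighbour is used only because it is the simplest option.
    """
    src_h, src_w = len(image), len(image[0])
    dst_h = max(1, int(src_h * factor))
    dst_w = max(1, int(src_w * factor))
    return [
        [
            # Map each destination pixel back to a source pixel, clamped
            # to the image bounds.
            image[min(src_h - 1, int(y / factor))][min(src_w - 1, int(x / factor))]
            for x in range(dst_w)
        ]
        for y in range(dst_h)
    ]


# A 16x16 ramp image stands in for the input image I.
I = [[y * 16 + x for x in range(16)] for y in range(16)]
I1 = downscale_nearest(I)  # reduced image I_1, here 2x2
```

Running detection on both I and I_1 is what lets one fixed set of region parameters cover both small and very large characters.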

[0032] 1b) Convert the input image I and the reduced image I_1 from the RGB color space to the YUV color space, where Y is the luminance component, U is the blue chroma component, and V is the red chroma component;
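As a rough illustration of step 1b, the conversion for a single pixel might look like the following. The source does not state which YUV variant it uses; the analog BT.601 coefficients below are an assumption, and `rgb_to_yuv` is a hypothetical helper name.

```python
def rgb_to_yuv(r, g, b):
    """Convert one RGB pixel to YUV.

    Assumption: BT.601 luma weights with the classic analog U/V scaling;
    the patent text does not specify the exact coefficients.
    """
    y = 0.299 * r + 0.587 * g + 0.114 * b  # luminance
    u = 0.492 * (b - y)                    # blue chroma
    v = 0.877 * (r - y)                    # red chroma
    return y, u, v


white = rgb_to_yuv(255, 255, 255)  # neutral colour: chroma close to zero
blue = rgb_to_yuv(0, 0, 255)       # positive U, negative V
```

Chroma channels can separate text from background when the two have similar brightness but different colour, which is a plausible reason for adding U and V to the R, G, and B channels.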

[0033] 1c) In each of the R, G, B, U, and V channels of the input image I and the reduced image I_1, extract maximally stable extremal regions as character candidates according to the following formula,

[0034] where Q_m denotes the region at gray intensity m and Δ is the variation in gray intensity, which is set to 3 in the present invention; when q(m) is a local mi...
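The formula itself is not reproduced in this extract. The commonly used MSER stability measure, which matches the symbols defined here, is q(m) = |Q_(m+Δ) \ Q_(m-Δ)| / |Q_m|, with a region treated as maximally stable where q(m) reaches a local minimum. The sketch below assumes that form; the function names and the sample area sequence are hypothetical. Because the regions are nested as the threshold rises, the area of the set difference equals the difference of the areas.

```python
def stability(areas, delta=3):
    """Return {m: q(m)} for a nested region sequence.

    areas[m] is |Q_m|, the pixel count of the region at gray level m.
    Assumption: q(m) = (|Q_{m+delta}| - |Q_{m-delta}|) / |Q_m|, the standard
    MSER measure; the patent's own formula is missing from the extract.
    """
    q = {}
    for m in range(delta, len(areas) - delta):
        if areas[m] > 0:
            q[m] = (areas[m + delta] - areas[m - delta]) / areas[m]
    return q


def stable_levels(q):
    """Gray levels where q(m) is strictly below both neighbouring values."""
    return [
        m for m in sorted(q)
        if m - 1 in q and m + 1 in q and q[m] < q[m - 1] and q[m] < q[m + 1]
    ]


# Hypothetical areas: fast growth, a plateau around m = 5..9, fast growth again.
areas = [10, 20, 30, 40, 50, 52, 53, 54, 55, 56, 58, 80, 120, 160, 200]
q = stability(areas, delta=3)
stable = stable_levels(q)
```

In this sample the region barely changes size across the plateau, so q(m) dips there and the level in the middle of the plateau is selected.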

Abstract

The present invention discloses a rapid image text detection method based on a multi-channel and multi-dimensional cascade filter. It mainly addresses the low recall and slow speed of the prior art. The method comprises: 1) extracting maximally stable extremal regions in different channels and scales of an input image as character candidate regions; 2) removing background regions from the character candidate regions with a coarse-to-fine cascade filter, namely setting thresholds on the morphological features of the candidate regions to perform first-stage coarse filtering, setting thresholds on the stroke width and the stroke width coefficient of variation of the candidate regions to perform second-stage coarse filtering, then removing overlapping regions, and employing a convolutional neural network binary classifier to perform fine filtering; and 3) aggregating the regions that pass the cascade filter into character strings through a graph model, according to their geometric and positional features. The method achieves a high recall rate, high accuracy, and fast speed, and can be used to detect image text under various kinds of interference.
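The coarse-to-fine cascade in step 2 can be sketched as below. This is a minimal illustration only: every threshold value, the `Candidate` fields, the use of intersection-over-union for overlap removal, and the `is_text` stand-in for the convolutional neural network classifier are assumptions, since the abstract gives none of these details.

```python
from dataclasses import dataclass
from statistics import mean, pstdev
from typing import Callable, List, Sequence


@dataclass
class Candidate:
    x: int
    y: int
    w: int
    h: int
    area: int                       # pixel count of the region
    stroke_widths: Sequence[float]  # sampled stroke widths inside the region


def passes_morphology(c, min_aspect=0.1, max_aspect=10.0, min_fill=0.1):
    """Stage 1: cheap shape checks (hypothetical features and thresholds)."""
    return min_aspect <= c.w / c.h <= max_aspect and c.area / (c.w * c.h) >= min_fill


def passes_stroke(c, max_width=40.0, max_cv=0.6):
    """Stage 2: stroke width and its coefficient of variation (std / mean)."""
    mu = mean(c.stroke_widths)
    return mu <= max_width and pstdev(c.stroke_widths) / mu <= max_cv


def iou(a, b):
    """Intersection over union of two bounding boxes."""
    iw = min(a.x + a.w, b.x + b.w) - max(a.x, b.x)
    ih = min(a.y + a.h, b.y + b.h) - max(a.y, b.y)
    inter = max(0, iw) * max(0, ih)
    return inter / (a.w * a.h + b.w * b.h - inter)


def remove_overlaps(cands, max_iou=0.5):
    """Keep the larger region of any heavily overlapping pair (assumed rule)."""
    kept: List[Candidate] = []
    for c in sorted(cands, key=lambda c: c.area, reverse=True):
        if all(iou(c, k) <= max_iou for k in kept):
            kept.append(c)
    return kept


def cascade(cands, is_text: Callable[[Candidate], bool]):
    """Run the cheap filters first so the costly classifier sees few regions."""
    stage1 = [c for c in cands if passes_morphology(c)]
    stage2 = [c for c in stage1 if passes_stroke(c)]
    return [c for c in remove_overlaps(stage2) if is_text(c)]


a = Candidate(0, 0, 10, 20, 120, [3, 3, 4, 3])     # plausible character
b = Candidate(0, 0, 200, 5, 900, [4, 4, 4, 4])     # too elongated
c = Candidate(50, 0, 10, 20, 100, [1, 9, 2, 12])   # erratic stroke width
d = Candidate(1, 1, 10, 20, 110, [3, 3, 3, 4])     # near duplicate of a
e = Candidate(100, 0, 10, 20, 90, [3, 3, 3, 3])    # rejected by the stub
result = cascade([a, b, c, d, e], is_text=lambda r: r.area >= 100)
```

Ordering the stages from cheapest to most expensive is the design point: most background regions are discarded by simple threshold tests, so only a small remainder reaches the classifier.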

Description

Technical field

[0001] The invention belongs to the technical field of image processing, and in particular relates to an image text detection method that can be used for text detection in natural scene images such as license plates and road signs.

Background

[0002] With the rapid development of computers and handheld mobile camera devices and the spread of Web 2.0 technology, the number of network images containing text has increased dramatically. Extracting text information from images helps deepen image understanding and retrieve the required information from massive data, saving time and improving efficiency. Traditional document text detection technology is mature, but image text detection still faces many challenges due to its complexity, such as font variability, complex backgrounds, and other interference factors. Text detection in images has therefore gradually become a hot topic in the field of image processing.

[0003] ...

Application Information

IPC(8): G06K9/32, G06K9/34, G06K9/62
CPC: G06V20/62, G06V10/267, G06V30/10, G06F18/2414
Inventor: 田春娜, 夏勇, 高新波, 张相南
Owner: XIDIAN UNIV