EMD (empirical mode decomposition)-based intelligent document image block detection method

A detection method and technology for intelligent documents, applied in the fields of instruments, character and pattern recognition, computer parts, etc., can solve problems such as difficulties, limited storage space and computing power of portable devices, and time-consuming, and achieve the effect of small amount of calculation.

Active Publication Date: 2015-09-30
SHENZHEN TISMART TECH CO LTD
View PDF6 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

At present, many algorithms have been proposed to distinguish text from the background of document images, but due to the limitation of imaging environment, document images are often degraded by factors such as complex backgrounds, non-uniform intensity, shadows, etc. in the text, it becomes very difficult
[0005] In order to solve these problems, many binarization algorithms have been proposed. These algorithms are mainly classified into two categories, which include: 1. Global threshold segmentation algorithms, such as Tsai, Johannsan, Kapur and other algorithms. However, these methods are not applicable For source images containing various background patterns or uneven backgrounds; 2. Local threshold segmentation algorithms, such

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • EMD (empirical mode decomposition)-based intelligent document image block detection method
  • EMD (empirical mode decomposition)-based intelligent document image block detection method
  • EMD (empirical mode decomposition)-based intelligent document image block detection method

Examples

Experimental program
Comparison scheme
Effect test

specific Embodiment

[0071] Such as figure 2 Shown, a kind of EMD-based intelligent document image block detection method, it specifically comprises:

[0072] S1. Acquiring a document image, and performing preprocessing on the acquired document image, for example, rotating an oblique document image into a horizontal document image;

[0073] S2. Perform grayscale transformation processing on the document image to obtain the document grayscale image, and use f(x, y) to represent the pixel matrix of the document grayscale image, and the dimension of the matrix is ​​M*N;

[0074] S3. Calculate the mean and variance of the document grayscale image, and calculate the horizontal projection histogram of the document grayscale image;

[0075] The formula used for calculating the horizontal projection histogram of the grayscale image of the document is as follows:

[0076] H ( y ) = Σ x ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an EMD (empirical mode decomposition)-based intelligent document image block detection method. The method includes: subjecting a document image to grey level transformation; calculating an average, a variance and a horizontal projective histogram of a document gray level image; subjecting the histogram to EMD decomposition, and taking a summation result of a last eigenmode modular function component and a residual error as a trend line of the histogram; calculating x-coordinates of intersections between the trend line and the histogram, and calculating spacing distances of the x-coordinates; carrying out image subblock segmentation according to the spacing distances, and calculating an average and a variance of each image subblock; judging whether the image subblocks are background or not according to the average and the variance of the document gray level image; subjecting the histogram to EMD decomposition, and taking a summation result of a last and the averages and the variances of the image subblocks, and processing correspondingly according to judgment results. The method is not only applicable to file images degraded by complicated imaging environments but small in calculated amount and applicable to portable devices. In addition, the method is widely applicable to the field of file image detection.

Description

technical field [0001] The invention relates to document image detection and processing technology, in particular to an EMD-based intelligent document image block detection method, which is suitable for adaptive threshold segmentation of degraded document images. Background technique [0002] technical term explanation [0003] EMD: Empirical Mode Decomposition, the English full name is Empirical Mode Decomposition. [0004] It has great application value for distinguishing text from the background of the document image, that is, converting the grayscale image of the document image into a binary image, and in many applications, it is mainly used in optical character recognition, automatic bank check processing, Signature verification, etc. At present, many algorithms have been proposed to distinguish text from the background of document images, but due to the limitation of imaging environment, document images are often degraded by factors such as complex backgrounds, non-u...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06K9/00G06K9/34G06K9/46
CPCG06V30/414G06V10/507G06V10/267
Inventor 蔡念肖盼陈裕潮刘根杨志景王晗
Owner SHENZHEN TISMART TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products