Supercharge Your Innovation With Domain-Expert AI Agents!

Self adaptive two-valued method, device for file

A binary, self-adaptive technology, applied in character and pattern recognition, instruments, calculations, etc., can solve the problems of complex, background intensity changes, poor quality document work is not very good, etc., to achieve the effect of small amount of calculation

Inactive Publication Date: 2009-11-18
CANON KK
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] 1. Changes in background intensity due to uneven illumination and improper storage;
[0006] 2. Very low local contrast due to blemishes, stains and shadows in the capture process of document images;
[0007] 3. Non-stationary and signal-related noise;
[0008] 4. There is a change in the contrast between the foreground and background areas of the image;
[0010] 6. Grayscale images have low resolution
[0020] 1. The global threshold segmentation technique is suitable for high-quality document images, but it does not work very well for documents with poor quality;
[0021] 2. The local threshold segmentation technique does not work very well for small-sized images, because the image must first be sampled and reduced into smaller blocks, and then expanded
Boundary padding can seriously affect the final result;
[0022] 3. Some local average threshold segmentation techniques such as the Niblaek method often amplify noise and tend to misclassify large background areas as text;
[0023] 4. These techniques often require the use of edge detection techniques, thinning methods and / or post-processing to remove "ghost" objects;
[0024] 5. The local threshold segmentation technology requires multiple image readings, which is very slow and not suitable for high-quality images
Segmentation methods can outperform it in terms of sophistication if it becomes too complex for improved

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Self adaptive two-valued method, device for file
  • Self adaptive two-valued method, device for file
  • Self adaptive two-valued method, device for file

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0051] computer system example

[0052] The method and device of the present invention can be implemented in any information processing device. The information processing device is, for example, a personal computer (PC), a notebook computer, a digital camera, or a single-chip microcomputer (SCM) embedded in a scanner, copier, or facsimile. It is easy for those skilled in the art to realize the method and device of the present invention by software, hardware and / or firmware. In particular, it should be noted that it will be obvious to those of ordinary skill in the art that in order to perform any step or combination of steps of the method, or any part or combination of parts of the device of the present invention, it may be necessary to use input and output devices, storage devices and A microprocessor such as a CPU or the like. These devices are not necessarily mentioned in the following description of the method and device of the present invention, but are actually used. ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The present application relates to a method, device and storage medium for adaptive binarization of documents. The method for performing self-adaptive binarization on a gray-scale document image of the present invention includes: a division step, dividing the gray-scale document into blocks; a first determination step, determining the background block and the background block among the divided blocks according to the characteristics of the blocks Text block; the second determining step is to determine the background pixels in the pixels included in each text block determined in the first determining step; the first calculation step is to calculate a block threshold surface representing the threshold of each block, wherein, based on the first All pixels included in the background block determined in a determining step calculate the threshold value of the background block, and the threshold value of the text block is calculated based on the background pixels included in the text block determined in the first determining step, the background pixels are in the second determined in the determining step; and a binarization step of binarizing the grayscale document image using the block threshold surface calculated in the first calculation step.

Description

technical field [0001] The present invention relates generally to document image processing, and more particularly to optical character recognition (OCR). More specifically, it relates to a document adaptive binarization method, device and storage medium. Background technique [0002] Binarization of document images is the first step in document image analysis systems such as optical character recognition systems. The output of the thresholding operation is a binary image, where one state represents the foreground object, ie printed text, and its complementary state corresponds to the background. Binarization methods can be divided into two categories: global and local thresholding segmentation techniques, see .D. Trier and A.K. Jain. Goal-directed evaluation of binarizaton methods, IEEE Transactions on Pattern Analysis and Machine Intelligence, 17(12): 1191-1201, 1995, Yibing Yang and Hong Yan. Anadaptive logical method for binarization of degraded document images, Patt...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06K9/20
Inventor 曾旭李献肖其林
Owner CANON KK
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More