Document Image Binarization Method Based on Symmetry of Stroke Structure

A technology for document image and structural symmetry, which is applied in the directions of instruments, calculations, characters and pattern recognition, etc., can solve problems such as unclear printing and noise, and achieve the effect of enhancing adaptability, strong local adaptability, and overcoming non-text noise interference

Active Publication Date: 2019-07-19
INST OF AUTOMATION CHINESE ACAD OF SCI
View PDF5 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

In practical applications, the quality of text images may vary greatly, and there may be troubles such as unclear printing or noise

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Document Image Binarization Method Based on Symmetry of Stroke Structure
  • Document Image Binarization Method Based on Symmetry of Stroke Structure
  • Document Image Binarization Method Based on Symmetry of Stroke Structure

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0028] The technical problems solved by the embodiments of the present invention, the technical solutions adopted and the technical effects achieved are clearly and completely described below in conjunction with the accompanying drawings and specific embodiments. Apparently, the described embodiments are only some of the embodiments of the present application, not all of them. Based on the embodiments in the present application, all other equivalent or obviously modified embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention. Embodiments of the invention can be embodied in many different ways as defined and covered by the claims.

[0029] It should be noted that, in the following description, many specific details are given for the convenience of understanding. It may be evident, however, that the present invention may be practiced without these specific details.

[0030] It should also ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a document image binarization method based on the symmetry of stroke structure. Wherein, the method includes: determining the gradient image of the document image, wherein the document image is a grayscale image; using the maximum inter-class variance method to perform global binarization processing on the gradient image; according to the width and The symmetry of the gradient direction in the local area, remove the non-stroke gradient noise in the image after global binarization processing, and determine the gradient image with symmetric local gradient direction; determine the structural symmetry element image based on the gradient image with symmetric local gradient direction; The local density of foreground elements in the structurally symmetric element image is used to filter out noise, and combined with the document image to perform local binarization based on a voting strategy. The embodiment of the invention solves the technical problem of how to enhance the adaptability to document image text extraction.

Description

technical field [0001] The embodiment of the present invention relates to the technical fields of pattern recognition and optical character recognition, and specifically relates to a document image binarization method based on stroke structure symmetry, but is by no means limited thereto. Background technique [0002] In recent years, with the rapid development of network technology, human beings have entered the information age. Traditional information acquisition methods, such as books, newspapers, and periodicals, are inconvenient to carry and require a lot of space for storage, which is not easy to edit and organize. and spread. People are more and more inclined to use electronic devices such as disks for storage. Therefore, it is of great significance to quickly input the text information of paper materials into the computer. OCR (Optical Character Recognition, Optical Character Recognition) technology was born from this. OCR technology can realize high-speed and aut...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06K9/38G06K9/40
CPCG06V10/28G06V10/30
Inventor 肖柏华何坤史存召贾馥溪王春恒
Owner INST OF AUTOMATION CHINESE ACAD OF SCI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products