Ancient document image stain removal method

A method for removing stains from document images, applied to images of ancient book documents. It addresses the poor binarization results of existing methods and their inability to remove stains, and it achieves a good binarization effect.

Active Publication Date: 2018-01-19
NORTHWEST UNIVERSITY FOR NATIONALITIES

AI Technical Summary

Problems solved by technology

[0006] Methods such as Otsu, Niblack, and Sauvola cannot remove the stains in an image, and their binarization results are poor. To solve these problems in the prior art, the present invention provides a method for removing stains from images of ancient book documents, which uses the Lab colo...



Examples


Embodiment Construction

[0043] To make the object, technical solution, and advantages of the present invention clearer, the invention is described in further detail below with reference to the examples. The specific embodiments described here only explain the present invention and do not limit it.

[0044] The application principle of the present invention is described in detail below with reference to the accompanying drawings.

[0045] As shown in Figure 1, the method for removing stains from an image of an ancient book document is carried out in the following steps:

[0046] S101: Stain treatment:

[0047] The image to be processed (shown in Figure 2) is converted from the RGB color space to the Lab color space. The values of the three Lab channel images L, a, and b are calculated according to formulas (1), (2), (3), and (4):

[0048]-[0050] [Formulas (1) to (4): equation images not recovered in this text]

[00...
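The patent's own formulas (1) to (4) are not available in this text, so the conversion step can only be illustrated with the standard CIE definition. The following is a minimal sketch, assuming 8-bit sRGB input and a D65 white point; the function names `rgb_to_lab`, `_srgb_to_linear`, and `_f` are hypothetical, and the patent may use a different or simplified formulation.

```python
def _srgb_to_linear(c8):
    """Undo the sRGB gamma curve for one 8-bit channel value (assumed input encoding)."""
    c = c8 / 255.0
    return c / 12.92 if c <= 0.04045 else ((c + 0.055) / 1.055) ** 2.4


def _f(t):
    """Piecewise cube-root function from the CIE L*a*b* definition."""
    delta = 6.0 / 29.0
    if t > delta ** 3:
        return t ** (1.0 / 3.0)
    return t / (3.0 * delta ** 2) + 4.0 / 29.0


def rgb_to_lab(r, g, b):
    """Convert one sRGB pixel (0..255 per channel) to (L, a, b), D65 white point."""
    rl, gl, bl = (_srgb_to_linear(v) for v in (r, g, b))
    # Linear RGB -> CIE XYZ (standard sRGB/D65 matrix).
    x = 0.4124564 * rl + 0.3575761 * gl + 0.1804375 * bl
    y = 0.2126729 * rl + 0.7151522 * gl + 0.0721750 * bl
    z = 0.0193339 * rl + 0.1191920 * gl + 0.9503041 * bl
    # Normalize by the D65 reference white.
    fx, fy, fz = _f(x / 0.95047), _f(y / 1.0), _f(z / 1.08883)
    lightness = 116.0 * fy - 16.0
    a_chan = 500.0 * (fx - fy)
    b_chan = 200.0 * (fy - fz)
    return lightness, a_chan, b_chan
```

Applied to every pixel, this yields the three channel images L, a, and b. Under this definition a yellowish pixel, which is typical of aged paper, has a positive b value, while neutral grays have a and b close to zero.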


Abstract

The invention discloses a method for removing stains from images of ancient documents. The method comprises the following steps: converting the image to be processed from the RGB color space to the Lab color space, separating it into the three channel images of the Lab color space, and selecting the L channel and the b channel, based on the information carried by each separated channel image, so as to weaken or eliminate the influence of stains; automatically determining image text blocks according to the number and size of the characters in the stain-eliminated image, and judging whether each image block needs to be expanded; and applying combined global and local binarization to the image blocks to obtain a binary image. Because the three channel images of the Lab color space are separated and two of the channels are fused, the method solves the problem of removing stains from ancient document images. During binarization, global and local thresholding are combined through automatic image blocking, so the stains in ancient document images are removed effectively and the binarization effect is good.
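The abstract does not state how the global and local results are combined or how blocks are chosen, so the sketch below shows only one plausible combination rule for a single image block. It assumes the block is a list of rows of 8-bit gray values, that the caller supplies a global threshold `global_t` (for example from Otsu's method), and that the local threshold is the Sauvola formula T = m(1 + k(s/R - 1)), with m and s the mean and standard deviation in a window. The names `sauvola_threshold` and `binarize_combined` and the defaults for `window`, `k`, and `r` are illustrative assumptions, not the patent's procedure.

```python
import math


def sauvola_threshold(gray, row, col, window=15, k=0.2, r=128.0):
    """Sauvola threshold T = m * (1 + k * (s / r - 1)) for the window around (row, col)."""
    half = window // 2
    height, width = len(gray), len(gray[0])
    # Collect the window's pixels, clipped at the block borders.
    vals = [gray[y][x]
            for y in range(max(0, row - half), min(height, row + half + 1))
            for x in range(max(0, col - half), min(width, col + half + 1))]
    mean = sum(vals) / len(vals)
    std = math.sqrt(sum((v - mean) ** 2 for v in vals) / len(vals))
    return mean * (1.0 + k * (std / r - 1.0))


def binarize_combined(gray, global_t, window=15, k=0.2, r=128.0):
    """Mark a pixel as ink (0) only if it is at or below BOTH thresholds (assumed rule)."""
    out = []
    for row in range(len(gray)):
        out_row = []
        for col in range(len(gray[0])):
            value = gray[row][col]
            local_t = sauvola_threshold(gray, row, col, window, k, r)
            is_ink = value <= global_t and value <= local_t
            out_row.append(0 if is_ink else 255)
        out.append(out_row)
    return out
```

Requiring agreement between the two thresholds is a conservative choice: the global threshold rejects mid-gray stain pixels that a purely local method might keep, while the local threshold adapts to uneven background within the block.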

Description

Technical field

[0001] The invention belongs to the technical field of document analysis and recognition, and in particular relates to a method for removing stains from images of ancient book documents.

Background

[0002] Image binarization is often used in document image restoration. It can be divided into grayscale image binarization and color image binarization. The selection of the threshold has a decisive effect on the binarization result. Because of their age, many pages of ancient books are stained. Removing the stains to obtain usable binary images of Tibetan ancient books, and then performing document segmentation and character recognition, is an important step.

[0003] At present, threshold-based binarization methods can be divided into global threshold methods and local threshold methods. The global threshold method uses a single threshold for the entire image, and judges the target and background by comparing all pixel v...
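The global threshold method described in the background can be illustrated with Otsu's method, which the source names as prior art. This is a minimal sketch, assuming the image is given as a flat list of 8-bit gray values; the function names `otsu_threshold` and `binarize_global` are hypothetical.

```python
def otsu_threshold(pixels):
    """Return the threshold t that maximizes between-class variance.

    Pixels with value <= t form the dark class, the rest the bright class.
    """
    hist = [0] * 256
    for p in pixels:
        hist[p] += 1
    total = len(pixels)
    sum_all = sum(i * h for i, h in enumerate(hist))

    best_t, best_var = 0, -1.0
    w0 = 0      # number of pixels in the dark class so far
    sum0 = 0    # sum of gray values in the dark class so far
    for t in range(256):
        w0 += hist[t]
        sum0 += t * hist[t]
        if w0 == 0:
            continue
        w1 = total - w0
        if w1 == 0:
            break
        mu0 = sum0 / w0
        mu1 = (sum_all - sum0) / w1
        # Proportional to the between-class variance (constant factor dropped).
        var_between = w0 * w1 * (mu0 - mu1) ** 2
        if var_between > best_var:
            best_var, best_t = var_between, t
    return best_t


def binarize_global(pixels, t):
    """Apply one threshold to the whole image: 0 for dark pixels, 255 otherwise."""
    return [0 if p <= t else 255 for p in pixels]
```

Because a single threshold is applied everywhere, a stain whose gray level falls between the ink and the paper ends up on one side of the threshold across the entire page, which is the limitation the background section attributes to such methods.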


Application Information

IPC(8): G06T7/11, G06T5/50
Inventor: 王维兰, 韩跃辉, 王轶群
Owner: NORTHWEST UNIVERSITY FOR NATIONALITIES