Method for extracting text information from adaptive images

A text information extraction technology, applied in the fields of instruments, character and pattern recognition, and computer components. It addresses the problems of existing methods, which cannot be used alone, require a large amount of calculation, and produce wrong extractions. Its effects are improved overall performance, a simple calculation method, and strong versatility.

Inactive Publication Date: 2009-12-30
INST OF AUTOMATION CHINESE ACAD OF SCI

AI Technical Summary

Problems solved by technology

[0005] Among the existing research methods described above, the edge-based text extraction method is simple to calculate, but because it relies on a single edge detection method, detection is poor when the contrast between the text and the background is low, and it also easily produces wrong results. Other information therefore has to be combined to extend the detection range. Texture-based text extraction methods need to extract effective texture features; although they can effectively detect the area where the text is located, they also extract background areas with similar textures, so they need to be combined with other features. The main difficulty of the color-based clustering method is that it is impossible ...



Examples


Embodiment Construction

[0038] Referring to Figure 1, Figure 2 and Figure 3, the adaptive image text information extraction method of the present invention comprises the following steps: 1) image preprocessing; 2) image background complexity analysis; 3) initial text detection; 4) text verification; 5) text extraction; 6) text information output or display.

[0039] The specific steps are:

[0040] A) First, read the image from the selected path and convert the color image to a gray image;
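The conversion in step A) can be sketched as follows. This is a minimal illustration under stated assumptions, not the patent's implementation: the source does not say how the color channels are weighted, so the ITU-R BT.601 luma weights (0.299, 0.587, 0.114) are assumed here, and the function name `to_gray` and the nested-list image representation are hypothetical. Reading the image file from the selected path is left out.

```python
def to_gray(rgb_image):
    """Convert rows of (R, G, B) tuples (0-255) into rows of gray levels (0-255).

    Assumption: BT.601 luma weights. The source does not specify the formula.
    """
    gray = []
    for row in rgb_image:
        gray.append(
            [int(round(0.299 * r + 0.587 * g + 0.114 * b)) for r, g, b in row]
        )
    return gray


if __name__ == "__main__":
    # A tiny 2x2 color image: white, black, pure red, pure blue.
    tiny = [
        [(255, 255, 255), (0, 0, 0)],
        [(255, 0, 0), (0, 0, 255)],
    ]
    print(to_gray(tiny))
```

Any other standard gray conversion could be substituted; the later steps only need a single-channel image.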

[0041] B) Calculate the background complexity of the entire image according to the gray-scale variation density of all pixels in the gray image, and classify the background complexity of the image. The calculation method of image background complexity is as follows:

[0042] The gray intensity $S'$ of a given pixel $P_0$ is calculated according to the following formula:

[0043] $$S' = \max\{\,|P_1 - P_8|,\ |P_2 - P_7|,\ |P_3 - P_6|,\ |P_4 - P_5|\,\} \tag{1}$$
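Formula (1) can be sketched in code as below. The excerpt does not define how P1 to P8 are arranged around P0, so this sketch assumes they are the eight neighbors in raster order (P1 top-left through P8 bottom-right), which makes each pair in the formula a pair of opposite neighbors. The function name `gray_intensity` and the list-of-rows image layout are hypothetical.

```python
def gray_intensity(gray, row, col):
    """Return S' from formula (1) for the interior pixel at (row, col).

    Assumed neighbor layout (not stated in the source):
        P1 P2 P3
        P4 P0 P5
        P6 P7 P8
    """
    height, width = len(gray), len(gray[0])
    if not (1 <= row < height - 1 and 1 <= col < width - 1):
        raise ValueError("pixel must have a full 3x3 neighborhood")

    p1, p2, p3 = gray[row - 1][col - 1], gray[row - 1][col], gray[row - 1][col + 1]
    p4, p5 = gray[row][col - 1], gray[row][col + 1]
    p6, p7, p8 = gray[row + 1][col - 1], gray[row + 1][col], gray[row + 1][col + 1]

    # Largest absolute difference across the four opposite-neighbor pairs.
    return max(abs(p1 - p8), abs(p2 - p7), abs(p3 - p6), abs(p4 - p5))


if __name__ == "__main__":
    patch = [
        [10, 10, 10],
        [10, 50, 200],
        [10, 10, 10],
    ]
    # Only the horizontal pair (P4, P5) differs: |10 - 200| = 190.
    print(gray_intensity(patch, 1, 1))
```

Border pixels are rejected rather than padded because the excerpt does not say how image edges are handled.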

[0044] S = ...



Abstract

The invention discloses a method for extracting text information from adaptive images, and relates to technology for extracting text information from images. The method comprises the following steps: 1) preprocessing the image; 2) analyzing the complexity of the image background; 3) initially detecting text; 4) verifying the text; 5) extracting the text; and 6) outputting or displaying the text information. By computing the background complexity of each image, the method applies different text detection methods to images with different background complexities, which reduces the missed and false detections caused by relying on a single text detection method and improves the overall performance of the text extraction system. The background complexity computation is simple and effective. The method can detect text in images with different background complexities, the detection is not affected by typeface, font size or language, and the method has strong generality.
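The adaptive selection described in the abstract, choosing a detection method according to the background complexity class, can be sketched as a simple dispatch. Everything specific here is an assumption made for illustration: the source excerpt gives neither the number of complexity classes, nor threshold values, nor the detectors used for each class, so the names `classify_complexity` and `detect_text`, the threshold list, and the stub detectors are all hypothetical.

```python
from typing import Callable, List, Sequence, Tuple

Box = Tuple[int, int, int, int]  # hypothetical (left, top, right, bottom) box
Detector = Callable[[List[List[int]]], List[Box]]


def classify_complexity(complexity: float, thresholds: Sequence[float]) -> int:
    """Return the class index: how many thresholds the value reaches."""
    return sum(1 for t in thresholds if complexity >= t)


def detect_text(
    gray: List[List[int]],
    complexity: float,
    thresholds: Sequence[float],
    detectors: Sequence[Detector],
) -> List[Box]:
    """Run the detector registered for the image's complexity class."""
    if len(detectors) != len(thresholds) + 1:
        raise ValueError("need exactly one detector per complexity class")
    return detectors[classify_complexity(complexity, thresholds)](gray)


if __name__ == "__main__":
    # Stub detectors standing in for real methods; they return fixed boxes.
    simple = lambda img: [(0, 0, 1, 1)]
    medium = lambda img: [(0, 0, 2, 2)]
    complex_ = lambda img: [(0, 0, 3, 3)]

    image = [[0, 0], [0, 0]]
    # Illustrative thresholds only; the source gives no values.
    print(detect_text(image, 0.7, [0.3, 0.6], [simple, medium, complex_]))
```

Passing the thresholds and detectors in as arguments keeps the sketch independent of any particular complexity measure or detection method.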

Description

Technical field

[0001] The invention relates to text information extraction within the field of pattern recognition and machine vision, and in particular to an adaptive image text information extraction method.

Background technique

[0002] With the wide application of image acquisition equipment such as digital cameras, cameras, and ultra-high-speed scanners, the information in images has attracted more and more attention. However, it is still difficult for computers to understand the content of images. The text embedded in an image, for example on book covers, in video, in natural scene pictures, or in color pictures on WWW web pages, can provide important information that people want and is of great help in understanding the content of the image. Enabling computers to recognize the text in an image as a human does, that is, automatic text detection, has attracted more and more attention in recent years. It has extremely important signi...


Application Information

IPC(8): G06K9/20, G06K9/46
Inventor: 李敏花, 肖柏华, 王春恒, 戴汝为
Owner: INST OF AUTOMATION CHINESE ACAD OF SCI