Method for extracting text in complex background image

A text extraction and complex background technology, applied in the field of image processing, can solve problems such as difficult positioning

Active Publication Date: 2013-08-28
北京百驰数据服务有限公司
View PDF2 Cites 39 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

These methods have their own advantages and disadvantages in the application of complex background

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for extracting text in complex background image
  • Method for extracting text in complex background image
  • Method for extracting text in complex background image

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0079] Below in conjunction with accompanying drawing, the text extraction method in a kind of complex background image that the present invention proposes is described in detail:

[0080]The text extraction method in the complex background image of the present invention realizes the whole process of the text extraction method in the complex background image with the C++ programming language through the VS2010 platform in the Windows operating system. We select a network image containing text with a size of 512*512 as the source image, and use this as an example to locate and extract the text in the image based on the method proposed by the present invention, and check its extraction effect. figure 1 It is an overall flow chart of the inventive method, and the concrete steps are as follows:

[0081] Step 1: Use the weighted average method to grayscale the source image src to obtain a grayscale image Img. The formula for calculating the grayscale value of each point is:

[008...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a method for extracting a text in a complex background image. The method comprises the steps of firstly, using the susan operator to detect and identify angular points in a source image, after removing isolated angular points, conducting integral projection transformation to cut out suspected text regions, and screening and removing non-text regions according to priori knowledge; then, judging the background complexity of a text region by the utilization of gray level jump information, when a background is judged to be complex, conducting color clustering on the text region by the utilization of the kmeans clustering algorithm, and determining the type which a text belongs to and extracting the text according to the color information at the position where the angular points are the densest; when the background is judged to be simple, conducting binaryzation on the image by the utilization of a largest-between-class variance method; finally, realizing accurate extraction of the text region. The text extraction method can locate the text region in the complex background image, and finally extracts characters after removing the background.

Description

technical field [0001] The invention belongs to the technical field of image processing, in particular to a text extraction method in complex background images. Background technique [0002] In recent years, with the rapid development of network technology and multimedia technology, network culture with the network as the carrier is becoming a new trend in the current cultural development, followed by digital information such as plain text, digital images, videos, etc. The level of growth is increasing, which has a major impact on people's lives. There is a large amount of data in this information, including not only information that is beneficial to people, but also more and more obscene, violent, and reactionary information. It is obviously unrealistic to rely on manual detection for these information detection tasks, and it is necessary for computers to be able to automatically identify and detect them. At present, the text recognition technology is relatively mature. T...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06K9/00G06K9/46
Inventor 达飞鹏刘超饶立李燕春吕江昭王辰星何学勇
Owner 北京百驰数据服务有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products