Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Method for detecting text in natural scene picture based on local width change of strokes

A technology of stroke width transformation and natural scene, which is applied to computer parts, instruments, calculations, etc., can solve the problems of high false alarm, high calculation cost, and multiple follow-up processing of background edges.

Inactive Publication Date: 2015-07-22
PEKING UNIV
View PDF3 Cites 18 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The principle of the method based on the connected domain is relatively intuitive, and it is only suitable for processing images with a relatively simple background; the basic idea of ​​the method based on the texture analysis is to consider the text area as a special texture object, and its detection accuracy is high, but it needs to be processed on the entire image. Scanning and texture segmentation requires a lot of training in advance, the calculation cost is relatively high, and the detection effect is also related to the training samples; the edge-based method is prone to high false alarms, so how to effectively filter the background edge requires more follow-up processing

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for detecting text in natural scene picture based on local width change of strokes
  • Method for detecting text in natural scene picture based on local width change of strokes
  • Method for detecting text in natural scene picture based on local width change of strokes

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0046] The present invention will be further described in detail below in conjunction with the embodiments and the accompanying drawings.

[0047] figure 1 is the original input image of the present embodiment, the flow process of text detection using the method of the present invention is as follows figure 2 shown. Each picture mentioned below is a two-dimensional matrix, that is, a two-dimensional matrix is ​​used to store the value of each pixel to form a picture, and the pixel value is the value of each position of the matrix.

[0048] The first step is to pre-process the image. The pre-processing steps include image resizing (maintain aspect ratio, shrink to no more than 1600*1200), image smoothing (Gaussian blur), image grayscale processing, and then generate edge map.

[0049] In this embodiment, the image with too high pixels is down-sampled, then the image is smoothed, and then a grayscale method that maintains a high contrast as much as possible is used to generate...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a method for detecting text in a natural scene picture based on local width change of strokes. The method comprises the steps that 1) the natural scene picture to be detected is processed in advanced to obtain a gray scale picture, and an edge graph of the gray scale picture is obtained; 2) a corresponding binary image is generated by binarization according to the edge graph and the gray scale picture; 3) adjacent pixels of the same value are gathered to form a candidate part according to the binary image; 4) the stroke width of the candidate part is calculated to obtain the width of each pixel in stroke; 5) a filter is used to filter the candidate part according to the calculated stroke widths, and a text part is obtained by screening; and 6) the screened text parts are connected to form a text row, and a final result is displayed. The text detection method of the invention is low in calculation cost, high in efficiency and visual, and can effectively improve the text detection accuracy of the natural scene picture.

Description

technical field [0001] The invention belongs to the technical field of pattern recognition in artificial intelligence, and in particular relates to a text detection method in natural scene pictures based on partial stroke width transformation. Background technique [0002] Pictures in natural scenes refer to pictures taken directly from real scenes in life by various shooting devices such as cameras, mobile phones, etc., without specific restrictions. It is necessary to find text areas in natural scene images. These text data carry very useful and important information in people's daily life, such as street signs, billboards, traffic signs, etc., which are very helpful for assisting people to obtain scene information. . The appearance of text is mainly to remind people to pay attention to some useful information. Using portable devices around us to extract, identify and record, and even search for relevant content online can greatly facilitate our lives. Text is the most d...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06K9/54
Inventor 董文妍连宙辉唐英敏肖建国
Owner PEKING UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products