Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Generalized text localization in images

a text localization and image technology, applied in image enhancement, image analysis, instruments, etc., can solve the problems of non-general in some aspects, non-text parts of web pages, and the inability of existing text segmentation and text recognition algorithms to extract tex

Inactive Publication Date: 2002-10-31
INTEL CORP
View PDF0 Cites 94 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, as can be seen from reading their descriptions, they are each non-general in some aspect.
Segmenting and recognizing text in the non-text parts of web pages is also an important issue.
Existing text segmentation and text recognition algorithms cannot extract the text.
Thus, all existing search engines cannot index the content of image-rich web pages properly.
An input layer of 30.times.15 neurons achieved not better classification results, but was computational more expensive.
On the other side, using an input layer with less than 10 rows resulted in substantially worse results.
Again, using more hidden neurons did not result in any performance improvements, while using only one increased the false alarm rate by a factor of three.
While it is straightforward how to get examples for different types of text, it may be more difficult to get a representative non-text set.
The composition of the training set may seriously affect a network's performance.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Generalized text localization in images
  • Generalized text localization in images
  • Generalized text localization in images

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

) gives an example of the Initial text box creation algorithm.

[0061] Initial Text Box Creation Algorithm (Pseudocode Example 1):

1 (1) search for next core pixel and create a new text box of width and height 1. (2) do (3) extendNorth (box) (4) extendEast(box) (5) extendSouth(box) (6) extendWest(box) (7) while (box changed)

[0062] The average intensity of the pixels of the adjacent row above the total width of the box in the overall edge strength image is taken as the criterion for growing in that direction. If the average intensity is larger than th.sub.region=4.5, the row is added to the box. This value is chosen to be a little bit smaller than th.sub.core in order not only to get a text box including the core of a text region, but a text box that encompasses all parts of the text. Next, the same criterion is used to expand the box to the left, bottom, and right. This iterative box expansion repeats as long as the bounding box keeps growing (see Pseudocode Example 1).

[0063] FIG. 3 il...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

In some embodiments, the invention includes a method for locating text in digital images. The method includes scaling a digital image into images of multiple resolutions and classifying whether pixels in the multiple resolutions are part of a text region. The method also includes integrating scales to create a scale integration saliency map and using the saliency map to create initial text bounding boxes through expanding the boxes from rectangles of pixels including at least one pixel to include groups of at least one pixel adjacent to the rectangles, wherein the groups have a particular relationship to a first threshold. The initial text bounding boxes are consolidated. In other embodiments, a method includes classifying whether pixels are part of a text region, creating initial text bounding boxes, and consolidating the initial text bounding boxes, wherein the consolidating includes creating horizontal projection profiles having adaptive thresholds and vertical projection profiles having adaptive thresholds.

Description

[0001] 1. Technical Field of the Invention[0002] The present invention relates generally to localization and / or segmentation of text in images.[0003] 2. Background Art[0004] Existing work on text recognition has focused primarily on optical recognition of characters (called optical character recognition (OCR)) in printed and handwritten documents in answer to the great demand and market for document readers for office automation systems. These systems have attained a high degree of maturity. Further text recognition work can be found in industrial applications, most of which focus on a very narrow application field. An example is the automatic recognition of car license plates.[0005] Proposals have been made regarding text detection in and text extraction from complex images and video. However, as can be seen from reading their descriptions, they are each non-general in some aspect. Further, some do not involve removal of the localized text from its background.[0006] Accordingly, a ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(United States)
IPC IPC(8): G06T5/00G06V30/10
CPCG06K9/325G06T7/0081G06T2207/30176G06T2207/20008G06T2207/20144G06T2207/10016G06T7/11G06T7/194G06V20/62G06V30/10
Inventor LIENHART, RAINER WWERNICKE, AXEL
Owner INTEL CORP
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products