Generalized text localization in images

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
a text localization and image technology, applied in image enhancement, image analysis, instruments, etc., can solve the problems of non-general in some aspects, non-text parts of web pages, and the inability of existing text segmentation and text recognition algorithms to extract tex

Inactive Publication Date: 2002-10-31

INTEL CORP

View PDF0 Cites 94 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

However, as can be seen from reading their descriptions, they are each non-general in some aspect.

Segmenting and recognizing text in the non-text parts of web pages is also an important issue.

Existing text segmentation and text recognition algorithms cannot extract the text.

Thus, all existing search engines cannot index the content of image-rich web pages properly.

An input layer of 30.times.15 neurons achieved not better classification results, but was computational more expensive.

On the other side, using an input layer with less than 10 rows resulted in substantially worse results.

Again, using more hidden neurons did not result in any performance improvements, while using only one increased the false alarm rate by a factor of three.

While it is straightforward how to get examples for different types of text, it may be more difficult to get a representative non-text set.

The composition of the training set may seriously affect a network's performance.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

) gives an example of the Initial text box creation algorithm.

[0061] Initial Text Box Creation Algorithm (Pseudocode Example 1):

1 (1) search for next core pixel and create a new text box of width and height 1. (2) do (3) extendNorth (box) (4) extendEast(box) (5) extendSouth(box) (6) extendWest(box) (7) while (box changed)

[0062] The average intensity of the pixels of the adjacent row above the total width of the box in the overall edge strength image is taken as the criterion for growing in that direction. If the average intensity is larger than th.sub.region=4.5, the row is added to the box. This value is chosen to be a little bit smaller than th.sub.core in order not only to get a text box including the core of a text region, but a text box that encompasses all parts of the text. Next, the same criterion is used to expand the box to the left, bottom, and right. This iterative box expansion repeats as long as the bounding box keeps growing (see Pseudocode Example 1).

[0063] FIG. 3 il...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

In some embodiments, the invention includes a method for locating text in digital images. The method includes scaling a digital image into images of multiple resolutions and classifying whether pixels in the multiple resolutions are part of a text region. The method also includes integrating scales to create a scale integration saliency map and using the saliency map to create initial text bounding boxes through expanding the boxes from rectangles of pixels including at least one pixel to include groups of at least one pixel adjacent to the rectangles, wherein the groups have a particular relationship to a first threshold. The initial text bounding boxes are consolidated. In other embodiments, a method includes classifying whether pixels are part of a text region, creating initial text bounding boxes, and consolidating the initial text bounding boxes, wherein the consolidating includes creating horizontal projection profiles having adaptive thresholds and vertical projection profiles having adaptive thresholds.

Description

[0001] 1. Technical Field of the Invention[0002] The present invention relates generally to localization and / or segmentation of text in images.[0003] 2. Background Art[0004] Existing work on text recognition has focused primarily on optical recognition of characters (called optical character recognition (OCR)) in printed and handwritten documents in answer to the great demand and market for document readers for office automation systems. These systems have attained a high degree of maturity. Further text recognition work can be found in industrial applications, most of which focus on a very narrow application field. An example is the automatic recognition of car license plates.[0005] Proposals have been made regarding text detection in and text extraction from complex images and video. However, as can be seen from reading their descriptions, they are each non-general in some aspect. Further, some do not involve removal of the localized text from its background.[0006] Accordingly, a ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(United States)

IPC IPC(8): G06T5/00G06V30/10

CPCG06K9/325G06T7/0081G06T2207/30176G06T2207/20008G06T2207/20144G06T2207/10016G06T7/11G06T7/194G06V20/62G06V30/10

Inventor LIENHART, RAINER WWERNICKE, AXEL

Owner INTEL CORP

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Generalized text localization in images

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology