Text tracking and multi-frame reinforcing method in video

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A video and text technology, applied in the field of text recognition, can solve problems such as high computational complexity, inability to track text correctly, and incomplete airspace information.

Inactive Publication Date: 2008-10-01

BEIHANG UNIV

View PDF1 Cites 15 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

At this time, the SSD-based method will generate a large matching value when matching, and eventually cannot correctly track the position of the text in adjacent frames

In addition, the M-estimated template matching method itself has high computational complexity. Although it can track horizontal or oblique texts with translation, reduction, enlargement, rotation, fade in and fade out, and partial occlusion, the position of the text in adjacent frames can be tracked. It is mainly used to enhance the quality of text in multiple frames, using the temporal features of the text area to eliminate the background, and when the text is occluded or zoomed in, the spatial information of the tracked text pixel itself has become incomplete. quality, it doesn't make much sense to get a high-quality text binary image

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0042] Before the technical solution of the present invention is described in detail, the Hausdorff distance measure, a key concept involved, will be described first. Given two sets of finite point sets A={a 1 ,...,a p} and B={b 1 ,...,b q}, the Hausdorff distance is defined as:

[0043] H(A,B)=max(h(A,B),h(B,A))

[0044] in:

[0045] h ( A , B ) = max a ∈ A min b ∈ B | | a - b | | , h ( B , A ) = max b ∈ B min a ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention relates to a method for tracing text and strengthening multi-frame in a video. The texts in the video are mostly laminated in a complicated background, and if the texts are directly delivered into OCR (Optical Character Recognition) software for recognition, the recognition rate is low, therefore, the text strengthening operations are required to separate the texts from the background. The texts in the video mostly last for tens of frames, and even hundreds of frames, and in the adjacent frames, the colour of the texts are basically invariable but the background changes dynamically, accordingly, abundant complementary information between the multi-frames can be used for strengthening the texts. The method of the invention eliminates the influence of background edge pixel through the text stroke's characteristic of possession of edge pair, then uses Hausdorff distance measurement method for tracing the location of the texts at the adjacent frame, after obtaining a plurality of copies of the text images at the adjacent frame, makes use of minimum pixel searching method to remove the background in order to acquire binary text image including the clean background, greatly improving the rate of OCR software in identifying the video texts.

Description

field of invention [0001] The present invention relates to the technical field of content-based multimedia retrieval, and more specifically, the present invention relates to text recognition in videos. The invention utilizes the feature that the text line in the video scene has a large amount of complementary information between multiple frames, and proposes a text tracking and multi-frame enhancement method in the video. The method of the invention can eliminate the background and improve the optical character recognition (OCR, Optical Character Recognition) recognition rate of the video text. Background of the invention [0002] The text in the video provides information highly related to the video content, such as scene location, event time, and sports scores, player names and other information, but compared with the text in the document image, the text recognition in the video faces the following difficulties : (1) Due to the reason of TV standard, video transmission an...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & AuthorityApplications(China)

IPC IPC(8): G06K9/32G06K9/72G06T5/20

Inventor朱成军李超秦思思熊璋

OwnerBEIHANG UNIV

Text tracking and multi-frame reinforcing method in video

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology