Text tracking and multi-frame reinforcing method in video

A video and text technology, applied in the field of text recognition, can solve problems such as high computational complexity, inability to track text correctly, and incomplete airspace information.

Inactive Publication Date: 2008-10-01
BEIHANG UNIV
View PDF1 Cites 15 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

At this time, the SSD-based method will generate a large matching value when matching, and eventually cannot correctly track the position of the text in adjacent frames
In addition, the M-estimated template matching method itself has high computational complexity. Although it can track horizontal or oblique texts with translation, reduction, enlargement, rotation, fade in and fade out, and partial occlusion, the position of the text in adjacent frames can be tracked. It is mainly used to enhance the quality of text in multiple frames, using the temporal features of the text area to eliminate the background, and when the text is occluded or zoomed in, the spatial information of the tracked text pixel itself has become incomplete. quality, it doesn't make much sense to get a high-quality text binary image

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text tracking and multi-frame reinforcing method in video
  • Text tracking and multi-frame reinforcing method in video
  • Text tracking and multi-frame reinforcing method in video

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0042] Before the technical solution of the present invention is described in detail, the Hausdorff distance measure, a key concept involved, will be described first. Given two sets of finite point sets A={a 1 ,...,a p} and B={b 1 ,...,b q}, the Hausdorff distance is defined as:

[0043] H(A,B)=max(h(A,B),h(B,A))

[0044] in:

[0045] h ( A , B ) = max a ∈ A min b ∈ B | | a - b | | , h ( B , A ) = max b ∈ B min a ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a method for tracing text and strengthening multi-frame in a video. The texts in the video are mostly laminated in a complicated background, and if the texts are directly delivered into OCR (Optical Character Recognition) software for recognition, the recognition rate is low, therefore, the text strengthening operations are required to separate the texts from the background. The texts in the video mostly last for tens of frames, and even hundreds of frames, and in the adjacent frames, the colour of the texts are basically invariable but the background changes dynamically, accordingly, abundant complementary information between the multi-frames can be used for strengthening the texts. The method of the invention eliminates the influence of background edge pixel through the text stroke's characteristic of possession of edge pair, then uses Hausdorff distance measurement method for tracing the location of the texts at the adjacent frame, after obtaining a plurality of copies of the text images at the adjacent frame, makes use of minimum pixel searching method to remove the background in order to acquire binary text image including the clean background, greatly improving the rate of OCR software in identifying the video texts.

Description

field of invention [0001] The present invention relates to the technical field of content-based multimedia retrieval, and more specifically, the present invention relates to text recognition in videos. The invention utilizes the feature that the text line in the video scene has a large amount of complementary information between multiple frames, and proposes a text tracking and multi-frame enhancement method in the video. The method of the invention can eliminate the background and improve the optical character recognition (OCR, Optical Character Recognition) recognition rate of the video text. Background of the invention [0002] The text in the video provides information highly related to the video content, such as scene location, event time, and sports scores, player names and other information, but compared with the text in the document image, the text recognition in the video faces the following difficulties : (1) Due to the reason of TV standard, video transmission an...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/32G06K9/72G06T5/20
Inventor 朱成军李超秦思思熊璋
Owner BEIHANG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products