Video caption text extraction and identification method

A text extraction and recognition method, applied in the fields of character and pattern recognition, instruments, and computer components, that addresses problems such as poor adaptability, low resolution, character segmentation errors, and character image noise, and achieves the effect of avoiding uneven character spacing.

Active Publication Date: 2012-01-25
北京中科阅深科技有限公司

AI Technical Summary

Problems solved by technology

In general, existing video subtitle text extraction and recognition technology is imperfect, mainly in the following respects: it cannot handle complex and variable backgrounds, especially backgrounds whose color is similar or even identical to that of the foreground; it adapts poorly to varied fonts such as Song, Hei, and Kai; and character recognition relies on conventional OCR, which does not adequately account for character segmentation errors, character image noise, and low resolution.




Embodiment Construction

[0031] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be described in further detail below in conjunction with specific embodiments and with reference to the accompanying drawings.

[0032] The present invention can be implemented on computing devices such as personal computers and servers.

[0033] The technical solution adopted by the present invention is: divide the subtitle area in the video into lines, automatically judge the foreground color of each line and generate a binary image, then segment and recognize the characters in the binary image to obtain the final text recognition result. Locating subtitles within the video is not part of the present invention; it is assumed that the subtitle area has already been located by other methods. A minimal illustrative sketch of this pipeline is given below.
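The following Python/OpenCV sketch makes the line-by-line flow concrete. It is illustrative only and not the patent's exact algorithm: the taller-than-wide test for vertical captions, Otsu thresholding, projection-profile line splitting, the border-based foreground color check, and the external recognize_line callback are all assumptions standing in for details not spelled out at this point in the text.

```python
# Illustrative sketch only (not the patent's exact algorithm): an OpenCV-based
# approximation of the described line-by-line pipeline. The subtitle region is
# assumed to be already located by other means.
import cv2
import numpy as np

def process_subtitle_region(region_bgr, recognize_line):
    """region_bgr: cropped subtitle area; recognize_line: external character
    segmentation + OCR callback returning a string for one binary line image."""
    gray = cv2.cvtColor(region_bgr, cv2.COLOR_BGR2GRAY)

    # Assumed direction test: a vertically arranged caption is taller than wide,
    # so rotate it 90 degrees counterclockwise into a horizontal line.
    if gray.shape[0] > gray.shape[1]:
        gray = cv2.rotate(gray, cv2.ROTATE_90_COUNTERCLOCKWISE)

    # Split into single-line images with a horizontal projection profile (assumption).
    binary = cv2.threshold(gray, 0, 255, cv2.THRESH_BINARY + cv2.THRESH_OTSU)[1]
    ink_rows = np.where((binary > 0).sum(axis=1) > 0.05 * binary.shape[1])[0]
    groups = (np.split(ink_rows, np.where(np.diff(ink_rows) > 1)[0] + 1)
              if ink_rows.size else [])

    texts = []
    for rows in groups:
        line = gray[rows[0]:rows[-1] + 1, :]
        line_bin = cv2.threshold(line, 0, 255,
                                 cv2.THRESH_BINARY + cv2.THRESH_OTSU)[1]
        # Judge foreground color: if most border pixels are white, the text is
        # dark on a light background, so invert to keep characters as foreground.
        border = np.concatenate([line_bin[0, :], line_bin[-1, :],
                                 line_bin[:, 0], line_bin[:, -1]])
        if border.mean() > 127:
            line_bin = cv2.bitwise_not(line_bin)
        texts.append(recognize_line(line_bin))
    return texts
```

Given a located subtitle region and an OCR callback, process_subtitle_region(region, my_ocr) would return one recognized string per detected text line.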

[0034] Referring to Figure 1, the video subtitle text extraction and recognition method proposed by the present invention specifically...


Abstract

The invention discloses a video caption text extraction and identification method comprising the following steps: inputting an image of a caption area in a video; converting the input image to grayscale; judging the arrangement direction of the characters in the caption area; rotating vertically arranged caption areas 90 degrees counterclockwise to obtain horizontal caption areas; splitting the caption area into lines to obtain single-line caption images; automatically judging the foreground color of each single-line caption image to obtain binary single-line caption images; over-segmenting the binary single-line caption images to obtain character segment sequences; and performing text line recognition on the over-segmented binary single-line caption images. With this method, horizontal and vertical video caption text lines can be segmented effectively, the foreground colors of the characters can be judged accurately, noise can be filtered out, and accurate character segmentation and recognition results can be obtained; the method is applicable to many purposes such as editing, indexing, and retrieving video and image content.
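The over-segmentation step mentioned in the abstract can be illustrated with a small sketch operating on a binary single-line caption image (foreground = 255). The vertical-projection heuristic and the min_gap merging parameter below are assumptions for illustration, not the patent's criterion; the point is only that this step deliberately produces more candidate cuts than there are characters, leaving the final grouping to the text line recognition stage.

```python
# Minimal sketch of an over-segmentation step on a binary single-line caption
# image (foreground = 255). Cutting at every blank column of the vertical
# projection is an assumed heuristic that yields more candidate segments than
# true characters.
import numpy as np

def over_segment(line_bin, min_gap=1):
    """Return (start, end) column spans of candidate character segments."""
    profile = (line_bin > 0).sum(axis=0)          # foreground pixels per column
    is_ink = profile > 0

    segments, start = [], None
    for col, ink in enumerate(is_ink):
        if ink and start is None:
            start = col                            # segment opens at first ink column
        elif not ink and start is not None:
            segments.append((start, col))          # segment closes at a blank column
            start = None
    if start is not None:
        segments.append((start, len(is_ink)))

    # Merge spans separated by gaps narrower than min_gap (e.g. broken strokes).
    merged = []
    for s, e in segments:
        if merged and s - merged[-1][1] < min_gap:
            merged[-1] = (merged[-1][0], e)
        else:
            merged.append((s, e))
    return merged
```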

Description

Technical field

[0001] The invention belongs to the field of pattern recognition and computer vision, and in particular relates to a processing method for detecting and recognizing text in video images.

Background technique

[0002] As one of the most popular forms of media, video is widely disseminated through the Internet and television. To let users find the video content they are interested in more conveniently and quickly, video retrieval and classification have gradually become a research hotspot in pattern recognition and computer vision. Among the available cues, the text information in the video, and subtitle information in particular, has the most significant effect on video retrieval and classification. This is because: (1) the text information in the video is closely related to the current content of the video; (2) the characters in the video have very distinctive visual features and are easy to extract; (3) character recognition (OCR) technology is relatively Recognit...


Application Information

Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/20G06K9/32G06K9/46
Inventor 刘成林白博殷飞
Owner 北京中科阅深科技有限公司