Video text extraction method
A video text extraction technology in the fields of instruments, character and pattern recognition, and computer components. It addresses the problem of time-consuming processing and improves quality, the text recognition rate, and system efficiency.
Examples
Embodiment 1
[0043] This embodiment provides a method for video text extraction, comprising the following steps:
[0044] S101, dividing the video sequence into individual shots, so that the text tracking in step S103 can be carried out within a shot instead of across the entire video sequence. This simplifies the problem and supports parallel text positioning and tracking, greatly improving system efficiency.
[0045] A shot is the video image sequence recorded by a single continuous camera operation. A shot boundary is the point where one shot switches to the next and the video content changes; that is, the shot boundary reflects a discontinuity in the video content.
[0046] S102, detecting and locating the specific position of candidate text in a single video frame, while minimizing the number of false text regions.
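The idea that a shot boundary marks a discontinuity in video content can be illustrated with a minimal sketch. The excerpt does not say how the method detects boundaries; the histogram comparison, the function names, and the 0.5 threshold below are assumptions chosen only for illustration. Frames are modelled as flat lists of grayscale values.

```python
from typing import List, Sequence, Tuple

Frame = Sequence[int]  # flattened grayscale pixels in the range 0..255


def gray_histogram(frame: Frame, bins: int = 16) -> List[float]:
    """Return the normalized intensity histogram of one frame."""
    hist = [0.0] * bins
    for pixel in frame:
        hist[min(pixel * bins // 256, bins - 1)] += 1.0
    total = float(len(frame)) or 1.0  # avoid division by zero on empty frames
    return [count / total for count in hist]


def histogram_distance(a: Sequence[float], b: Sequence[float]) -> float:
    """Half the L1 distance: 0.0 for identical histograms, 1.0 for disjoint ones."""
    return 0.5 * sum(abs(x - y) for x, y in zip(a, b))


def split_into_shots(frames: Sequence[Frame], threshold: float = 0.5) -> List[Tuple[int, int]]:
    """Return (start, end) frame index pairs, end exclusive, one pair per shot.

    A boundary is declared wherever consecutive frames differ by more than
    `threshold` (a hypothetical value that would need tuning per program).
    """
    if not frames:
        return []
    shots = []
    start = 0
    previous = gray_histogram(frames[0])
    for index in range(1, len(frames)):
        current = gray_histogram(frames[index])
        if histogram_distance(previous, current) > threshold:
            shots.append((start, index))
            start = index
        previous = current
    shots.append((start, len(frames)))
    return shots
```

Because each returned range is independent of the others, later per-shot steps could be run in parallel, which is the efficiency argument made in S101.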
[0047] S103, on the basis of the text positioning, the text is tracked inside the video shot, and the text area sequence of the sa...
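Tracking located text within a shot can be sketched as linking bounding boxes from frame to frame. The excerpt is truncated before it describes the actual tracking rule, so the greedy intersection-over-union (IoU) matching, the `min_iou` value, and all names below are assumptions rather than the patented procedure.

```python
from typing import List, Tuple

Box = Tuple[int, int, int, int]  # (x1, y1, x2, y2), with x2 > x1 and y2 > y1
Track = List[Tuple[int, Box]]    # (frame index, box) entries for one text object


def iou(a: Box, b: Box) -> float:
    """Intersection-over-union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0, ix2 - ix1) * max(0, iy2 - iy1)
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


def track_text(detections: List[List[Box]], min_iou: float = 0.5) -> List[Track]:
    """Greedily link per-frame text boxes of one shot into tracks.

    `detections[i]` holds the candidate text boxes found in frame i.
    A track ends as soon as no box in the next frame overlaps it enough.
    """
    tracks: List[Track] = []
    active: List[int] = []  # indices of tracks extended in the previous frame
    for frame_index, boxes in enumerate(detections):
        unmatched = list(boxes)
        next_active = []
        for track_id in active:
            last_box = tracks[track_id][-1][1]
            best = max(unmatched, key=lambda b: iou(last_box, b), default=None)
            if best is not None and iou(last_box, best) >= min_iou:
                tracks[track_id].append((frame_index, best))
                unmatched.remove(best)
                next_active.append(track_id)
        for box in unmatched:  # boxes with no predecessor start new tracks
            tracks.append([(frame_index, box)])
            next_active.append(len(tracks) - 1)
        active = next_active
    return tracks
```

Each resulting track is the sequence of regions belonging to one text object across consecutive frames.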
Embodiment 2
[0051] The present invention aims to extract structured text information from video programs. The main stages are segmentation, positioning, tracking, recognition, and layout analysis.
[0052] The system uses a "semi-automatic" strategy to process video: the video to be processed must be supplied to the system together with a corresponding configuration file.
[0053] There are many kinds of video programs, and their production styles differ greatly, for example in whether the stroke width of characters is uniform, in the color contrast between characters and background, and in how the characters are arranged. It is impossible to find a general text feature and processing method suitable for all video text information.
[0054] Algorithms and their parameters in text processing are highly specific. An algorithm can achieve high performance in a specific application environment, but the performance becomes ...
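The "semi-automatic" strategy, in which each video is supplied with a configuration file, might look like the following sketch. The excerpt does not show the configuration format; the JSON encoding and every field name here are hypothetical, chosen only to mirror the style differences listed in [0053].

```python
import json
from dataclasses import dataclass


@dataclass(frozen=True)
class ProgramConfig:
    """Per-program settings (all field names are illustrative assumptions)."""
    program_name: str
    uniform_stroke_width: bool           # is the character stroke width uniform?
    text_brighter_than_background: bool  # polarity of the text/background contrast
    horizontal_layout: bool              # are the characters arranged horizontally?


def load_config(text: str) -> ProgramConfig:
    """Parse a JSON configuration string into a ProgramConfig.

    Unknown or missing keys raise TypeError, so a mistyped setting
    fails early instead of silently falling back to a default.
    """
    return ProgramConfig(**json.loads(text))
```

Keeping the program-specific choices in data rather than code lets the same pipeline be reused across programs with different production styles.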
Embodiment 3
[0150] Embodiment 3 of the present invention provides a system for video text extraction, including:
[0151] Video shot segmentation module 11, for dividing the video sequence into single shots;
[0152] Text location module 12, for detecting and locating the specific position of candidate text in a single video frame sequence;
[0153] The text tracking module 13 is used to track the text inside the video shot on the basis of the text positioning, so as to obtain the text region sequence of the same text object in continuous multi-frames;
[0154] Enhancement and binarization module 14, for utilizing the text sequence obtained by text tracking to enhance the text, suppress the background, and then perform binarization to obtain a binarized text image;
[0155] The text recognition module 15 is configured to perform text recognition on the binarized text image to obtain character sequence information of the text.
[0156] Preferably, the text positioning module 12 specifically...
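The role of the enhancement and binarization module 14 can be illustrated with a small sketch. The excerpt does not state which enhancement or thresholding techniques are used; pixel-wise averaging of already aligned regions and Otsu's threshold are swapped in here as assumptions, as are all function names. Images are modelled as lists of rows of grayscale values.

```python
from typing import List

Image = List[List[int]]  # rows of grayscale pixels in the range 0..255


def average_frames(regions: List[Image]) -> Image:
    """Pixel-wise mean of equally sized, aligned text regions from one track.

    Stable text pixels keep their value while a changing background is
    smoothed, which raises the text/background contrast.
    """
    if not regions:
        raise ValueError("at least one region is required")
    height, width, count = len(regions[0]), len(regions[0][0]), len(regions)
    return [[round(sum(r[y][x] for r in regions) / count) for x in range(width)]
            for y in range(height)]


def otsu_threshold(image: Image) -> int:
    """Return the threshold that maximizes the between-class variance."""
    hist = [0] * 256
    for row in image:
        for pixel in row:
            hist[pixel] += 1
    total = sum(hist)
    sum_all = sum(value * count for value, count in enumerate(hist))
    best_t, best_var = 0, -1.0
    weight_bg, sum_bg = 0, 0
    for t in range(256):
        weight_bg += hist[t]
        sum_bg += t * hist[t]
        if weight_bg == 0:
            continue
        weight_fg = total - weight_bg
        if weight_fg == 0:
            break
        mean_bg = sum_bg / weight_bg
        mean_fg = (sum_all - sum_bg) / weight_fg
        variance = weight_bg * weight_fg * (mean_bg - mean_fg) ** 2
        if variance > best_var:
            best_var, best_t = variance, t
    return best_t


def binarize(image: Image, threshold: int) -> Image:
    """Map pixels above the threshold to 1 and all others to 0."""
    return [[1 if pixel > threshold else 0 for pixel in row] for row in image]
```

This sketch assumes bright text on a darker background; for dark text the comparison in `binarize` would be inverted. The binarized image is what a recognition stage such as module 15 would consume.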