Character extracting method in digital video based on character segmentation and color cluster

A color-clustering technique for digital video, applied in the field of image processing. It addresses problems such as residual background and misclassification that degrade OCR recognition, and it reduces processing complexity and computation while improving recognition results.

Inactive Publication Date: 2009-08-26
BEIJING UNIV OF POSTS & TELECOMM
0 Cites, 62 Cited by

AI Technical Summary

Problems solved by technology

Methods of this type work with complex backgrounds, but when the background contains components whose color is the same as or similar to that of the text, those components are misclassified as text. A large amount of background then remains in the result, which degrades OCR recognition.

Embodiment Construction

[0047] To make the objectives, technical solution, and advantages of the present invention clearer, the invention is described in further detail below with reference to the accompanying drawings.

[0048] Referring to Figure 1, the two specific operation steps of the character extraction method in digital video based on character segmentation and color clustering are as follows:

[0049] Step 1. Character segmentation: An analysis and segmentation method based on vertical projection features is adopted. The feature difference between character areas and inter-character gap areas is used in a vertical projection to segment the character area image: each row image containing multiple characters is segmented into multiple sub-region images that each contain only a single character. This reduces the processing complexity of subsequent steps, improves accuracy, and ensures a high OCR recognition rate;

[0050] Step 2. Character extraction: f...
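The vertical projection idea in Step 1 can be illustrated with a minimal sketch. It assumes the text-line image has already been binarized (1 for character pixels, 0 for background) and that a gap is any column whose projection does not exceed a threshold. The function names, the binarized input, and `gap_threshold` are illustrative assumptions, not details taken from the patent, whose actual projection feature is not given in the text above.

```python
from typing import List, Tuple

Image = List[List[int]]  # rows of 0/1 pixels; 1 = character pixel (assumed input)


def vertical_projection(binary: Image) -> List[int]:
    """Count foreground pixels in each column of a binary text-line image."""
    if not binary:
        return []
    width = len(binary[0])
    return [sum(row[x] for row in binary) for x in range(width)]


def segment_columns(projection: List[int], gap_threshold: int = 0) -> List[Tuple[int, int]]:
    """Return (start, end) column spans, end exclusive, where the projection
    exceeds gap_threshold. Each span is treated as one character."""
    spans: List[Tuple[int, int]] = []
    start = None
    for x, value in enumerate(projection):
        if value > gap_threshold and start is None:
            start = x                      # entering a character region
        elif value <= gap_threshold and start is not None:
            spans.append((start, x))       # leaving it at a gap column
            start = None
    if start is not None:                  # region runs to the right edge
        spans.append((start, len(projection)))
    return spans


def split_characters(binary: Image) -> List[Image]:
    """Cut the line image into one sub-image per detected column span."""
    spans = segment_columns(vertical_projection(binary))
    return [[row[a:b] for row in binary] for a, b in spans]


line = [
    [0, 1, 1, 0, 0, 1, 0],
    [0, 1, 0, 0, 0, 1, 0],
    [0, 1, 1, 0, 0, 1, 0],
]
print(segment_columns(vertical_projection(line)))  # [(1, 3), (5, 6)]
```

A zero threshold splits only at completely empty columns; touching or overlapping characters would need a higher threshold or an additional rule, which this sketch does not attempt.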

Abstract

The invention relates to a character extraction method in digital video based on character segmentation and color clustering, comprising the following steps: (1) character segmentation: the feature difference between character areas and inter-character gap areas is used in a vertical projection to segment the character area image, that is, each row image containing multiple characters is split into multiple sub-area images that each contain only a single character, which reduces the difficulty of subsequent processing and improves OCR recognition accuracy; and (2) character extraction: first, the color features of the characters in the image are used to cluster colors, the image layer containing the most character information is selected as the target layer, and the background area is deleted; then, the connectivity of the characters is used to perform connected-region analysis on the target layer and remove non-character regions. This yields three results: the single-character images, an overall image of the character area, and an overall image spliced from the single-character images. All three are input to an OCR system for recognition; the latter two exploit the OCR system's semantic processing, which can use context to distinguish characters of similar shape and so improve recognition.
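The extraction step described in the abstract (cluster colors, keep one layer, then filter connected regions) can be sketched as follows. Several choices here are assumptions made only for illustration and are not taken from the patent: k-means is used as a stand-in clustering algorithm, the target layer is picked as the one with the fewest pixels on the image border, and regions smaller than `min_area` pixels are treated as non-character. All function names are hypothetical.

```python
from collections import deque
from typing import List, Tuple

Color = Tuple[int, int, int]


def dist2(a, b) -> float:
    """Squared Euclidean distance between two RGB triples."""
    return sum((p - q) ** 2 for p, q in zip(a, b))


def kmeans_labels(image: List[List[Color]], k: int = 2, iters: int = 10) -> List[List[int]]:
    """Assign each pixel to one of k color layers (stand-in clustering)."""
    pixels = [c for row in image for c in row]
    centers = list(dict.fromkeys(pixels))[:k]  # deterministic init: first k distinct colors
    for _ in range(iters):
        groups = [[] for _ in centers]
        for c in pixels:
            groups[min(range(len(centers)), key=lambda i: dist2(c, centers[i]))].append(c)
        centers = [tuple(sum(ch) / len(g) for ch in zip(*g)) if g else centers[i]
                   for i, g in enumerate(groups)]
    return [[min(range(len(centers)), key=lambda i: dist2(c, centers[i])) for c in row]
            for row in image]


def pick_target_layer(labels: List[List[int]]) -> int:
    """Assumed heuristic: the text layer touches the image border least."""
    h, w = len(labels), len(labels[0])
    border = [labels[y][x] for y in range(h) for x in range(w)
              if y in (0, h - 1) or x in (0, w - 1)]
    return min(sorted({v for row in labels for v in row}), key=border.count)


def connected_components(mask: List[List[int]]) -> List[List[Tuple[int, int]]]:
    """4-connected regions of nonzero pixels, found by breadth-first search."""
    h, w = len(mask), len(mask[0])
    seen = [[False] * w for _ in range(h)]
    comps = []
    for y in range(h):
        for x in range(w):
            if mask[y][x] and not seen[y][x]:
                seen[y][x] = True
                queue, comp = deque([(y, x)]), []
                while queue:
                    cy, cx = queue.popleft()
                    comp.append((cy, cx))
                    for ny, nx in ((cy - 1, cx), (cy + 1, cx), (cy, cx - 1), (cy, cx + 1)):
                        if 0 <= ny < h and 0 <= nx < w and mask[ny][nx] and not seen[ny][nx]:
                            seen[ny][nx] = True
                            queue.append((ny, nx))
                comps.append(comp)
    return comps


def extract_character(image: List[List[Color]], k: int = 2, min_area: int = 2) -> List[List[int]]:
    """Binary image of the target layer with small regions removed."""
    labels = kmeans_labels(image, k)
    target = pick_target_layer(labels)
    mask = [[1 if v == target else 0 for v in row] for row in labels]
    out = [[0] * len(row) for row in mask]
    for comp in connected_components(mask):
        if len(comp) >= min_area:          # assumed non-character filter
            for y, x in comp:
                out[y][x] = 1
    return out


B, W = (0, 0, 0), (255, 255, 255)
img = [[B] * 7 for _ in range(5)]
for y in (1, 2, 3):
    img[y][2] = W        # a vertical stroke
img[1][5] = W            # an isolated noise pixel of the same color
result = extract_character(img)
```

In this toy input the noise pixel has the same color as the stroke, so clustering alone cannot remove it; the connected-region filter does. A real filter would likely use size, aspect ratio, or position rather than a bare pixel count.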

Description

Technical field

[0001] The invention relates to a digital image processing method, in particular to a character extraction method in digital video based on character segmentation and color clustering, and belongs to the technical field of image processing.

Background art

[0002] Character information in digital video is an important clue to the high-level semantics of video content, and extracting these characters effectively is a key technology for content understanding and content-based multimedia information retrieval systems. Optical Character Recognition (OCR) technology for characters in binary images is now mature and in commercial use, and characters from many different sources are recognized by OCR systems. Making the input character image meet the recognition requirements of OCR has therefore become a widely accepted goal. At present, most OCR systems are oriented to doc...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06K9/00, G06K9/34
Inventor: 黄晓冬, 马华东, 张赫
Owner: BEIJING UNIV OF POSTS & TELECOMM