
Character detection method and system

A text detection technology, applied in image data processing, instruments, computing, etc., that addresses problems such as the accuracy gap between Chinese and English text detection.

Active Publication Date: 2017-08-25
TSINGHUA UNIV +1
6 Cites, 16 Cited by

AI Technical Summary

Problems solved by technology

To recognize text in non-scanned images, the text must first be detected in the image. Current text detection techniques mostly rely on manually specified features to decide whether an image contains text, and most of them target English characters. Because the glyph structures of Chinese and English differ significantly, the accuracy of Chinese detection in document images differs substantially from that of English detection, which makes it difficult to meet the needs of practical applications.



Examples


Embodiment 1

[0083] Referring to Figure 2, the method by which the text detection system of this embodiment detects text comprises the following steps:

[0084] Step 201: perform color reduction on the target image to obtain a color-reduced image of the target image.

[0085] Input the target image to be detected and quantize each of its red, green and blue (RGB) channels to K levels (K is an integer with 255 > K > 1; for example, K = 4). That is, the brightness range of each of the three RGB channels is divided (for example, evenly) into K intervals (bins), reducing the 0-255 brightness levels to levels 0 to K-1, and the brightness of each pixel in each RGB channel is mapped to the bin of the corresponding channel. Because each of the three RGB channels of the target image has 256 brightness levels (0-255), the target image can have 256^3 colors, and afte...
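The per-channel quantization described in [0085] can be sketched as follows. This is a minimal illustration, not the patent's implementation: it assumes the even division the paragraph offers as an example, and the names `quantize_channel` and `reduce_colors` and the nested-list image representation are hypothetical choices made here.

```python
from typing import List, Tuple

Pixel = Tuple[int, int, int]  # (R, G, B), each component in 0..255


def quantize_channel(value: int, k: int) -> int:
    """Map a brightness value in 0..255 to one of k evenly sized bins, 0..k-1."""
    if not 1 < k < 255:
        raise ValueError("K must satisfy 255 > K > 1")
    if not 0 <= value <= 255:
        raise ValueError("brightness must be in 0..255")
    # Even division (an assumption): bin i covers [i * 256 / k, (i + 1) * 256 / k).
    return value * k // 256


def reduce_colors(image: List[List[Pixel]], k: int = 4) -> List[List[Pixel]]:
    """Quantize every RGB channel of every pixel independently to k levels."""
    return [
        [tuple(quantize_channel(c, k) for c in pixel) for pixel in row]
        for row in image
    ]


if __name__ == "__main__":
    sample = [[(0, 128, 255), (63, 64, 200)]]
    print(reduce_colors(sample, k=4))
```

With K levels per channel, a color-reduced image can contain at most K^3 distinct colors, so K = 4 leaves at most 64.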



Abstract

The invention discloses a character detection method and system. The method comprises the following steps: performing color reduction on each of the three color channels of a target image to obtain a color-reduced image, and converting the target image into a binary image; merging connected blocks of the same color in the color-reduced image, and merging connected blocks of the same color in the binary image; separately merging, in the vertical and horizontal directions, the connected blocks of each color channel of the color-reduced image and the connected blocks of the binary image to obtain candidate character areas in the target image; and extracting a specific area at the position in the target image that corresponds to each candidate character area, and judging whether the extracted area contains a row or column of characters by comparing the probability that the area contains characters with a preset probability threshold. With this method, text in an image can be detected accurately.
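The connected-block and threshold steps in the abstract can be illustrated with a small sketch. The abstract does not give the connectivity rule, the merge criterion, or how the probability is computed, so everything specific here is an assumption: 4-connectivity, a greedy left-to-right merge governed by a hypothetical `max_gap` parameter, and a `>=` comparison against the threshold. The function names are illustrative only.

```python
from collections import deque
from typing import List, Tuple

Box = Tuple[int, int, int, int]  # (top, left, bottom, right), inclusive


def connected_blocks(grid: List[List[int]], target: int) -> List[Box]:
    """Bounding boxes of 4-connected regions whose cells equal `target`."""
    rows = len(grid)
    cols = len(grid[0]) if grid else 0
    seen = [[False] * cols for _ in range(rows)]
    boxes: List[Box] = []
    for r in range(rows):
        for c in range(cols):
            if grid[r][c] != target or seen[r][c]:
                continue
            top, left, bottom, right = r, c, r, c
            queue = deque([(r, c)])
            seen[r][c] = True
            while queue:  # breadth-first flood fill of one block
                y, x = queue.popleft()
                top, bottom = min(top, y), max(bottom, y)
                left, right = min(left, x), max(right, x)
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    if (0 <= ny < rows and 0 <= nx < cols
                            and not seen[ny][nx] and grid[ny][nx] == target):
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            boxes.append((top, left, bottom, right))
    return boxes


def merge_horizontal(boxes: List[Box], max_gap: int = 1) -> List[Box]:
    """Greedily merge boxes that share rows and lie within max_gap columns.

    Each box is compared only with the most recently merged box, which is
    enough for a sketch but not a complete merging strategy.
    """
    merged: List[Box] = []
    for box in sorted(boxes, key=lambda b: b[1]):
        if merged:
            t, l, b, r = merged[-1]
            rows_overlap = box[0] <= b and t <= box[2]
            if rows_overlap and box[1] - r - 1 <= max_gap:
                merged[-1] = (min(t, box[0]), l, max(b, box[2]), max(r, box[3]))
                continue
        merged.append(box)
    return merged


def contains_text(probability: float, threshold: float) -> bool:
    """Assumed decision rule: accept when the probability reaches the threshold."""
    return probability >= threshold


if __name__ == "__main__":
    binary = [
        [1, 1, 0, 1, 0, 0, 0],
        [1, 0, 0, 1, 0, 0, 1],
    ]
    blocks = connected_blocks(binary, target=1)
    print(blocks)
    print(merge_horizontal(blocks, max_gap=1))
```

A vertical merge would follow the same pattern with rows and columns swapped. The probability passed to `contains_text` would come from whatever classifier scores the extracted area, which this sketch leaves out.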

Description

Technical field

[0001] The invention relates to technology for detecting text in images, and in particular to a text detection method and system.

Background

[0002] A document image is a document in image format: a paper document or the like converted into an image by some method (such as scanning) for electronic reading by users. ... Format) format images and DjVu format images.

[0003] Current text detection technology can detect the text in a document image (that is, locate the areas of the image that bear text) and perform text recognition based on the detected areas.

[0004] Images in the general sense include not only document images but also non-document images (that is, images in non-scanned formats uploaded by users, such as those in web albums), which may be Joint Photographic Experts Group (JPG) images, bitmap (BMP) images, Tagged Image File Format (TIFF) images, Graphics Interchange Format (GIF) images, and Exchangeable...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06T7/00, G06T7/90
CPC: G06T2207/10008, G06T2207/30176, G06T7/00
Inventor: 徐昆, 郭晓威, 黄飞跃, 郑宇飞, 张惜今, 卢艺帆
Owner: TSINGHUA UNIV