Character recognition method and apparatus

A character recognition and character technology, applied in the field of character recognition, can solve the problems of multi-noise area, OCR recognition difficulty, multiple characters cannot be separated, etc., to achieve the effect of improving recognition performance and great practical value

Inactive Publication Date: 2015-11-11
TENCENT TECH (SHENZHEN) CO LTD
View PDF3 Cites 22 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] 1. For images with uneven illumination, the segmentation effect of existing algorithms is often very poor;
[0007] 2. The segmentation of the text area is incomplete, often showing that a single character is divided into mul

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Character recognition method and apparatus
  • Character recognition method and apparatus
  • Character recognition method and apparatus

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0032] It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0033] The main solution of the embodiment of the present invention is: by acquiring the input text image; performing text line segmentation on the text image to obtain the text line area of ​​the text image; performing character area segmentation on the text line area according to the character attributes of the text to obtain the character area information ; According to the character area information, a single character is segmented in combination with a text image to obtain a character segmentation result, which can improve the accuracy of character recognition compared with the prior art.

[0034] The terminal involved in the hardware operating environment involved in the scheme of the embodiment of the present invention can be a PC, or a smart phone, a tablet computer, an e-book reader, an MP3 (MovingPictureExper...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a character recognition method and apparatus. The method comprises: obtaining an input text image; performing text line segmentation on the text image to obtain a text line region of the text image; performing character region segmentation on the text line region according to text character attributes to obtain character region information; and according to the character region information, performing single character segmentation in combination with the text image to obtain a character segmentation result. According to the character recognition method and apparatus, the text segmentation can be accurately performed, so that the recognition performance of OCR (Optical Character Recognition) is greatly improved; and the scheme has relatively high practical values in various text recognition applications.

Description

technical field [0001] The invention relates to the technical field of character recognition, in particular to a character recognition method and device. Background technique [0002] OCR (Optical Character Recognition, Optical Character Recognition) means that electronic devices (such as scanners or digital cameras) check characters printed on paper, determine their shapes by detecting dark and bright patterns, and then use character recognition methods to translate the shapes into computer text. Process; that is, the process of scanning text data, analyzing and processing image files, and obtaining text and layout information. [0003] Among them, when performing character recognition, image and character segmentation is usually required. Among them, image segmentation refers to dividing the image into several non-overlapping regions according to features such as gray scale, color, texture and shape, and making these features show similarities in the same region, but show...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06K9/20G06K9/34
CPCG06V10/141G06V30/153
Inventor 王红法周龙沙张小鹏
Owner TENCENT TECH (SHENZHEN) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products