Chinese character segmentation method for off-line handwritten Chinese character recognition

A Chinese character and Chinese character recognition technology, applied in the field of Chinese character recognition

Inactive Publication Date: 2012-06-13
SUZHOU UNIV
View PDF2 Cites 26 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0013] The purpose of the present invention is to provide a Chinese character segmentation method for off-line handwritten Chinese characte

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Chinese character segmentation method for off-line handwritten Chinese character recognition
  • Chinese character segmentation method for off-line handwritten Chinese character recognition
  • Chinese character segmentation method for off-line handwritten Chinese character recognition

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0052] Example: see figure 1 Shown is a frame diagram of a handwritten Chinese character segmentation system using the segmentation method of the present invention. The details of each module are as follows:

[0053] (1) Binarization module

[0054] For color or grayscale images, binarization is required to separate the foreground (Chinese characters) from the background. In order to reduce interference information and improve the efficiency of segmentation, it is a necessary module of the system.

[0055] (2) Filtering and denoising module

[0056] The filter denoising module is to filter the noise in the binary image that affects the accuracy of character segmentation, and reduce the impact of noise on Chinese character segmentation.

[0057] (3) Coarse segmentation module based on projection analysis

[0058] The rough segmentation module is the initial segmentation process for the preprocessed binary image. This module adopts the projection analysis algorithm, and us...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a Chinese character segmentation method for off-line handwritten Chinese character recognition. The method is characterized by comprising the following steps: (1) a Chinese character image to be recognized is pre-processed (including binarization of the image); (2) the Chinese character image is roughly segmented on the basis on projection analysis, the non-adherent characters are segmented into single characters and the adherent characters are integrally segmented; (3) the average height of the non-adherent character is acquired; (4) the characters segmented in the step (2) are judged according to the average height of the non-adherent characters, which is acquired in the step (3), so as to obtain an adherent character string set; and (5) each adherent character string in the adherent character string set is re-segmented on the basis of minimum weighting segmentation path, so as to achieve segmentation of the adherent characters. The method is effectively self-adaptive to segmentation of both adherent Chinese characters and non-adherent Chinese characters, and has higher segmentation accuracy and efficiency.

Description

technical field [0001] The invention relates to the field of Chinese character recognition, in particular to the problem of Chinese character segmentation in off-line handwritten Chinese character recognition, especially the segmentation of cohesive character strings. Background technique [0002] As a difficult point in preprocessing, Chinese character segmentation technology has always been an obstacle to the application of offline Chinese character recognition systems. Correct Chinese character recognition is only possible if individual Chinese characters are correctly segmented from the document image. However, due to the arbitrary writing of handwritten Chinese characters and the complexity of the positional relationship between adjacent Chinese characters, handwritten Chinese characters are much more difficult to segment than printed Chinese characters, especially the segmentation of cohesive characters. At present, commonly used segmentation techniques include statis...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06K9/20
Inventor 刘纯平周双飞王朝晖季怡龚声蓉蒋德茂
Owner SUZHOU UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products