Handwritten Chinese character recognition method based on substructure learning

A recognition method and substructure technology, applied in the field of recognition of handwritten Chinese character strings, can solve the problems of large differences in handwritten Chinese character deformation and handwriting styles, low recognition reliability, and limited string recognition accuracy

Inactive Publication Date: 2013-09-25
TIANJIN NORMAL UNIVERSITY +1
View PDF6 Cites 15 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The Chinese character classifier used in the traditional method is only trained on the Chinese character category, but in practical applications, due to the influence of many factors such as large deformation of handwritten Chinese characters,

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Handwritten Chinese character recognition method based on substructure learning
  • Handwritten Chinese character recognition method based on substructure learning
  • Handwritten Chinese character recognition method based on substructure learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0084] In order to realize the purpose of the invention, the present invention has done the following work:

[0085] 1) An automatic learning method for the substructure of Chinese characters is proposed:

[0086] Consider character segmentation fragments as the substructure of Chinese characters, generate a large number of Chinese character segmentation fragment samples from actual text and character samples, and automatically extract stable character segmentation fragment patterns as Chinese character substructure patterns through cluster analysis of these samples, and The substructure composition information of different Chinese characters is recorded and stored in the substructure dictionary. At the same time, in order to solve the computational difficulty brought by a large number of samples, a two-stage clustering analysis method is proposed, and the substructure learning is divided into two stages: local substructure learning and global substructure learning.

[0087] ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a handwritten Chinese character recognition method based on substructure learning. The handwritten Chinese character recognition method based on substructure learning comprises the following steps of taking a Chinese character segmented fragment as a substructure of a Chinese character, extracting a Chinese character substructure mode from a Chinese character segmented fragment sample, bringing the Chinese character substructure mode into training of a Chinese character classifier, and finally realizing recognition of a handwritten Chinese character string through the combination of substructure recognition information and Chinese character substructure constitution information. The handwritten Chinese character recognition method is based on the characteristic that each Chinese character is composed of one or more substructures. Due to the facts that the Chinese character substructures are extracted, and the Chinese character substructures and the individual Chinese character are simultaneously trained in the Chinese character classifier, the reliability of recognition of the Chinese character segmented fragment is effectively improved in the process of recognition of the handwritten Chinese character string, errors, caused by unreliable recognition of the Chinese character segmented fragment, of recognition of the handwritten Chinese character string are reduced, and the precision of handwritten Chinese character recognition is improved.

Description

technical field [0001] The invention belongs to the technical field of pattern recognition, in particular to a method for recognizing handwritten Chinese character strings. Background technique [0002] Handwritten Chinese recognition technology has a wide range of applications in the fields of office automation, data entry, human-computer interaction, etc. However, the unlimited handwritten Chinese recognition technology is one of the difficulties, which restricts its popularization in actual commercial use, but its good application prospects Make it a hotspot of current technology research. The traditional handwritten Chinese recognition technology adopts the recognition-based segmentation method to realize the recognition of handwritten Chinese character strings. First, the Chinese character string is divided into several character segments. Usually, a character is divided into one or more character segments, thus forming a A sequence of character fragments, different co...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06K9/34G06K9/68
Inventor 朱远平何源孙俊
Owner TIANJIN NORMAL UNIVERSITY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products