Off-line handwritten Chinese character recognition method carrying out data expansion based on deformation method

A technology of data expansion and Chinese character recognition, which is applied in character and pattern recognition, neural learning methods, computer components, etc., can solve problems such as complex, strong, and inapplicable methods, and achieve high classification performance and improved robustness sexual effect

Inactive Publication Date: 2017-02-15
SOUTH CHINA UNIV OF TECH
View PDF1 Cites 24 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The above method has high requirements on the order of magnitude of the data set samples, and requires a large amount of data to train the network model, which does not work well for small orders of magnitude samples.
Moreover, the method is complicated. For different data sets, the effect of the model is somewhat different, and it does not have good universality.
Therefore, for the classification of small samples of offline handwritten Chinese characters, there is currently no method with strong recognition ability

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Off-line handwritten Chinese character recognition method carrying out data expansion based on deformation method
  • Off-line handwritten Chinese character recognition method carrying out data expansion based on deformation method
  • Off-line handwritten Chinese character recognition method carrying out data expansion based on deformation method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0025] The specific implementation of the present invention will be further described below in conjunction with accompanying drawing and example, but the implementation and protection of the present invention are not limited to this, it should be pointed out that if there are no special details below, those skilled in the art can refer to existing technology achieved.

[0026] The convolutional neural network consists of an input layer, a hidden layer, and an output layer, and the hidden layer mainly includes a convolutional layer, a maximization pooling sampling layer, and a fully connected layer.

[0027] (1) Convolutional layer. The convolutional layer is used to extract basic visual features in the visual receptive field, also known as a feature map, and the operation unit is also called a neuron.

[0028] (2) Maximum sampling layer. Because images have a "static" property, it means that features that are useful in one image region are likely to be equally applicable in ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides an off-line handwritten Chinese character recognition method carrying out data expansion based on a deformation method. The recognition method comprises a step of building a platform on an image processor, wherein the platform is based on a Caffe deep learning framework including a plurality of types of convolution neural network models. Through elastic deformation, shearing and small-angle range rotation, a training data set is expanded, a test data set with tags is prepared, and the training data set is utilized to train the convolution neural network models on the image processor. The training data set is obtained by processing HCL2000 level-1 handwritten Chinese characters. Raw images of handwritten Chinese characters in an HCL2000 database are classified and subjected to elastic deformation, shearing and small-angle range rotation and then input into the convolution neural network models for network training. Finally, unknown Chinese characters are input into the models for testing and recognition results of the Chinese character images are obtained. The recognition method is highly intelligent, accurate in classified recognition, and fast in testing speed. The recognition method is good in recognition performance of low-weight databases and excellent in recognition capability of handwritten Chinese characters.

Description

technical field [0001] The invention belongs to the technical field of image classification, and relates to a method for classifying Chinese characters through data set expansion and based on a convolutional neural network model. Background technique [0002] The problem of offline handwritten Chinese character recognition has always been one of the difficulties in the field of pattern recognition research. Although a lot of related research has been carried out in recent decades, there are still many difficulties to be overcome in the problem of offline handwritten Chinese character recognition. These difficulties mainly come from the following aspects: large-scale Chinese character categories (only the first-level font library of GB2312-80 has 3755 kinds of Chinese characters), intricate Chinese character structure, a large number of similar Chinese characters, and the rotation or rotation of Chinese characters due to different writing habits. deformation. Some non-limit...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/68G06N3/08
CPCG06N3/08G06V30/2455
Inventor 宋旭晨杨雯高学王志鑫丁彦方
Owner SOUTH CHINA UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products