Random sample generation method for recognition of complex character

A random sample, text recognition technology, applied in the field of image recognition, can solve problems such as a large number of human annotations

Active Publication Date: 2015-09-09
成都数联铭品科技有限公司
View PDF4 Cites 51 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

By analyzing the reasons for the complexity of the text, a large number of training samples containing various noise and distortion features that can be used by the deep neural network are automatically generated, which solves the problem of requiring a large amount of human labeling when using the deep neural network to recognize text in the prior art , significantly saving labor costs; improving the efficiency of identification

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Random sample generation method for recognition of complex character
  • Random sample generation method for recognition of complex character
  • Random sample generation method for recognition of complex character

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0028] The present invention will be further described in detail below in conjunction with test examples and specific embodiments. However, it should not be understood that the scope of the above subject matter of the present invention is limited to the following embodiments, and all technologies realized based on the content of the present invention belong to the scope of the present invention.

[0029] The purpose of the present invention is to overcome the above-mentioned deficiencies in the prior art, and provide a method for generating random samples for complex character recognition. By analyzing the reasons for the complexity of the text, a large number of training samples containing various noise and distortion features that can be used by the deep neural network are automatically generated, which solves the problem of requiring a large amount of human labeling when using the deep neural network to recognize text in the prior art , significantly saving labor costs.

...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to the field of image recognition, and especially relates to a random sample generation method for the recognition of a complex character. In the recognition of the complex character, a large number of samples, which are generated by a random sample generator and comprise a to-be-recognized image noise model and a distortion characteristic model are employed through the analysis of the reason of the character complexity on the basis of a standard character similar to a to-be-recognized character. A training sample automatically generated by the random sample generator comprises various types of complex noises and distortions, and can meet the recognition demands of various types of complex characters. The above random sample is enabled to serve as a training sample in a depth neural network, thereby solving a problem that a training neural network needs a large amount of manual marking for character recognition, enabling the automatic recognition of complex characters and images to be more simple and easier, and remarkably saving the related labor cost.

Description

technical field [0001] The invention relates to the field of image recognition, in particular to a random sample generation method for complex character recognition. Background technique [0002] Image recognition is of great significance in the field of intelligent recognition. With the advancement of technology and the development of society, the demand for automatic recognition of text in pictures is also increasing rapidly. Traditional Optical Character Recognition (OCR) systems are often used to identify documents scanned using optical devices, such as digitized ancient books, business cards, invoices, forms, etc. Usually this type of scanned document has a relatively high resolution and contrast, and the printed fonts are generally relatively single and regular, making it easier to extract a single text for recognition. Therefore, the core of this type of document recognition is to eliminate noise. There are many ways to eliminate noise: for example, use Gaussian for ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/00G06K9/62G06N3/02
CPCG06N3/02G06V30/333G06V10/758G06V30/10G06F18/10
Inventor 刘世林何宏靖吴雨浓
Owner 成都数联铭品科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products