Myancanda OCR method based on knowledge distillation

A technology of knowledge and distillation loss, applied in neural learning methods, character recognition, character and pattern recognition, etc., can solve problems such as enhancement and difficult extraction and recognition of complex characters, and achieve the effect of optimizing learning parameters and model weights

Active Publication Date: 2021-01-29
KUNMING UNIV OF SCI & TECH
View PDF4 Cites 7 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] The invention provides a Burmese OCR method based on knowledge distillation, which solves the problem that complex characters in a Burmese image that are nested and combined with multiple characters in a receptive field are difficult to extract and recognize; the invention constructs a teacher network using a CNN+RNN framework and The model architecture in which the student network is trained in

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Myancanda OCR method based on knowledge distillation
  • Myancanda OCR method based on knowledge distillation
  • Myancanda OCR method based on knowledge distillation

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0073] Embodiment 1: as Figure 1-2 As shown, the Burmese OCR method based on knowledge distillation, the method includes:

[0074] Step1. Construct a Burmese image dataset that meets the task requirements, then add noise to the generated image, and finally mark the corresponding label code of the image regularly;

[0075] Step2. Construct the model architecture of student network and teacher network using deep convolutional neural network and cyclic neural network framework;

[0076] Step3. Set the input of the student network and the teacher network. Based on the knowledge distillation method, the teacher network and the student network are jointly trained in an integrated learning manner, and the sub-network integrated by the teacher is coupled with the student network to realize a single receptive field in the student network. Align the corresponding local character image features with the overall character image features in the teacher network to enhance the acquisition ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a Myanmao OCR method based on knowledge distillation. The method comprises the following steps: constructing a Myanmao image data set adapting to task requirements; adding noise to a generated image, and carrying out regularity labeling on a label code corresponding to the image; constructing a model architecture of a student network and a teacher network using a deep convolutional neural network and a recurrent neural network framework; based on a knowledge distillation method, training a teacher network and a student network in an ensemble learning mode, and by coupling of a teacher integrated sub-network and the student network, aligning local character image features corresponding to a single receptive field in the student network and overall character image features in the teacher network. Therefore, the acquisition of the local features in the long-sequence character image is enhanced, and the student network can efficiently identify the Myanhuang language complex scene text image by calling the deployment model on the server. The recognition of the Myanmai image text is realized, the recognition accuracy is high, and the effect is good.

Description

technical field [0001] The invention relates to a Burmese OCR method based on knowledge distillation, belonging to the technical field of natural language processing. Background technique [0002] Burmese text has Zawgyi-One, Myanmar Three and other font encodings. In order to avoid the problem of garbled Chinese and Burmese text content on the Internet, most Burmese text content is presented in the form of pictures. This has brought great difficulties to the research on Burmese-oriented natural language processing, machine translation, and information retrieval. Although the method combined with deep learning has achieved very impressive results in Chinese and English image text recognition tasks, due to the particularity of Burmese characters, as far as I know, there is no relevant research on Burmese OCR research. Burmese OCR research has very important theoretical and practical application value. Burmese language is different from Chinese or English. In a receptive fie...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06K9/20G06K9/62G06F40/126G06N3/04G06N3/08
CPCG06F40/126G06N3/049G06N3/08G06V10/22G06V30/10G06N3/045G06F18/214
Inventor 毛存礼谢旭阳余正涛高盛祥王振晗刘福浩
Owner KUNMING UNIV OF SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products