Hamming OCR method based on locality sensitive Hash network

A technology of local sensitive hashing and recognition methods, applied in the field of text recognition, can solve problems such as difficult deployment of models, bulky models, consumption of storage resources and computing space, etc.

Pending Publication Date: 2021-05-11
FOSHAN NANHAI GUANGDONG TECH UNIV CNC EQUIP COOP INNOVATION INST +1
View PDF1 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Although scene text recognition has achieved certain breakthroughs and produced many representative models, most of these models are cumbersome.
The general scene text recognition model uses one-hot encoding to encode each character in the final regression classification. When the vocabulary is large, the mapping matrix generated by the character embedding layer and regression classification layer of the model will occupy a large amount of storage capacity. These models are difficult to deploy on mobile devices because it consumes a lot of storage resources and computing space

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Hamming OCR method based on locality sensitive Hash network
  • Hamming OCR method based on locality sensitive Hash network
  • Hamming OCR method based on locality sensitive Hash network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0036] Below, in conjunction with accompanying drawing and specific embodiment, the present invention is described further:

[0037] A Hamming OCR (Optical Character Recognition, Optical Character Recognition) recognition method based on local sensitive hash network. In the embedding layer (embedding, embedding) of the Hamming OCR main model, and initialize the Hamming OCR main model. Then input the text image into the main model based on the attention mechanism. The text image first obtains the character feature vector used for text feature representation through the feature encoder; at the same time, the local sensitivity hash of the character corresponding to the last output result of the main model The code is input to the Hamming Embedding layer (Hamming Embedding, Hamming Embedding), which greatly reduces the storage and calculation space of the embedding layer in the main Hamming OCR model; then, sequence modeling is performed through three Transformer decoders that rem...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a Hamming OCR method based on a locality sensitive Hash network. The method comprises the following steps of: S1, acquiring a scene text image with OCR for making a training set and a test set; s2, constructing an auxiliary model, optimizing the auxiliary model by using the text image of the training set, training the model, and generating locality sensitive hash codes of all characters; s3, constructing a Hamming OCR master model, and outputting the locality sensitive hash code as an embedded layer of the Hamming OCR master model to obtain an initialized Hamming OCR master model; s4, optimizing parameters of the initialized Hamming OCR main model by using the text image of the training set to obtain an optimal Hamming OCR network model; and S5, inputting a to-be-recognized text image into the optimal Hamming OCR network model, generating an output code by the optimal Hamming OCR network model, and obtaining a text recognition result through the output code. According to the method, the model storage occupation capacity is reduced, and the model stability and the operation efficiency are improved.

Description

technical field [0001] The invention relates to the field of text recognition, in particular to a Hamming OCR recognition method based on a locally sensitive hash network. Background technique [0002] Scene text recognition aims to extract text content from images, which has great commercial value in practical applications. Scene text recognition usually includes scene text positioning and text recognition. For example, in Chinese text recognition, there are already a large number of mature detection and recognition methods at home and abroad, and they have been applied to practical work. These methods not only detect quickly, but also have a good recognition rate, which has certain advantages. Although scene text recognition has achieved certain breakthroughs and produced many representative models, most of these models are cumbersome. The general scene text recognition model uses one-hot encoding to encode each character in the final regression classification. When the...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/32G06K9/62G06N3/04G06N3/08
CPCG06N3/08G06V20/62G06V30/10G06N3/047G06F18/241G06F18/2415
Inventor 黄坤山葛任贤杨航
Owner FOSHAN NANHAI GUANGDONG TECH UNIV CNC EQUIP COOP INNOVATION INST
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products