Text recognition model training method, text recognition method, device and equipment

A text recognition and model training technology, applied in the field of image recognition, can solve the problems of weak model feature extraction, long-term and short-term dependence, and insufficient effective coding, so as to solve the problems of effective coding and long-term and short-term dependence, and improve the generalization ability of recognition and robustness, the effect of improving feature extraction capabilities

Active Publication Date: 2020-07-10
SUNING CLOUD COMPUTING CO LTD
View PDF2 Cites 17 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005]Based on traditional image processing methods, it is often necessary to manually design some features, and some rules are interspersed in the middle to correct the improper processing of the algorithm, and the image background is complex , There is a lot of interference, and the characters are seriously bonded, the effect of the traditional method is not very good
Although the CRNN-based method uses deep learning to reduce manual intervention and improve the accuracy of text recognition and the stability of the algorithm, the original CRNN model uses the traditional VGG network for convolution to extract feature sequences and uses two-way LSTM. CTC is used to deal with the problem of converting variable length sequences into text, so for more complex and changeable text recognition tasks, there will be problems of weak model feature extraction, insufficient effective coding, and long-term and short-term dependencies, which will affect the recognition results.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text recognition model training method, text recognition method, device and equipment
  • Text recognition model training method, text recognition method, device and equipment
  • Text recognition model training method, text recognition method, device and equipment

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0082] The embodiment of the present invention provides a kind of text recognition model training method, and its executor can be server, and server can adopt independent server or server cluster, such as figure 1 As shown, the method may include:

[0083] Step 101, acquire an image sample set, wherein the image samples in the image sample set include text images and text labels associated with the text images.

[0084] In this embodiment, when preparing an image sample, the ratio of the area of ​​the text area in the text image to the area of ​​the entire text image needs to exceed a preset ratio, for example, the preset ratio is set to 80%, and at the same time prepare a text image with the same name and a different suffix The annotation file stores the content of the field to be recognized in the text image, that is, the text label associated with the text image.

[0085] Step 102, performing sample expansion on the image sample set, and dividing the sample-expanded image ...

Embodiment 2

[0154] Based on the text recognition model trained in the first embodiment above, the embodiment of the present invention also provides a text recognition method, such as Figure 7 As shown, the method may include:

[0155] Step 701, preprocessing the input text image to be recognized.

[0156] Step 702, input the preprocessed text image to be recognized into the pre-trained text recognition model for text recognition, and output the text recognition result of the text image to be recognized;

[0157] Wherein, the pre-trained text recognition model is trained based on the method in the first embodiment.

[0158] Specifically, the text images in the test set are preprocessed, and the preprocessing here does not need to enhance the text images, but only sets the image size to 64*256, and normalizes the images, and converts the pixel values Scale to (-1, 1). Input the preprocessed text image into the text recognition model after iterative training, including initializing the n...

Embodiment 3

[0166] Based on the first embodiment above, the embodiment of the present invention provides a text recognition model training device, such as Figure 8 As shown, the device includes:

[0167] The sample acquisition module 81 is used to acquire an image sample set, and the image samples in the image sample set include text images and text labels associated with the text images;

[0168] A sample expansion module 82, configured to perform sample expansion on the image sample set;

[0169] The sample division module 83 is used to divide the image sample set after the sample expansion into a training set, a verification set and a test set;

[0170] The iterative training module 84 is used to iteratively train the text recognition model according to the training set and the verification set, wherein the text recognition model replaces the original VGG network in the CRNN network model with the SE-ResNet network, and is combined with the BiLSTM network layer and The attention mec...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a text recognition model training method, a text recognition method, device and equipment, and belongs to the technical field of image recognition, and the text recognition model training method comprises the steps: obtaining an image sample set, an image sample in the image sample set comprising a text image and a text label associated with the text image; performing sample expansion on the image sample set, and dividing the image sample set after sample expansion into a training set, a verification set and a test set; performing iterative training on a text recognition model according to the training set and the verification set, the text recognition model being constructed by replacing an original VGG network in a CRNN network model with an SE-ResNet network andsequentially cascading the SE-ResNet network with a BiLSTM network layer and an attention mechanism layer; and performing performance test on the text recognition model after iterative training according to the test set. According to the embodiment of the invention, the feature extraction capability of the text recognition model can be improved, and the feature vector decoding effect is improved,so that the text recognition accuracy is improved.

Description

technical field [0001] The invention relates to the technical field of image recognition, in particular to a text recognition model training method, text recognition method, device and equipment. Background technique [0002] The text recognition task requires certain image processing to recognize the text content in the image. Text recognition can be applied to many fields, such as sorting of letters and parcels, editing and proofreading of manuscripts, summary and analysis of a large number of statistical reports and cards, processing of bank checks, statistical summary of commodity invoices, identification of commodity codes, commodity warehouses management, as well as document retrieval, identification of various documents and office automation of financial bill processing, etc. It is convenient for users to quickly enter information and improve work efficiency in all walks of life. [0003] At present, there are two categories of text recognition methods. One is based...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/34G06N3/04G06N3/08
CPCG06N3/08G06V10/26G06N3/045
Inventor 金宏运杨现陈浩
Owner SUNING CLOUD COMPUTING CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products