Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Sequence domain adaptation method based on adversarial learning in scene text recognition

A text recognition and adaptive method technology, applied in the direction of neural learning methods, character recognition, character and pattern recognition, etc., can solve the problem of sequence recognition tasks being powerless

Active Publication Date: 2020-05-08
FUDAN UNIV
View PDF3 Cites 17 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

But powerless for sequence recognition tasks like text recognition

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Sequence domain adaptation method based on adversarial learning in scene text recognition
  • Sequence domain adaptation method based on adversarial learning in scene text recognition
  • Sequence domain adaptation method based on adversarial learning in scene text recognition

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0059] Below, the method of the present invention is further introduced through specific examples, and performance testing and analysis are carried out.

[0060] The sequence domain adaptation method provided by the present invention is a supervised learning method based on end-to-end training of a deep neural network, which needs to prepare source domain data and target domain data for training in advance.

[0061] The concrete steps of the inventive method are as follows:

[0062] Step 1: Scale the original image files of the source domain and the target domain to obtain a fixed-size image. And perform data preprocessing on the image (normalize the data, subtract the mean and divide the standard deviation), the data in the target domain needs to be divided into a training set and a test set, and all the data in the source domain are used as the training set.

[0063] Step 2, shuffling all the samples in the training set, and selecting a batch of images from the source domai...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention belongs to the technical field of artificial intelligence, and particularly relates to a domain adaptation method based on a machine vision scene text recognition task. The method comprises the following steps: constructing a CNN-LSTM network and an attention network; combining the CNN-LSTM network and the attention network into a scene text recognition network; inputting the scene images from the source domain and the target domain into a scene text recognition network, extracting image features from the input scene images by CNN-LSTM, recoding the image features by an attentionnetwork, extracting corresponding features of each character, and segmenting text information in the images into character level information; and finally, constructing a domain classification networkby applying a transfer learning technology based on adversarial learning, forming an adversarial generation network together with the scene text recognition network, and finally enabling the model toeffectively adapt to a target domain. According to the method, a small number of target domain calibration samples are fully utilized, the problem of sample scarcity frequently occurring in an actualscene text recognition task is solved, and the recognition effect is improved.

Description

technical field [0001] The invention belongs to the technical field of artificial intelligence, and in particular relates to a sequence domain adaptation method based on adversarial learning in scene text recognition. Background technique [0002] With the rapid development of the field of computer vision, the application of scene text recognition covers all aspects of life. However, a key factor for deep learning to ensure the effect is the need to provide a large number of calibration samples, but in practice, only a small number of calibration samples are often provided. A common solution is to use existing samples of related scenes with a large number of samples to participate in training, but due to the differences between scenes, the effect is often unsatisfactory. The existing methods of domain adaptation have a common problem, that is, they are all aimed at classification tasks in computer vision tasks, and reduce the difference between the source domain and the tar...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/32G06K9/62G06N3/04G06N3/08
CPCG06N3/08G06V20/63G06V30/10G06N3/045G06F18/241G06F18/214Y02T10/40
Inventor 周水庚林景煌程战战
Owner FUDAN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products