Unlock instant, AI-driven research and patent intelligence for your innovation.

A training method to ensure the smooth convergence of the maximum and minimum loss functions of the gan model

A loss function, maximum and minimum technology, applied in the field of deep learning, can solve problems such as explosion, generator and opponent game imbalance, gradient increase of generator loss function, etc., to achieve the effect of improving the training effect

Active Publication Date: 2022-07-29
江苏艾佳家居用品有限公司
View PDF2 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

These two steps are generally carried out alternately until the data distribution generated by the generator approaches the real data distribution, and the adversarial device cannot distinguish the output of the generator from the real data.
However, in the actual operation process, this method of blindly alternating training regardless of the convergence of the generator and the adversarial device will cause the game imbalance between the generator and the adversarial device
The loss function of the generator includes the calculation of the output of the generator by the adversarial device. When the adversarial device converges quickly, if the convergence speed of the generator cannot keep up, there is a high probability that the gradient of the loss function of the generator will continue to increase until the end Gradient explosion is not derivable in the field of numerical calculation, so that the optimizer cannot update the generator parameters, and finally the GAN model training fails

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A training method to ensure the smooth convergence of the maximum and minimum loss functions of the gan model
  • A training method to ensure the smooth convergence of the maximum and minimum loss functions of the gan model
  • A training method to ensure the smooth convergence of the maximum and minimum loss functions of the gan model

Examples

Experimental program
Comparison scheme
Effect test

specific Embodiment

[0069] In the GAN model (generative adversarial network model), there are two models that play with each other, one is the generator (generator) and the other is the discriminator (discriminator). In short, the generator is the function G(.), and the adversary is the function D(.).

[0070] In the specific implementation, we use the tensorflow machine learning platform for algorithm development.

[0071] In order to easily reproduce the method, we use the open source MNIST data as an example. Note that the use of DCGAN to generate MNIST data pictures is not a feature of the present invention, and only the key steps are posted here mainly for the convenience of reproduction.

[0072] 1. Prepare the data

[0073] Download the MNIST dataset from the internet with the following command:

[0074] (train_images, train_labels), (_, _) = tf.keras.datasets.mnist.load_data()

[0075] Among them, the variable train_images stores the MNIST image information, and the variable train_lab...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a training method for ensuring the stable convergence of the maximum and minimum loss functions of a GAN model, and relates to the field of deep learning of the GAN model training method. The method solves the above problem by reasonably setting the parameter update conditions and frequency of the generator and the antagonist. The game imbalance problem between the generator and the antagonist during the training of the GAN model. The so-called game imbalance problem refers to the rapid convergence of one of the generator and the antagonist during the training process, making the optimization surface of the other party almost unsteerable, so it cannot be smoothly training phenomenon.

Description

technical field [0001] The invention relates to the field of deep learning of a GAN model training method, in particular to a training method for ensuring the stable convergence of the maximum and minimum loss functions of the GAN model. Background technique [0002] GAN models are widely used in image generation, speech generation, text generation and other fields. The training of the GAN model generally includes two steps, one is to update the parameters of the generator according to the gradient of the loss function of the generator, and the other is to update the parameters of the adversary according to the gradient of the loss function of the adversary. These two steps are generally performed alternately until the data distribution generated by the generator is close to the real data distribution, and the adversary cannot distinguish the output of the generator from the real data. However, in actual operation, this method of training alternately regardless of the conve...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06N3/04G06N3/08G06V10/774G06V10/776G06K9/62
CPCG06N3/08G06N3/045G06F18/2193G06F18/214
Inventor 陈旋吕成云林善冬
Owner 江苏艾佳家居用品有限公司