Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

An image conversion method and system based on a generative adversarial network and a ResNts technology

An image conversion and generative technology, applied in the field of image processing, can solve the problems of inaccurate estimation of discriminator density ratio, failure to find the correlation between source domain and target domain features, and unstable training, so as to reduce image artifacts and enhance Stability, high stability effect

Pending Publication Date: 2019-04-23
EAST CHINA JIAOTONG UNIVERSITY
View PDF3 Cites 52 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

In order to solve this problem, researchers currently implement image conversion through generative confrontation network technology, but this requires paired image data sets for training
Recent research results CycleGAN and DualGAN use unpaired data for training, but these two algorithms cannot mutually learn the feature correlation between the source domain image and the target image
In addition to these difficulties, in the design of the image conversion framework in the present invention, mutual dual GAN ​​will also be used, but only mutual dual GAN ​​cannot find the feature correlation between the source domain and the target domain
[0013] 2) In some current image conversion methods, such as Pix2pixGAN, CycleGAN, etc., most of the generator network structures use U-Net and encoder-decoder, and the traditional residual blocks form the ResNet network structure, and try to extract input feature information. Then, due to the proposed The image conversion algorithm must have strong versatility, so the input image style and objects are different, and the generator cannot extract the local feature information of the input image using the traditional network structure
This prevents conversion of high-res detailed texture images
[0014] 3) At present, there is a discriminator network training stability problem in the training of GAN
In a high-dimensional space, the density ratio estimation made by the discriminator is often inaccurate, and it is very unstable during training. After training such a discriminator, the training of the generator basically stagnates.
So far in academia, the Kullback-Leibler (KL) divergence is used to control the discriminator loss function, but KL speeds up the convergence of the training process, and the discriminator enters the ideal state very early, which prevents the discriminator from feeding back more information to the generator , causing the generator to fail to train

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • An image conversion method and system based on a generative adversarial network and a ResNts technology
  • An image conversion method and system based on a generative adversarial network and a ResNts technology
  • An image conversion method and system based on a generative adversarial network and a ResNts technology

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0062] In order to make the object, technical solution and advantages of the present invention more clear, the present invention will be further described in detail below in conjunction with the examples. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0063] The traditional ResNets structure is difficult to extract the feature information of the low-dimensional space of the input image, which makes the image conversion based on this structure difficult to generate high-resolution images, and the generated images lack details and realistic textures. In addition, the training speed of the network is slow.

[0064] Traditional image conversion algorithms based on GAN, DCGAN, BlurGAN, etc. often can only achieve a single conversion style, and cannot achieve multiple image conversion tasks.

[0065] In view of the above problems, the application of the present invention will...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention belongs to the technical field of image processing, and discloses an image conversion method and system based on a generative adversarial network and a ResNts technology. The image conversion method comprises the following steps of firstly, designing an enhanced high-resolution dual GAN image conversion algorithm by utilizing a dual learning method combining two GAN confrontation mechanisms and a norm loss function, and realizing image conversion based on unsupervised learning by adopting an unlabeled data set through a model; secondly, realizing the conversion from source distribution to target distribution by introducing constraint conditions for reconstructing a consistency loss function, and then reconstructing the source distribution; and finally, adding a stable normalization layer in the discriminator. Compared with the resolution of images generated by Pix2pixGAN, CycleGAN and DualGAN, the average values of the ERGAN algorithmPSNR / SSIM provided by the invention are improved by 16% / 35%, 2% / 9% and 4% / 6% respectively.

Description

technical field [0001] The invention belongs to the technical field of image processing, and in particular relates to an image conversion method and system based on generative confrontation network and ResNets technology. Background technique [0002] At present, the existing technologies commonly used in the industry are as follows: [0003] Image translation has recently gained increasing research attention. Image conversion aims to map an image in the original domain to an image in another domain, that is, to convert a given sample image into a variety of new scenes, such as between different seasons, different climates, and scenes at different times of the day. Many computer vision and image processing tasks, especially image segmentation and image super-resolution, can also be considered as image transformations. Currently, image transformation based on deep learning is mainly applied to data augmentation. Data enhancement technology based on image transformation has...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06T3/40G06T5/00G06N3/04G06N3/08
CPCG06N3/088G06T3/4053G06N3/048G06N3/044G06N3/045G06T5/73
Inventor 胡辉崔淼
Owner EAST CHINA JIAOTONG UNIVERSITY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products