Self-supervised attribute controllable image generation method based on deep twin network

A twin network and image generation technology, applied in the field of computer vision, can solve the problems of large model parameters, complex network optimization steps, and insufficient category attribute control effect, so as to improve reliability, improve category attribute control effect and image generation quality , good category attribute to control the effect of image generation effect

Pending Publication Date: 2022-05-13
YANGTZE DELTA REGION INST (QUZHOU) UNIV OF ELECTRONIC SCI & TECH OF CHINA
View PDF1 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0009] The invention is a self-supervised attribute-controllable image generation method based on twin networks, which mainly solves the problem of insufficient category attribute control effect

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Self-supervised attribute controllable image generation method based on deep twin network
  • Self-supervised attribute controllable image generation method based on deep twin network
  • Self-supervised attribute controllable image generation method based on deep twin network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0084] Step 1: Preprocessing the experimental data;

[0085] The present invention selects the CIFAR-10 data set as the experimental data, which is obtained from the official release channel of the data set. The CIFAR-10 dataset consists of 60,000 true-color RGB images, of which 50,000 are training data and the remaining 10,000 are testing data. The CIFAR-10 dataset contains a total of 10 category attributes, and each category attribute has the same number of images. In addition, each image in the dataset is 3×32×32 in size. In order to apply RGB image data to the training of the deep learning model, convert all image data into Tensor form.

[0086] Step 2: Carry out image transformation operation;

[0087] The experimental image data will be applied to the training of the generative confrontation network and the training of the twin encoder network for decoupling attribute separation learning. For the training of the generative confrontation network, the Tensor format data ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a self-supervised attribute-controllable image generation method based on a deep twin network, and belongs to the field of computer vision image generation. The method is based on a twin network idea, takes cosine similarity as a distance measurement method, restrains the similarity between two enhanced samples of the same image, and further stabilizes a network training process by using a gradient stopping strategy, thereby providing differentiated representation of category attributes in real data for a twin encoder network, and improving the accuracy of the twin encoder network. And the learned category representation is used for an image generation process, a generator is induced to realize category attribute control, meanwhile, common mean square error loss in other methods is deleted, and the model optimization difficulty is reduced. According to the self-supervised attribute control scheme based on the deep twin network, the class attribute control generation level of the generative adversarial network can be remarkably improved, the image generation quality of an existing method is improved, and a more excellent effect is achieved through fewer constraints and shorter training time.

Description

technical field [0001] The invention belongs to the field of computer vision, mainly relates to the task of controllable generation of category attributes of images, and is mainly used in cultural entertainment industry, industrial image data expansion, machine vision understanding and the like. Background technique [0002] Image generation technology is a technique that solves the maximum likelihood estimation of image data distribution by establishing a mathematical model, and samples from the estimated distribution to generate new images that are similar to the original data but do not exist in the original data. In recent years, with the rapid development of deep learning, a number of image generation technologies based on deep neural networks have emerged, and have achieved image generation effects comparable to real images. According to whether to directly calculate the maximum likelihood estimation of the real image distribution, it can be divided into two categories...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06T11/00G06V10/762G06V10/774G06V10/82G06K9/62G06N3/04
CPCG06T11/00G06N3/045G06F18/23G06F18/214
Inventor 陈志勇
Owner YANGTZE DELTA REGION INST (QUZHOU) UNIV OF ELECTRONIC SCI & TECH OF CHINA
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products