A semi-supervised multi-modal multi-category image translation method

A multi-modal, multi-category technology, applied in character and pattern recognition, instruments, computing, etc., to achieve good quality results

Active Publication Date: 2021-11-02
BEIFANG UNIV OF NATITIES
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

At present, the existing image translation is to transform the problem into a one-to-one image mapping, which needs to be clearly given two different image domains, and in many scenarios, cross-domain image translation is multimodal, so , the existing cross-domain data translation cannot meet these needs

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A semi-supervised multi-modal multi-category image translation method
  • A semi-supervised multi-modal multi-category image translation method
  • A semi-supervised multi-modal multi-category image translation method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0041] The present invention will be further described below in conjunction with specific examples.

[0042] The semi-supervised multi-modal and multi-category image translation method provided in this embodiment realizes multi-modal and multi-category image translation. Such as figure 1 Shown, shows our overall network framework, first, the sample image x 1 and x 2 and a small number of labels are fed into the encoder. Then, using decoupled representation learning from the style encoder and content encoder, the image is decoupled into style encoding and content encoding respectively, and finally, image reconstruction and multimodal transformation are realized by concatenating style encoding and content encoding. It includes the following steps:

[0043] 1) Input two images x from different domains 1 and x 2 and with a small number of labels, images x from different domains 1 and x 2 , which refers to the difference in content and style between the two input images. S...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a semi-supervised multi-modal multi-category image translation method, comprising steps: S1, input two images from different domains and a small number of labels; S2, send the input images and labels to an encoder, and encode The encoder is divided into a content encoder and a style encoder, using decoupling representation learning to decouple the image from the style encoder and the content encoder to obtain style encoding and content encoding respectively; S3, input the style encoding into the adversarial autoencoder , to complete image multi-category training; input the content code into the content confrontation learning network to complete image multi-modal transformation training; S4, realize image reconstruction and multi-modal transformation by splicing style coding and content coding. The invention solves the dilemma caused by the requirement of image translation diversity, and can generate multi-modal and multi-category cross-domain images through joint decoding of latent content coding and style coding.

Description

technical field [0001] The present invention relates to the technical fields of computer vision, computer graphics and machine learning, in particular to a semi-supervised, multi-modal and multi-category image translation method. Background technique [0002] With the continuous development of deep learning technology and image generation technology, a large number of excellent works have emerged in the field of image translation. Semi-supervised multi-modal and multi-category image translation is an important and challenging research problem in the field of computer vision. Semi-supervised multi-modal Stateful multi-category image translation has obvious application value in industrial design and other fields, and can be applied to various aspects such as image colorization, super-resolution generation, and style transfer. At present, the existing image translation is to transform the problem into a one-to-one image mapping, which needs to be clearly given two different ima...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06K9/62G06K9/34
CPCG06V10/267G06F18/2155G06F18/24G06F18/214
Inventor 白静陈冉李赛赛姬卉
Owner BEIFANG UNIV OF NATITIES
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products