Semi-supervised multi-modal multi-class image translation method

A multi-modal, multi-category technology, applied in character and pattern recognition, instruments, computer components, etc.

Active Publication Date: 2019-09-20
BEIFANG UNIV OF NATITIES
View PDF9 Cites 12 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

At present, the existing image translation is to transform the problem into a one-to-one image mapping, which needs to be clearly given two different image domains, and in many scenarios, cross-domain image translation is multimodal, so , the existing cross-domain data translation cannot meet these needs

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Semi-supervised multi-modal multi-class image translation method
  • Semi-supervised multi-modal multi-class image translation method
  • Semi-supervised multi-modal multi-class image translation method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0041] The present invention will be further described below in conjunction with specific examples.

[0042] The semi-supervised multi-modal and multi-category image translation method provided in this embodiment realizes multi-modal and multi-category image translation. Such as figure 1 Shown, shows our overall network framework, first, the sample image x 1 and x 2 and a small number of tags are fed into the encoder. Then, using decoupled representation learning from the style encoder and content encoder, the image is decoupled into style encoding and content encoding respectively, and finally, image reconstruction and multimodal transformation are realized by concatenating style encoding and content encoding. It includes the following steps:

[0043] 1) Input two images x from different domains 1 and x 2 and with a small number of labels, images x from different domains 1 and x 2 , which refers to the difference in content and style between the two input images. Sem...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a semi-supervised multi-modal multi-category image translation method. The method comprises the following steps of S1, inputting two images from different domains and a small number of labels; S2, inputting the input images and labels into the encoders, dividing the encoders into a content encoder and a style encoder, and respectively decoupling the images from the style encoder and the content encoder by using the decoupling representation learning to obtain the style codes and the content codes; S3, inputting the style codes into an adversarial auto-encoder to complete the image multi-class training; inputting the content codes into a content adversarial learning network to complete the image multi-mode transformation training; and S4, realizing the image reconstruction and multi-mode conversion through splicing the style codes and the content codes. According to the method, the problem caused by the requirement for image translation diversity is solved, and the multi-modal and multi-class cross-domain images can be generated by jointly decoding the potential content codes and the style codes.

Description

technical field [0001] The present invention relates to the technical fields of computer vision, computer graphics and machine learning, in particular to a semi-supervised, multi-modal and multi-category image translation method. Background technique [0002] With the continuous development of deep learning technology and image generation technology, a large number of excellent works have emerged in the field of image translation. Semi-supervised multi-modal and multi-category image translation is an important and challenging research problem in the field of computer vision. Semi-supervised multi-modal Stateful multi-category image translation has obvious application value in industrial design and other fields, and can be applied to various aspects such as image colorization, super-resolution generation, and style transfer. At present, the existing image translation is to transform the problem into a one-to-one image mapping, which needs to be clearly given two different ima...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62G06K9/34
CPCG06V10/267G06F18/2155G06F18/24G06F18/214
Inventor 白静陈冉李赛赛姬卉
Owner BEIFANG UNIV OF NATITIES
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products