Semi-supervised multi-modal multi-class image translation method

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A multi-modal, multi-category technology, applied in character and pattern recognition, instruments, computer components, etc.

Active Publication Date: 2019-09-20

BEIFANG UNIV OF NATITIES

View PDF9 Cites 12 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

At present, the existing image translation is to transform the problem into a one-to-one image mapping, which needs to be clearly given two different image domains, and in many scenarios, cross-domain image translation is multimodal, so , the existing cross-domain data translation cannot meet these needs

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0041] The present invention will be further described below in conjunction with specific examples.

[0042] The semi-supervised multi-modal and multi-category image translation method provided in this embodiment realizes multi-modal and multi-category image translation. Such as figure 1 Shown, shows our overall network framework, first, the sample image x 1 and x 2 and a small number of tags are fed into the encoder. Then, using decoupled representation learning from the style encoder and content encoder, the image is decoupled into style encoding and content encoding respectively, and finally, image reconstruction and multimodal transformation are realized by concatenating style encoding and content encoding. It includes the following steps:

[0043] 1) Input two images x from different domains 1 and x 2 and with a small number of labels, images x from different domains 1 and x 2 , which refers to the difference in content and style between the two input images. Sem...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a semi-supervised multi-modal multi-category image translation method. The method comprises the following steps of S1, inputting two images from different domains and a small number of labels; S2, inputting the input images and labels into the encoders, dividing the encoders into a content encoder and a style encoder, and respectively decoupling the images from the style encoder and the content encoder by using the decoupling representation learning to obtain the style codes and the content codes; S3, inputting the style codes into an adversarial auto-encoder to complete the image multi-class training; inputting the content codes into a content adversarial learning network to complete the image multi-mode transformation training; and S4, realizing the image reconstruction and multi-mode conversion through splicing the style codes and the content codes. According to the method, the problem caused by the requirement for image translation diversity is solved, and the multi-modal and multi-class cross-domain images can be generated by jointly decoding the potential content codes and the style codes.

Description

technical field [0001] The present invention relates to the technical fields of computer vision, computer graphics and machine learning, in particular to a semi-supervised, multi-modal and multi-category image translation method. Background technique [0002] With the continuous development of deep learning technology and image generation technology, a large number of excellent works have emerged in the field of image translation. Semi-supervised multi-modal and multi-category image translation is an important and challenging research problem in the field of computer vision. Semi-supervised multi-modal Stateful multi-category image translation has obvious application value in industrial design and other fields, and can be applied to various aspects such as image colorization, super-resolution generation, and style transfer. At present, the existing image translation is to transform the problem into a one-to-one image mapping, which needs to be clearly given two different ima...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & AuthorityApplications(China)

IPC IPC(8): G06K9/62G06K9/34

CPCG06V10/267G06F18/2155G06F18/24G06F18/214

Inventor白静陈冉李赛赛姬卉

OwnerBEIFANG UNIV OF NATITIES

Semi-supervised multi-modal multi-class image translation method

What is AI technical title? AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document. A multi-modal, multi-category technology, applied in character and pattern recognition, instruments, computer components, etc.

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A multi-modal, multi-category technology, applied in character and pattern recognition, instruments, computer components, etc.

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology