A semi-supervised multi-modal multi-category image translation method

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A multi-modal, multi-category technology, applied in character and pattern recognition, instruments, computing, etc., to achieve good quality results

Active Publication Date: 2021-11-02

BEIFANG UNIV OF NATITIES

View PDF0 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

At present, the existing image translation is to transform the problem into a one-to-one image mapping, which needs to be clearly given two different image domains, and in many scenarios, cross-domain image translation is multimodal, so , the existing cross-domain data translation cannot meet these needs

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0041] The present invention will be further described below in conjunction with specific examples.

[0042] The semi-supervised multi-modal and multi-category image translation method provided in this embodiment realizes multi-modal and multi-category image translation. Such as figure 1 Shown, shows our overall network framework, first, the sample image x 1 and x 2 and a small number of labels are fed into the encoder. Then, using decoupled representation learning from the style encoder and content encoder, the image is decoupled into style encoding and content encoding respectively, and finally, image reconstruction and multimodal transformation are realized by concatenating style encoding and content encoding. It includes the following steps:

[0043] 1) Input two images x from different domains 1 and x 2 and with a small number of labels, images x from different domains 1 and x 2 , which refers to the difference in content and style between the two input images. S...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a semi-supervised multi-modal multi-category image translation method, comprising steps: S1, input two images from different domains and a small number of labels; S2, send the input images and labels to an encoder, and encode The encoder is divided into a content encoder and a style encoder, using decoupling representation learning to decouple the image from the style encoder and the content encoder to obtain style encoding and content encoding respectively; S3, input the style encoding into the adversarial autoencoder , to complete image multi-category training; input the content code into the content confrontation learning network to complete image multi-modal transformation training; S4, realize image reconstruction and multi-modal transformation by splicing style coding and content coding. The invention solves the dilemma caused by the requirement of image translation diversity, and can generate multi-modal and multi-category cross-domain images through joint decoding of latent content coding and style coding.

Description

technical field [0001] The present invention relates to the technical fields of computer vision, computer graphics and machine learning, in particular to a semi-supervised, multi-modal and multi-category image translation method. Background technique [0002] With the continuous development of deep learning technology and image generation technology, a large number of excellent works have emerged in the field of image translation. Semi-supervised multi-modal and multi-category image translation is an important and challenging research problem in the field of computer vision. Semi-supervised multi-modal Stateful multi-category image translation has obvious application value in industrial design and other fields, and can be applied to various aspects such as image colorization, super-resolution generation, and style transfer. At present, the existing image translation is to transform the problem into a one-to-one image mapping, which needs to be clearly given two different ima...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & AuthorityPatents(China)

IPC IPC(8): G06K9/62G06K9/34

CPCG06V10/267G06F18/2155G06F18/24G06F18/214

Inventor白静陈冉李赛赛姬卉

OwnerBEIFANG UNIV OF NATITIES

A semi-supervised multi-modal multi-category image translation method

What is AI technical title? AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document. A multi-modal, multi-category technology, applied in character and pattern recognition, instruments, computing, etc., to achieve good quality results

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A multi-modal, multi-category technology, applied in character and pattern recognition, instruments, computing, etc., to achieve good quality results

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology