Supercharge Your Innovation With Domain-Expert AI Agents!

A method for semantically multimodal image generation

A multi-modal image and semantic technology, applied in the field of computer vision, can solve the problem that the generation effect cannot be controlled independently

Active Publication Date: 2022-03-25
HUAZHONG UNIV OF SCI & TECH
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

These methods either generate natural images that are fixed, or can only make global changes to the image, and cannot individually control the generation effect of each category.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A method for semantically multimodal image generation
  • A method for semantically multimodal image generation
  • A method for semantically multimodal image generation

Examples

Experimental program
Comparison scheme
Effect test

example 2

[0127] Many different applications can be developed on the basis of this method, such as image editing, image mixing and matching, and the like. Figure 6The dressing process is shown. The three images on the upper left are natural images, and the three classes of hair, clothes and pants are selected for operation. The upper right is the input segmentation label, and the second row is the image generated according to the combination of different classes in the first row. This method can combine the different classes in the three images into one image. Figure 7 Demonstrate image editing, which can be achieved by changing the segmentation label to change the generated content of the image.

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a method for generating semantically multimodal images. The present invention proposes a new task on the task of segmenting the label to generate an image, and the new task is to be able to individually control the category of the generated image by dividing the category of the label. On this task, the present invention proposes a solution that uses grouped convolution for the encoder to separate the encoding of different classes in the natural image, so that changing the encoding can change the style of the class in the generated image. In the generator part, the present invention adopts grouped convolution with gradually decreasing number of groups, which can improve the ability of independent control and reduce the occupation of video memory, so this method can be applied to data sets with many categories. At the same time, the present invention can be applied to many image generation applications, such as image editing and the like. Compared with the previous related work of segmenting tags to generate natural images, the present invention can individually control the categories in the generated images as much as possible.

Description

technical field [0001] The invention belongs to the technical field of computer vision, and more specifically relates to a method for generating semantically multi-modal images. Background technique [0002] In recent years, the image quality generated by GAN has been getting higher and higher, and many interesting applications have been produced, such as image editing, image completion, etc. Another application is to generate natural images through semantic segmentation labels of images. This can be seen as the inverse process of image segmentation. Generating natural images by segmenting labels can facilitate the creation of various images. Only need to draw the segmentation map that needs to generate images, and then can generate images that are close to real through the network, simplifying the operation process and enhancing the control over images . [0003] Generating natural images through segmentation labels is a one-to-many problem, that is, one segmentation map ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06K9/62G06N3/04G06N3/08G06V10/776G06V10/764
CPCG06N3/084G06N3/045G06F18/2163G06F18/217G06F18/241
Inventor 白翔朱臻徐志良
Owner HUAZHONG UNIV OF SCI & TECH
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More