Text-to-image generation method of adaptive attribute and instance mask embedded graph

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
An image generation and self-adaptive technology, applied in 2D image generation, image coding, image data processing, etc., can solve problems such as pixel overlap and occlusion, inability to synthesize high-resolution images, and lack of instance shape mask constraints

Pending Publication Date: 2020-06-26

QUFU NORMAL UNIV

View PDF5 Cites 18 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0005] However, the input of existing text-image generation methods is mostly sentence vectors, which lack word-level fine-grained information, and the synthesized images lack instance-level texture features.

In addition, in the process of image generation, the generator easily ignores the spatial interaction relationship between different instances, lacks the instance shape mask constraint, and the synthesized image has problems such as unreasonable instance shape, pixel overlap and occlusion; at the same time, sentence-level The discriminator can only provide coarse-grained training feedback information, and it is difficult to distinguish the visual attributes of word-level instances. As a result, the generative model tends to synthesize the "average" mode of the object instead of the most relevant attribute features, and cannot synthesize real and accurate high-resolution images. rate image

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0058] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with the accompanying drawings.

[0059] Such as figure 1 As shown, in an embodiment of the present invention, a text-to-image generation method of an adaptive attribute and an instance mask embedding graph is provided, and the method includes the following steps:

[0060] Step S1, according to the input text, use the frame regression network of the preset encoder-decoder structure to obtain the position and label information of the instance bounding box corresponding to each word in the text, and integrate the bounding boxes generated by all instances The location and label information of , get 64×64, 128×128 and 256×256 semantic layout;

[0061] The specific process is as figure 2 As shown, first use the pre-trained Bi-LSTM as a text encoder to encode the text into word vectors and a sentence...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention provides a text-to-image generation method of an adaptive attribute and instance mask embedded graph. The method comprises the following steps: obtaining 64 * 64, 128 * 128 and 256 * 256semantic layouts by using a frame regression network according to an input text; according to the 64 * 64 semantic layout, the sentence embedding vector and the random noise, generating coarse-grained image potential features and a low-resolution 64 * 64 image in a low-resolution generator; forming a pixel-level feature vector in the first high-resolution generator according to the 128 * 128 semantic layout; generating a first fine-grained image potential feature and a high-resolution 128 * 128 image in a first high-resolution generator according to the 128 * 128 semantic layout, the coarse-grained image potential feature and the pixel-level feature vector; and generating a high-resolution 256 * 256 image in a second high-resolution generator according to the 256 * 256 semantic layout, the first fine-grained image potential feature and the pixel-level feature vector. By implementing the method, the resolution of the image is high, the shape constraint of the instance is met, and the attribute characteristics are consistent with the description.

Description

technical field [0001] The invention relates to the technical field of computer vision image generation, in particular to a text-to-image generation method of adaptive attributes and instance mask embedding graphs. Background technique [0002] In recent years, deep learning has achieved good results in the field of text-image generation. Generative confrontation network (GAN), as the most commonly used generation model, jointly learns generators and discriminators; among them, the generator is mainly used to learn pixel distribution and generate realistic images, while the discriminator needs to identify the authenticity of the generated images. Continually confront updates to achieve the final Nash equilibrium. [0003] Generative confrontation network has many input types, such as random noise, semantic segmentation map, sketch map, image pair, scene map, text, etc. Among them, text is the simplest and most operable input form, and more and more researchers It also tend...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): G06T11/00G06T9/00G06F40/126G06F40/30G06F16/35G06N3/04

CPCG06T11/001G06T9/001G06F16/35G06N3/049G06N3/045

Inventor 倪建成张素素

Owner QUFU NORMAL UNIV

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Text-to-image generation method of adaptive attribute and instance mask embedded graph

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology