Text-to-image generation method of adaptive attribute and instance mask embedded graph

An image generation and self-adaptive technology, applied in 2D image generation, image coding, image data processing, etc., can solve problems such as pixel overlap and occlusion, inability to synthesize high-resolution images, and lack of instance shape mask constraints

Pending Publication Date: 2020-06-26
QUFU NORMAL UNIV
View PDF5 Cites 18 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] However, the input of existing text-image generation methods is mostly sentence vectors, which lack word-level fine-grained information, and the synthesized images lack instance-level texture features.
In addition, in the process of image generation, the generator easily ignores the spatial interaction relationship between different instances, lacks the instance shape mask constraint, and the synthesized image has problems such as unreasonable instance shape, pixel overlap and occlusion; at the same time, sentence-level The discriminator can only provide coarse-grained training feedback information, and it is difficult to distinguish the visual attributes of word-level instances. As a result, the generative model tends to synthesize the "average" mode of the object instead of the most relevant attribute features, and cannot synthesize real and accurate high-resolution images. rate image

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text-to-image generation method of adaptive attribute and instance mask embedded graph
  • Text-to-image generation method of adaptive attribute and instance mask embedded graph
  • Text-to-image generation method of adaptive attribute and instance mask embedded graph

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0058] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with the accompanying drawings.

[0059] Such as figure 1 As shown, in an embodiment of the present invention, a text-to-image generation method of an adaptive attribute and an instance mask embedding graph is provided, and the method includes the following steps:

[0060] Step S1, according to the input text, use the frame regression network of the preset encoder-decoder structure to obtain the position and label information of the instance bounding box corresponding to each word in the text, and integrate the bounding boxes generated by all instances The location and label information of , get 64×64, 128×128 and 256×256 semantic layout;

[0061] The specific process is as figure 2 As shown, first use the pre-trained Bi-LSTM as a text encoder to encode the text into word vectors and a sentence...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a text-to-image generation method of an adaptive attribute and instance mask embedded graph. The method comprises the following steps: obtaining 64 * 64, 128 * 128 and 256 * 256semantic layouts by using a frame regression network according to an input text; according to the 64 * 64 semantic layout, the sentence embedding vector and the random noise, generating coarse-grained image potential features and a low-resolution 64 * 64 image in a low-resolution generator; forming a pixel-level feature vector in the first high-resolution generator according to the 128 * 128 semantic layout; generating a first fine-grained image potential feature and a high-resolution 128 * 128 image in a first high-resolution generator according to the 128 * 128 semantic layout, the coarse-grained image potential feature and the pixel-level feature vector; and generating a high-resolution 256 * 256 image in a second high-resolution generator according to the 256 * 256 semantic layout, the first fine-grained image potential feature and the pixel-level feature vector. By implementing the method, the resolution of the image is high, the shape constraint of the instance is met, and the attribute characteristics are consistent with the description.

Description

technical field [0001] The invention relates to the technical field of computer vision image generation, in particular to a text-to-image generation method of adaptive attributes and instance mask embedding graphs. Background technique [0002] In recent years, deep learning has achieved good results in the field of text-image generation. Generative confrontation network (GAN), as the most commonly used generation model, jointly learns generators and discriminators; among them, the generator is mainly used to learn pixel distribution and generate realistic images, while the discriminator needs to identify the authenticity of the generated images. Continually confront updates to achieve the final Nash equilibrium. [0003] Generative confrontation network has many input types, such as random noise, semantic segmentation map, sketch map, image pair, scene map, text, etc. Among them, text is the simplest and most operable input form, and more and more researchers It also tend...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06T11/00G06T9/00G06F40/126G06F40/30G06F16/35G06N3/04
CPCG06T11/001G06T9/001G06F16/35G06N3/049G06N3/045
Inventor 倪建成张素素
Owner QUFU NORMAL UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products