A text generation image method and device

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A technology for generating images and texts, applied in the field of image generation methods and devices based on text descriptions, to achieve the effects of complete content, reduced training difficulty, and reasonable logic

Active Publication Date: 2019-03-29

南京德磐信息科技有限公司

View PDF7 Cites 18 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0009] The technical problem to be solved by the present invention is to aim at the deficiencies in the above-mentioned prior art, and provide a text description based on conditional generation confrontation network and cyclic neural network A method for generating images, which can solve the problem of image generation in the case of multiple entities and complex relationships in the text description

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0034] The present invention will be further illustrated below in conjunction with the accompanying drawings and specific embodiments, and it should be understood that these examples are only used to illustrate the present invention and are not intended to limit the scope of the present invention. After reading the present invention, modifications to various equivalent forms of the present invention by those skilled in the art fall within the scope defined by the appended claims of the present application.

[0035] A text-to-image method based on conditional generative adversarial networks and recurrent neural networks, such as figure 1 shown, including the following steps:

[0036] Step 1, build a text encoder, input a natural language text sequence, and output an embedded representation of the text. The natural language text sequence is a word sequence p = (w 1 ,w 2 ,...,w d ), where each word is represented by a pre-trained word vector.

[0037] For example: input the ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a text generation image method and device. The text generation image method comprises the following steps: step 1, coding a natural language text describing the image to obtaina text semantic embedded representation; 2, mixing the text semantic embedding representation obtained in the step 1 with random noise, reading the text semantic embedding representation by a cyclicneural network transcoder, outputting the object hidden coding of each step by the random noise and the hidden state of the cyclic neural network transcoder; 3, decoding the hidden coding of each stepobject output from the step 2 to generate step images, and finally fusing all the step images to obtain the generated images; 4, carrying out confrontation training on that generate image and the real image. The generator of the invention decodes and generates the foreground and background pixel set of an image according to the object hidden coding through multi-step transcoding, and fuses the foreground and background pixel set of the image to generate a high-quality image, thereby reducing the training difficulty of directly generating the image.

Description

technical field [0001] The invention relates to the technical field of deep learning generation models, in particular to an image generation method and device based on text description. Background technique [0002] Generating photorealistic images from natural language text descriptions is an important problem and has wide applications, such as photo editing, computer-aided design, etc. [0003] There are many ways to learn generative models across image and text modalities. One of these lines of research is learning generative models for text conditioned on images, called "image captioning". The current mainstream processing flow of this type of model is to first extract image features and encode them with an encoder, and then use a decoder to generate unstructured text. [0004] Recently, Generative Adversarial Network (GAN) has achieved good results in generating realistic images. Its variant conditional-GAN (referred to as cGAN) can generate images related to the mea...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G06F17/22G06T9/00

CPCG06T9/002G06F40/126

Inventor 周德宇胡名起蒋明敏

Owner 南京德磐信息科技有限公司

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

A text generation image method and device

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology