A task-oriented text generation image network model

A technology for generating images and network models, applied in the field of computer science, capable of solving problems such as poor ability to solve complex relationships, poor detail quality, and low image resolution

Active Publication Date: 2020-10-30
10TH RES INST OF CETC
View PDF5 Cites 23 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] The object of the present invention is to provide a method for generating high-quality images that are consistent with text semantics, rich in content, and harmonious in order to overcome the above-mentioned deficiencies in the prior art The task-oriented text generation image network model solves the problems of poor ability to deal with multiple entities and complex relationships between multiple entities, low resolution of generated images, and poor quality of details such as textures and edges in the prior art

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A task-oriented text generation image network model
  • A task-oriented text generation image network model
  • A task-oriented text generation image network model

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0020] Such as figure 1 As shown, in the preferred embodiment described below, a task-oriented text generation image network model includes: a task-oriented common sense reasoning module, a text-based feature expression module, a global generation model and a local refinement model, wherein: The task-oriented commonsense reasoning module combines natural language text descriptions, natural language task descriptions and commonsense knowledge bases to perform task-oriented commonsense reasoning, reasonably enriches and expands the entities and entity attributes in the text descriptions, and obtains the expanded text descriptions and Feature expression: Task descriptions of task types, scenarios, entities, etc., and common sense knowledge base for common sense reasoning, reasonable enrichment of entities and entity attributes in natural language text descriptions, in order to enhance the text-based feature expression module obtained The harmony, diversity and authenticity of the...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a task-oriented text generation image network model, and aims to provide a text generation image network model for generating high-quality images consistent with text semanticsand rich in content. The invention is realized through the following technical scheme: a common sense reasoning module enriches and expands entities and entity attributes in natural language text description, and then respectively constructs an entity relationship scene graph and entity attribute semantic vectors for the natural language text description; the global generation model inputs the entity relationship scene graph into a GCN, respectively inputs the obtained embedded vectors into a mask regression network and a bounding box regression network, estimates a segmentation mask and a bounding box of each entity, fuses all entity layouts to form a scene layout, and generates an initial image in combination with a convolutional neural network; and the local refinement model takes theentity attribute semantic vector and the initial image feature mapping as input, and generates a high-quality image with rich and harmonious content in combination with the RRRN and the convolutionalneural network.

Description

technical field [0001] The present invention relates to the field of computer science, especially the Generative Adversarial Networks (GAN) technology in the field of deep learning, in particular to a task-oriented text generation image network model. Background technique [0002] With the widespread use of camera-capable mobile smart terminals and the rapid development of the Internet, multi-modal data that integrates visual and textual information is increasing dramatically, for example, photos with text annotations, graphic-text content in newspaper articles, captioned Video and multimodal interaction data appearing in social media. Image text description method can effectively organize image data, and combine text information retrieval technology to search massive image data conveniently. In addition, using the image-to-text description method can not only understand the content of the speaker from the images in the slides, but also help the visually impaired to underst...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/36G06F16/33G06F40/30G06F40/284G06F40/289G06N5/04G06N3/04G06N3/08
CPCG06F16/367G06F16/3344G06F40/30G06F40/284G06F40/289G06N5/04G06N3/08G06N3/045
Inventor 李春豹崔莹代翔刘鑫
Owner 10TH RES INST OF CETC
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products