Text image generation method and system based on multi-stage generative adversarial network
An image generation, multi-stage technology, applied in biological neural network models, image data processing, 2D image generation, etc.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0050] Such as figure 1 As shown, this embodiment provides a text generation image method based on a multi-stage generative adversarial network. This embodiment uses the method applied to a server as an example for illustration. It can be understood that this method can also be applied to a terminal, or The application includes terminals, servers and systems, and is realized through the interaction between terminals and servers. The server can be an independent physical server, or a server cluster or distributed system composed of multiple physical servers, or it can provide cloud services, cloud database, cloud computing, cloud function, cloud storage, network server, cloud communication, intermediate Cloud servers for basic cloud computing services such as software services, domain name services, security service CDN, and big data and artificial intelligence platforms. The terminal may be a smart phone, a tablet computer, a laptop computer, a desktop computer, a smart speak...
Embodiment 2
[0081] This embodiment provides a text generation image system based on a multi-stage generation confrontation network.
[0082] Such as figure 2 As shown, the multi-stage generative adversarial network with integrated attention mechanism consists of three parts: text feature extraction, generative network and discriminative network. The text description is encoded into a sentence vector and a word vector by a text encoder, the sentence vector is used as the initial feature input, and the word vector is used for initial image generation and post-image refinement, respectively. In the generation stage of the image, the initial features add text features to the generated image through the upward module and the traditional attention module. The discriminative network predicts an adversarial loss to evaluate the visual authenticity and semantic consistency of generated image features by extracting features from generated images and spatially stitching them with textual informati...
Embodiment approach
[0088] As one or more implementations, the generation network module includes: an initial image generation module, a first refinement module, and a second refinement module;
[0089] The initial image generation module is configured to: receive word vectors and splicing vectors, perform word-level deep fusion processing, output initial image feature vectors, and convolute the initial image feature vectors to obtain a first resolution image;
[0090] The first refinement module is configured to: receive a word vector, use a traditional attention mechanism to convert the word vector into a common semantic space of image features, and calculate the word context vector and the initial image feature vector according to the initial image feature vector splicing, outputting the first image feature vector, and convoluting the first image feature vector to obtain a second resolution image;
[0091] The second refinement module is configured to: receive a word vector, use a traditional ...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More - R&D
- Intellectual Property
- Life Sciences
- Materials
- Tech Scout
- Unparalleled Data Quality
- Higher Quality Content
- 60% Fewer Hallucinations
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2025 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com



