Picture description generation method and system based on Actor-Critic generative adversarial network

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A picture description and generation system technology, applied in the direction of biological neural network model, neural learning method, character and pattern recognition, etc., can solve the problems of rough text quality and not meeting expectations, so as to increase diversity and creativity, improve The effect of high performance and training efficiency

Pending Publication Date: 2020-05-15

ZHEJIANG UNIV

View PDF5 Cites 29 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0007] SeqGAN solves the training problem of discrete data through reinforcement learning, and provides a set of methods for GAN in the field of text generation, but many experimental results show that the quality of the text generated by the SeqGAN model is rough, far below expectations

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0070] The present invention will be further described in detail below with reference to the accompanying drawings and embodiments. It should be noted that the following embodiments are intended to facilitate the understanding of the present invention, but do not limit it in any way.

[0071] The architecture of the picture text description generation system of the present invention is as follows: figure 1 As shown, it includes feature extraction module, generation module, discrimination module and Actor-Critic module, and its workflow is as follows figure 2 As shown, the details are: after obtaining the data, perform data processing, including operations such as establishing word vectors and data enhancement, to obtain the data required for subsequent model building. An image is extracted through the HCP (Hypotheses-CNN-Pooling) feature extraction module to extract high-level semantic features, which are used to initialize the generation module to obtain text descripti...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a picture description generation method and system based on an Actor-Critic generative adversarial network, and the method comprises the following steps: (1) obtaining a picture described by a known text, carrying out the preprocessing, and constructing a training set, (2) establishing a target network based on a generative adversarial network and an Actor-Critic algorithm,wherein the target network comprises a generator network, a discriminator network and a Critic network, (3) inputting the pictures in the training set and the text description of the pictures into the target network, performing pre-training and adversarial training on the generator and the discriminator, and performing single-step updating on the parameters of the generator by adopting an Actor-Critic algorithm, and (4) inputting the target picture needing to generate the text description into the trained generator to obtain the text description of the target picture. According to the method,based on the Actor-Critic algorithm, an adversarial network technology is adopted, and diversified text description can be generated on a given image.

Description

technical field [0001] The present invention relates to the fields of reinforcement learning, computer vision and natural language processing, in particular to a method and system for generating picture descriptions based on Actor-Critic generative confrontation networks. Background technique [0002] Image summarization (Image Caption) is an important problem in computer vision. [0003] In traditional image summarization tasks, the commonly used model is the Encoder-Decoder (encoding-decoding) model. In general, Encoder uses Convolutional Neural Networks (CNN) to extract image features, while Decoder uses Recursive Neural Network (RNN) to generate summaries from image features. This is because CNN performs well in extracting image features, while RNN has a memory ability for input and is outstanding in text generation. [0004] Proposed in a paper published by Qi Wu (Q.Wu, C.Shen, A.van den Hengel, L.Liu, and A.Dick.What Value DoExplicit High-Level Concepts Have in Visio...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & AuthorityApplications(China)

IPC IPC(8): G06F16/58G06N3/04G06N3/08G06K9/62

CPCG06F16/5866G06N3/08G06N3/044G06N3/045G06F18/214

Inventor陈建海李天旭季博楼智豪袁嘉琪何钦铭

OwnerZHEJIANG UNIV

Picture description generation method and system based on Actor-Critic generative adversarial network

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology