An image description generation system and method based on a weighing attention mechanism

An image description and attention technology, applied in the field of image understanding, can solve the problems of lack of polishing process, low recognition of generated description, inconsistent training and testing process, etc., to achieve accurate image description, improve accuracy, ease training and testing The effect of process inconsistency

Active Publication Date: 2019-05-07
UNIV OF ELECTRONICS SCI & TECH OF CHINA
View PDF8 Cites 32 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] The technical problem to be solved by the present invention is to provide an image description generation system and method based on deliberation and attention mechanism, which solves the problems of lack of polishing process, inconsistent training and testing process, and low recognition degree of generated description existing in existing image description schemes. question

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • An image description generation system and method based on a weighing attention mechanism
  • An image description generation system and method based on a weighing attention mechanism
  • An image description generation system and method based on a weighing attention mechanism

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0036] The purpose of the present invention is to provide an image description generation system and method based on deliberation and attention mechanism, which solves the problems of lack of polishing process, inconsistent training and testing process, and low recognition degree of generated description existing in existing image description schemes.

[0037] The image description generation system based on the deliberate attention mechanism in the present invention includes three parts: an encoder, a decoder based on the deliberate attention mechanism and a reinforcement learning module. The following is a detailed introduction to each part:

[0038] ① Encoder is an important part of the image description generation model. Encoders are generally used to extract visual information from images. Convolutional neural networks are generally used to extract global features of images. For a specific object, the local features extracted based on R-CNN contain richer information th...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to the field of image understanding, discloses an image description generation system and method based on a weighing attention mechanism, and solves the problems that an existingimage description scheme lacks a polishing process, the training process and the testing process are inconsistent, and the generation description recognition degree is not high. The method comprisesthe following steps: a, processing a data set: extracting global features and local features of an image, constructing the data set, marking words in the data set, and generating corresponding word embedding vectors; B, training an image description generation model: generating rough image description by adopting a first layer of decoder based on a residual attention mechanism, and carrying out polishing on the generated image description by adopting a second layer of decoder based on the residual attention mechanism; And c, further training the model in combination with reinforcement learning: simulating a test process of the model in the training process, guiding the training of the model by generating a described CIDEr score, and adjusting the model in combination with reinforcement learning.

Description

technical field [0001] The invention relates to the field of image understanding, in particular to an image description generation system and method based on deliberation and attention mechanism. Background technique [0002] The task of image description is: given a picture, automatically generate the corresponding natural language description. The generated sentences are required to be fluent and can describe the objects and scenes in the pictures. This research direction can be applied in many ways. For example: helping blind people understand the content of pictures. [0003] Traditional image description models generally adopt an encoder-decoder framework combined with an attention mechanism. The framework has achieved great results. But it still has the following defects: [0004] First, the training and testing process of the traditional model is to generate a description as the final result through a decoder. This method lacks the polishing process, so the resu...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/00G06K9/62G06K9/72
Inventor 宋井宽樊凯旋高联丽
Owner UNIV OF ELECTRONICS SCI & TECH OF CHINA
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products