Unlock instant, AI-driven research and patent intelligence for your innovation.

Image description generation method and device based on deep residual network and attention

An image description and attention technology, applied in the field of image processing, can solve the problems of deep neural network accuracy decline, etc., to solve the problem of gradient explosion, speed up and effect, and solve the effect of tending to saturation and decline

Active Publication Date: 2022-02-18
QILU UNIV OF TECH
View PDF15 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] In order to overcome the deficiencies of the above-mentioned existing technologies, the present disclosure provides a method and device for generating image descriptions based on deep residual networks and attention, which solves the problem of decreased accuracy of deep neural networks, and uses deep residual networks to learn images from Low-level to high-level image features, by embedding the input image into a fixed vector to generate a rich input image representation, and then combined with an attentional recurrent long short-term memory network to generate natural and smooth description sentences

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Image description generation method and device based on deep residual network and attention
  • Image description generation method and device based on deep residual network and attention
  • Image description generation method and device based on deep residual network and attention

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0060] The present disclosure will be further described below in conjunction with the accompanying drawings and embodiments.

[0061] It should be noted that the following detailed description is exemplary and intended to provide further explanation of the present disclosure. Unless defined otherwise, all technical and scientific terms used in this disclosure have the same meaning as commonly understood by one of ordinary skill in the art to which this disclosure belongs.

[0062] It should be noted that the terminology used here is only for describing specific implementations, and is not intended to limit the exemplary implementations according to the present application. As used herein, unless the context clearly dictates otherwise, the singular is intended to include the plural, and it should also be understood that when the terms "comprising" and / or "comprising" are used in this specification, they mean There are features, steps, operations, means, components and / or combi...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an image description generation method and device based on a deep residual network and attention, which solves the problem of decreased accuracy of the deep neural network, uses the deep residual network to learn image features from the bottom layer to the high layer, and generates rich The input image representation is then combined with an attentional recurrent long-short-term memory network to generate natural and fluent description sentences. The method includes the following steps: acquiring a large amount of image sample data and preprocessing it; extracting image features of the preprocessed image sample data; using a residual neural network model to process the extracted image features to generate an image representation; The image representation is mapped to the input of the attention-based cyclic long-short-term memory network language model, and the attention-based cyclic long-short-term memory network language model is used to predict the word vector of the image and generate a description sentence for the image.

Description

technical field [0001] The present disclosure relates to the field of image processing, in particular to an image description generation method and device based on a deep residual network and attention. Background technique [0002] Image description generation technology is closely related to technologies such as image semantic analysis, image annotation and image advanced semantic extraction. Deep learning has shown promising performance in recent years on both image and natural language processing tasks. [0003] In recent years, deep convolutional networks have made a series of breakthroughs in image classification and image recognition. The deep network makes the features richer by superimposing the depth of the layer. Many important visual recognition tasks also benefit from deep models. However, as the depth of the network increases, the accuracy begins to saturate, and then declines rapidly, and the problem of model degradation occurs. During the research and dev...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06V10/774G06V10/764G06V10/82G06K9/62G06N3/04G06N3/08
Inventor 杨振宇张姣
Owner QILU UNIV OF TECH