Automatic image description method and system based on mixed attention mechanism

An automatic image description technology, applied in image analysis, image data processing, and related computing fields, that addresses the problem of inaccurate attention.

Active Publication Date: 2022-07-01
JIANGXI UNIVERSITY OF FINANCE AND ECONOMICS
4 Cites, 2 Cited by

AI Technical Summary

Problems solved by technology

[0004] Although the attention mechanism can effectively improve the performance of automatic image description methods, current methods still have problems such as insufficient attention, which causes the generated description to mention objects that do not appear in the image.




Embodiment Construction

[0068] The embodiments of the present invention are described in detail below. Examples of the embodiments are illustrated in the accompanying drawings, in which the same or similar reference numerals denote the same or similar elements, or elements having the same or similar functions, throughout. The embodiments described below with reference to the accompanying drawings are exemplary, serve only to explain the present invention, and should not be construed as limiting it.

[0069] These and other aspects of the embodiments of the present invention will become apparent from the following description and the accompanying drawings. The description and drawings disclose some specific implementations to show some of the ways in which the principles of the embodiments may be carried out, but it should be understood that the scope of the embodiments of the invention is not limited thereby....


Abstract

The invention provides an automatic image description method and system based on a mixed attention mechanism. The method comprises the following steps: acquiring regional image features and position information of the target bounding boxes in an image to be described; inputting the regional image features into a machine attention module to obtain machine attention features; acquiring cognitive data recorded while humans perform the image description task, and constructing from that data a visual cognition model of humans performing the task; and obtaining attention features from the visual cognition model and performing fusion according to the attention features to obtain the final image description. By combining attention guided by a human cognitive mechanism with conventional machine attention, the method provides a better reference for the attention weights used during description generation, so that more accurate descriptions are generated and the performance of automatic image description is improved.
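The fusion step in the abstract can be illustrated with a minimal sketch. The excerpt does not give the patent's actual fusion formula, so everything here is an assumption made for illustration: dot-product scoring for the machine attention, a per-region human attention prior supplied as plain weights, and a convex combination controlled by a hypothetical parameter `alpha`. The function names are likewise hypothetical.

```python
import math
from typing import List


def softmax(scores: List[float]) -> List[float]:
    """Numerically stable softmax over a list of scores."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def machine_attention(region_feats: List[List[float]], query: List[float]) -> List[float]:
    """Assumed machine attention: dot-product score per region, then softmax."""
    scores = [sum(f * q for f, q in zip(feat, query)) for feat in region_feats]
    return softmax(scores)


def fuse_attention(machine_w: List[float], human_w: List[float], alpha: float = 0.5) -> List[float]:
    """Assumed fusion rule: convex combination of the two weight vectors,
    renormalized so the result sums to 1."""
    mixed = [alpha * m + (1.0 - alpha) * h for m, h in zip(machine_w, human_w)]
    total = sum(mixed)
    return [w / total for w in mixed]


def attend(region_feats: List[List[float]], weights: List[float]) -> List[float]:
    """Weighted sum of region features, giving one context vector."""
    dim = len(region_feats[0])
    return [sum(w * feat[d] for w, feat in zip(weights, region_feats)) for d in range(dim)]


# Toy data: three regions with 2-D features, a decoder query, and a
# human attention prior (for example, derived from cognitive data).
region_feats = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
query = [1.0, 0.0]
human_w = [0.2, 0.2, 0.6]

machine_w = machine_attention(region_feats, query)
mixed_w = fuse_attention(machine_w, human_w, alpha=0.5)
context = attend(region_feats, mixed_w)
```

In this toy setup the machine attention scores regions 0 and 2 equally, and the human prior breaks the tie in favor of region 2. Setting `alpha=1.0` recovers pure machine attention, which makes the contribution of the human-guided term easy to isolate.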

Description

Technical field

[0001] The invention relates to the technical field of computer images, in particular to an automatic image description method and system based on a mixed attention mechanism.

Background technique

[0002] In the computer field, image description generation is a comprehensive problem that integrates computer vision and natural language processing. Although the image description task is very easy for humans, it is very difficult for machines to understand the content of images and describe them in natural language due to the heterogeneous nature of data from different modalities. Not only are machines required to generate fluent and human-understandable sentences, but sentences are also required to represent complete image content.

[0003] Inspired by the application of attention mechanism in machine translation, some researchers have introduced attention mechanism in the traditional "encode-decode" framework, which significantly improves the performance of ...


Application Information

Patent Type & Authority: Applications (China)
IPC (8): G06T7/73, G06T7/11, G06N3/04, G06K9/62, G06V10/80
CPC: G06T7/73, G06T7/11, G06N3/044, G06F18/253
Inventor: 姜文晖, 李钦, 方玉明, 沈飞, 刘扬
Owner: JIANGXI UNIVERSITY OF FINANCE AND ECONOMICS