Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Fine-grained image retrieval method based on self-attention mechanism weighting

An image retrieval and attention technology, which is applied in the fields of image retrieval and computer vision, can solve the problems of high computational time consumption and inaccurate selection of target features in the visual attention method, and achieve the effect of improving retrieval accuracy and reducing computational complexity

Active Publication Date: 2020-11-24
HUNAN UNIV
View PDF4 Cites 7 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] Aiming at the shortcomings of existing methods such as inaccurate target feature selection and high time-consuming calculation of visual attention methods, the present invention proposes an effective method to create a powerful feature representation for fine-grained image retrieval

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Fine-grained image retrieval method based on self-attention mechanism weighting
  • Fine-grained image retrieval method based on self-attention mechanism weighting
  • Fine-grained image retrieval method based on self-attention mechanism weighting

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0023] 1. Image preprocessing

[0024] The present invention does not have specific requirements on the resolution of the image, but for consideration of retrieval efficiency, the longest side length of the image is set as an upper limit of 500 pixels during implementation. When the image needs to be scaled down, the original aspect ratio will be maintained. In addition, use the data provided by the ImageNet data to perform zero-mean processing on the pixel values ​​on each channel of the image.

[0025] 2. Obtain the output of the convolutional neural network

[0026] The present invention is an unsupervised method and therefore only uses convolutional neural networks pre-trained in the ImageNet dataset. After inputting the image into the network, the output of the last convolutional layer is selected, and a three-dimensional tensor T of shape h×w×c can be obtained. The 3D tensor output by a convolutional network has two widely used concepts: (1) a feature map consisting o...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to the technical field of image retrieval and computer vision, in particular to a fine-grained image retrieval method based on visual attention mechanism weighting. The method comprises the following steps of: image preprocessing: setting the length of the longest side of an image to be 500 pixels; feature extraction: inputting the image into a convolutional neural network, and then selecting and outputting the features of the last convolutional layer; target feature selection: firstly, optimizing a local activation graph, and then selecting a local feature vector according to an activation graph result, so as to realize more accurate target feature selection; feature weighted aggregation: evaluating the importance degree of each feature, so as to enable the weightedfine-grained local features to still be embodied during pooling aggregation and improve the precision of fine-grained retrieval; and performing image retrieval, and calculating cosine similarity between the characteristic vectors of the queried image and a database image. An image feature extraction and coding detail graph is shown in figure 1. According to the method, fine-grained image retrievalcan be realized, and the retrieval accuracy is improved.

Description

technical field [0001] The invention relates to the technical fields of image retrieval and computer vision. In particular, it relates to a fine-grained image retrieval method based on visual attention mechanism weighting. Background technique [0002] Although image retrieval has achieved remarkable performance, it is still a challenging problem at the fine-grained image level. Compared to general image retrieval tasks, fine-grained image retrieval methods should be able to localize and express subtle visual differences within subcategories. For example, given a query image that contains a subcategory of a base category, such as the "black-footed albatross" subcategory of birds, we should return images from the database that are in the same subcategory as the query, rather than simply returning an arbitrary bird class image. [0003] Existing deep learning-based image retrieval methods can be divided into two groups according to whether they need to train models on new d...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/583G06K9/46G06K9/62
CPCG06F16/583G06V10/44G06F18/22
Inventor 林红利吴汉王伟胜贺可心
Owner HUNAN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products