Fine-grained image retrieval method based on self-attention mechanism weighting

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
An image retrieval and attention technology, which is applied in the fields of image retrieval and computer vision, can solve the problems of high computational time consumption and inaccurate selection of target features in the visual attention method, and achieve the effect of improving retrieval accuracy and reducing computational complexity

Active Publication Date: 2020-11-24

HUNAN UNIV

View PDF4 Cites 7 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0006] Aiming at the shortcomings of existing methods such as inaccurate target feature selection and high time-consuming calculation of visual attention methods, the present invention proposes an effective method to create a powerful feature representation for fine-grained image retrieval

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0023] 1. Image preprocessing

[0024] The present invention does not have specific requirements on the resolution of the image, but for consideration of retrieval efficiency, the longest side length of the image is set as an upper limit of 500 pixels during implementation. When the image needs to be scaled down, the original aspect ratio will be maintained. In addition, use the data provided by the ImageNet data to perform zero-mean processing on the pixel values on each channel of the image.

[0025] 2. Obtain the output of the convolutional neural network

[0026] The present invention is an unsupervised method and therefore only uses convolutional neural networks pre-trained in the ImageNet dataset. After inputting the image into the network, the output of the last convolutional layer is selected, and a three-dimensional tensor T of shape h×w×c can be obtained. The 3D tensor output by a convolutional network has two widely used concepts: (1) a feature map consisting o...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention relates to the technical field of image retrieval and computer vision, in particular to a fine-grained image retrieval method based on visual attention mechanism weighting. The method comprises the following steps of: image preprocessing: setting the length of the longest side of an image to be 500 pixels; feature extraction: inputting the image into a convolutional neural network, and then selecting and outputting the features of the last convolutional layer; target feature selection: firstly, optimizing a local activation graph, and then selecting a local feature vector according to an activation graph result, so as to realize more accurate target feature selection; feature weighted aggregation: evaluating the importance degree of each feature, so as to enable the weightedfine-grained local features to still be embodied during pooling aggregation and improve the precision of fine-grained retrieval; and performing image retrieval, and calculating cosine similarity between the characteristic vectors of the queried image and a database image. An image feature extraction and coding detail graph is shown in figure 1. According to the method, fine-grained image retrievalcan be realized, and the retrieval accuracy is improved.

Description

technical field [0001] The invention relates to the technical fields of image retrieval and computer vision. In particular, it relates to a fine-grained image retrieval method based on visual attention mechanism weighting. Background technique [0002] Although image retrieval has achieved remarkable performance, it is still a challenging problem at the fine-grained image level. Compared to general image retrieval tasks, fine-grained image retrieval methods should be able to localize and express subtle visual differences within subcategories. For example, given a query image that contains a subcategory of a base category, such as the "black-footed albatross" subcategory of birds, we should return images from the database that are in the same subcategory as the query, rather than simply returning an arbitrary bird class image. [0003] Existing deep learning-based image retrieval methods can be divided into two groups according to whether they need to train models on new d...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): G06F16/583G06K9/46G06K9/62

CPCG06F16/583G06V10/44G06F18/22

Inventor 林红利吴汉王伟胜贺可心

Owner HUNAN UNIV

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Fine-grained image retrieval method based on self-attention mechanism weighting

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology