Semantic image retrieval method based on attention mechanism

A technology of image retrieval and attention, which is applied in the field of image processing, can solve problems that have not been completely solved, have many types of features, and cannot retrieve similar pictures, so as to achieve good conversion effects and overcome the semantic gap.

Active Publication Date: 2020-10-16
XIDIAN UNIV
View PDF6 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Therefore, many improved algorithms appeared on the basis of CNN+hash coding. However, this method also has shortcomings, that is, the problem of "semantic gap" in image retrieval has not been completely solved, that is, it cannot realize the image semantics. angle to retrieve similar images
The disadvantage of this method is that the retrieval system is too complex and there are too many types of features, which will greatly affect the speed of retrieval, and cannot effectively overcome or reduce the "semantic gap" problem in the retrieval process

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Semantic image retrieval method based on attention mechanism
  • Semantic image retrieval method based on attention mechanism
  • Semantic image retrieval method based on attention mechanism

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0027] Below with reference to accompanying drawing and embodiment, the present invention is described in further detail:

[0028] refer to figure 1 , the concrete steps that the present invention realizes are as follows:

[0029] Step 1, construct a CNN-RNN network model including attention mechanism and train it:

[0030] (1a) Perform preprocessing operations on pictures and corresponding image titles in the MS COCO dataset, including word segmentation, syntactic analysis and word vectors;

[0031] (1b) Construct a convolutional neural network VGG encoder and a cyclic neural network LSTM decoder, and add an attention mechanism to the decoder to obtain a CNN-RNN network model composed of an encoder and a decoder;

[0032] The core structure of the above-mentioned convolutional neural network VGG encoder, that is, the inception module, such as image 3 As shown, the inception v2 network is formed by stacking the modules; the construction of the convolutional neural network ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a semantic image retrieval method based on an attention mechanism, and mainly solves the problem that a semantic gap affects retrieval accuracy in a picture retrieval process.The method comprises the following steps: 1) constructing a CNN-RNN network model containing an attention mechanism and training the CNN-RNN network model; 2) extracting text features of pictures in an image library by using the trained network model; 3) extracting semantic feature vectors of the text features by using a text vector doc2vec model and storing the semantic feature vectors; 4) extracting text features of the query picture by using the trained network model, and extracting semantic feature vectors corresponding to the text features; and 5) calculating and comparing the feature vector of the query picture with the feature vector in the image library by using a cosine method, and outputting a result. According to the method, the influence caused by a semantic gap can be effectively reduced, so that the system can perform similarity retrieval on semantic information shown by the pictures, and the method can be used for quick retrieval of mass data in the Internet and search of mobile phone pictures in daily life.

Description

technical field [0001] The invention belongs to the technical field of image processing, and further relates to an image-based pattern recognition technology, specifically a semantic image retrieval method based on an attention mechanism. During the image retrieval process, for a query image (query image), an image similar to the query image in the image library is searched and output. Background technique [0002] Image retrieval refers to given an image containing specific content, and then finds images containing similar content in the image database, but because different images are quite different under the influence of factors such as shooting angle, occlusion, and illumination, how to solve the above-mentioned impossible problems? Finding the desired image quickly under the influence of control factors is a challenging problem. In today's Internet era, a huge amount of images are uploaded to the server every moment on the Internet, especially with the rise of social ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/583G06F40/30G06K9/62G06K9/46G06N3/04G06N3/08
CPCG06F16/583G06F40/30G06N3/049G06N3/08G06V10/50G06V10/464G06V10/56G06N3/045G06F18/22
Inventor 韩红杨慎全
Owner XIDIAN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products