Unlock instant, AI-driven research and patent intelligence for your innovation.

A method and device for generating hash codes using multi-layer feature fusion

A technology of feature fusion and feature map, applied in neural learning methods, biological neural network models, special data processing applications, etc., can solve problems such as missing spatial information, achieve high retrieval accuracy, improve average accuracy, and reduce information loss effect

Active Publication Date: 2022-04-08
SHANGHAI UNIV
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The above methods based on deep convolutional deep network learning only use the features extracted from a certain layer of deep convolutional neural network to represent data of different modalities. The extracted features are usually high-level features in the network, such as Simonyan, Karen, and Andrew Zisserman. "Very Deep Convolution Networks for Large-Scale Image Recognition." International Conference on Learning Representations. 2015. The output of the fc8 layer of the proposed VGG network, however, most of the high-level features encode semantic features, and a lot of spatial information is lost

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A method and device for generating hash codes using multi-layer feature fusion
  • A method and device for generating hash codes using multi-layer feature fusion
  • A method and device for generating hash codes using multi-layer feature fusion

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0101] The embodiments of the present invention are described in detail below. This embodiment is implemented on the premise of the technical solution of the present invention, and detailed implementation methods and specific operating procedures are provided, but the protection scope of the present invention is not limited to the following implementation example.

[0102] Such as figure 1 Shown is a flow chart of a method for generating hash codes using multi-layer feature fusion according to an embodiment of the present invention.

[0103] Please refer to figure 1 , the method for generating a hash code using multi-layer feature fusion in this embodiment includes:

[0104] S11: Establish a similarity matrix of image-text pairs;

[0105] According to the label information in the data set, the similarity matrix S of the image-text pair is established, and the element S in S is ij Indicates the similarity between the i-th image and the j-th text, if the image is similar to ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a method and device for producing hash codes by fusion of multi-layer features, including: establishing a similarity matrix of image-text pairs; obtaining the features of different layers through the output of different residual blocks, and combining the The features of the image are converted into feature maps with the same number of channels and sizes, and then fused, and finally the hash code corresponding to the image is obtained through global pooling and full connection and discretization; the multi-scale fusion module is used to generate corresponding multi-scale for each text BOW model, then obtain features of different scales through the convolutional layer and fuse them, and finally obtain the hash code corresponding to the text through the fully connected layer; design a loss function; train the model; use the trained model to input samples into it to obtain the corresponding hashcode. Through the present invention, the generated hash code is more discriminative, and can effectively improve the average retrieval accuracy rate when used for cross-modal retrieval.

Description

technical field [0001] The invention relates to the technical field of image retrieval, in particular to a method and device for generating a hash code by fusion of multi-layer features. Background technique [0002] With the rapid development of the network, more and more data of different modalities appear on the Internet, such as images, texts, etc. People hope to find the information they need in various forms of data. Traditional single-modal retrieval can no longer meet people's needs, so cross-modal retrieval is proposed. The goal of cross-modal retrieval is to use a query from one modality (such as image) to find semantically similar instances in another modality (such as text). However, the similarity measurement between different modality data is very challenging due to the heterogeneity difference between different modality data, and the semantic gap between low-level features and high-level semantics. A common approach to make up this difference is to map diffe...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/901G06F16/903G06N3/04G06N3/08
CPCG06F16/9014G06F16/90335G06N3/08G06N3/045
Inventor 马然余海波苏敏安平
Owner SHANGHAI UNIV