Discrete supervision cross-modal hash retrieval method based on semantic preservation

A cross-modal, hashing technology, applied in the field of discrete-supervised cross-modal hash retrieval based on semantic preservation, can solve problems such as high computational complexity and memory overhead, difficult to solve, and loss of scalability.

Pending Publication Date: 2020-11-10
LUDONG UNIVERSITY
View PDF0 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

On the one hand, the similarity matrix between two samples is , in the era of big data The value of is very large, so this matrix usually leads to very high computational complexity and memory overhead, making it lose its scalability
On the other hand, using the Hamming distance between hash codes to approximate the similarity matrix between two samples is a decomposition problem of a symmetric matrix, which is difficult to solve

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Discrete supervision cross-modal hash retrieval method based on semantic preservation
  • Discrete supervision cross-modal hash retrieval method based on semantic preservation
  • Discrete supervision cross-modal hash retrieval method based on semantic preservation

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0140] This embodiment takes the public dataset Mirflickr25K as an example, which contains 25,000 image and text sample pairs, and all sample pairs are distributed in 24 categories. First remove the labels that appear less than 20 times, and then remove the sample pairs that do not contain any labels and categories. There are 20015 sample pairs left in the data set. The image and text samples in the data set were extracted with CaffeNet and BOW (Bag Of Words) algorithms, respectively, with 4096-dimensional CNN features and 500-dimensional BOW features, and the features were averaged. 75% (15011) of the sample pairs are randomly selected to constitute the training set, while the remaining 25% (5004) of the sample pairs constitute the test set. In order to objectively evaluate the retrieval performance of the method of the present invention, the average accuracy rate MPA@r is used as the evaluation standard, where r represents the number of returned samples, and r=100 in the exp...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a discrete supervision cross-modal hash retrieval method based on semantic preservation, which realizes semantic-based image and text cross-modal retrieval, and comprises the following steps of: firstly, collecting an image and text sample pair from a network, and respectively extracting features of an image and a text sample; dividing the data set into a training set and atest set; learning a shared Hamming space for the image and text modalities by keeping semantic consistency of samples in the training set, and learning hash functions for the image and text modalities respectively; taking the training set and the test set as a target set and a query set respectively, and calculating Hamming distances between samples in the query set and heterogeneous samples inthe target set; and sorting from small to large according to the Hamming distance, and returning the sample sorted in the front as a retrieval result. According to the method, images and text samplescan be mapped to a shared Hamming space for efficient retrieval, the training speed is high, the accuracy is high, and the method has a good application prospect.

Description

technical field [0001] The invention relates to the fields of multimedia retrieval and pattern recognition, in particular to a discrete-supervised cross-modal hash retrieval method based on semantic preservation. Background technique [0002] With the rapid development of information technology and personal social networks, the data on the network not only increases rapidly, but also presents a variety of data representations, such as images, texts, voices, etc. This brings two challenges to multimedia retrieval: 1) how to achieve real-time retrieval in massive data; 2) retrieval across modalities. On the one hand, the amount of data on the network is very large, and the representation dimension of samples is also very high, even reaching tens of thousands of dimensions, so the traditional nearest neighbor retrieval method is difficult to apply on large-scale data sets. Hash method is one of the most effective methods to solve large-scale applications. Its idea is to map hi...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/532G06F16/583G06K9/62
CPCG06F16/532G06F16/5846G06F18/22
Inventor 姚涛于泓刘莉王增峰苏庆堂崔光海
Owner LUDONG UNIVERSITY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products