Discrete supervised cross-media Hash retrieval method based on collaborative matrix decomposition

A matrix factorization, cross-media technology, applied in the field of discrete supervised cross-media hash retrieval, which can solve problems such as loss of category information, difficulty in directly solving the objective function of hashing methods, and complex semantic associations.

Pending Publication Date: 2019-08-09
LUDONG UNIVERSITY
View PDF3 Cites 17 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, there are two problems in this type of method: 1) Using labels to construct the similarity matrix between two samples will cause the loss of category information; 2) The size of the similarity matrix between two samples is , however in large-scale applications, The value of is very large, so it will bring excessive memory overhead and computational complexity, making it lose the ability to expand
Most existing methods only keep learning hash codes based on the semantic similarity of class labels. However, due to the complex semantic association between heterogeneous samples, it is difficult for many heterogeneous samples with the same class labels to be mapped into similar Hamming codes.
In addition, due to the discrete constraints of the hash code, the objective function of the hash method is difficult to solve directly

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Discrete supervised cross-media Hash retrieval method based on collaborative matrix decomposition
  • Discrete supervised cross-media Hash retrieval method based on collaborative matrix decomposition
  • Discrete supervised cross-media Hash retrieval method based on collaborative matrix decomposition

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0124]This embodiment takes the public dataset NUS-WIDE as an example, which contains 269,648 image and text sample pairs, and all sample pairs are distributed in 81 categories. In order to make each class have enough samples for training, the 21 classes with the most samples are selected, so 196,776 image and text sample pairs are reserved. The image and text samples in the data set were extracted with CaffeNet and BOW (BagOf Words) algorithms, respectively, with 4096-dimensional CNN features and 1000-dimensional BOW features, and the features were averaged. 99% of the sample pairs are randomly selected to form the training set, and the remaining 1% of the sample pairs form the test set. In order to objectively evaluate the performance of the method of the present invention, the average accuracy rate MPA@100 is used as the evaluation standard, and MPA@100 means that the MAP is calculated from the first 100 returned samples. On this dataset, the MAP@100 results are shown in T...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a discrete supervised cross-media Hash retrieval method based on collaborative matrix decomposition, which comprises the following steps of: 1) establishing a cross-media retrieval database, and dividing sample pairs into a training set and a test set; 2) extracting features of all sample pairs in the training set and the test set, and performing mean value removal; 3) projecting features and category labels of the samples to a low-dimensional feature space and a Hash code by using collaborative matrix decomposition and semantic embedding respectively, learning an orthogonal rotation matrix to construct semantic association of the low-dimensional features and the Hash code, and learning a Hash function for each mode; 5) generating a hash code of the test sample byutilizing the learned hash function; 6) taking the samples in the training set as to-be-retrieved samples, taking the samples in the test set as query samples, and calculating the Hamming distance between the query samples and the to-be-retrieved samples; 7) arranging according to a Hamming distance descending order, and returning the first r heterogeneous samples as retrieval results. According to the cross-media retrieval method based on the Hamming distance, cross-media retrieval can be achieved, occupied resources in the training process are few, the accuracy rate is high, and the cross-media retrieval method based on the Hamming distance has wide application prospects.

Description

technical field [0001] The invention relates to the fields of multimedia retrieval and artificial intelligence, in particular to a discrete supervised cross-media hash retrieval method based on collaborative matrix decomposition. Background technique [0002] With the rapid growth of data volume on the web, how to retrieve semantically similar samples in large-scale data becomes a challenge. On the one hand, due to the high time complexity and storage overhead, it is difficult to directly apply the traditional nearest neighbor retrieval method to large-scale data. On the other hand, the media types of samples on the Internet are diverse, and how to measure the similarity between heterogeneous samples across the gap between media types has become a challenge. Cross-media hashing maps the high-dimensional features of heterogeneous samples to a shared low-dimensional Hamming space to measure the similarity of heterogeneous samples. Due to its high efficiency and effectiveness,...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/432G06K9/62
CPCG06F16/432G06F18/214
Inventor 姚涛唐文静李阿莉付海燕盛国瑞于泓刘莉
Owner LUDONG UNIVERSITY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products