Discrete supervision cross-modal hashing retrieval method based on semantic alignment

A cross-modal and modal technology, applied in the field of discrete supervised cross-modal hash retrieval based on semantic alignment, which can solve problems such as algorithm performance degradation

Active Publication Date: 2018-02-23
LUDONG UNIVERSITY
View PDF5 Cites 36 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, none of the above methods directly learn discrete hash codes, resulting in a decre

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Discrete supervision cross-modal hashing retrieval method based on semantic alignment
  • Discrete supervision cross-modal hashing retrieval method based on semantic alignment
  • Discrete supervision cross-modal hashing retrieval method based on semantic alignment

Examples

Experimental program
Comparison scheme
Effect test

specific Embodiment approach

[0050] The specific embodiment: the specific embodiment of the present invention is described in detail again below:

[0051] The data set of the present invention is divided into a training set and a test set, the training set is used to train the hash functions of each modality, and the test set is used to test the performance of the algorithm;

[0052] see figure 1 , a discrete-supervised cross-modal hash retrieval method based on semantic alignment, characterized in that it consists of two parts: an offline training process and an online retrieval process; the offline training process includes extracting the BOW feature of the text modal sample in the training set, extracting Semantic attributes of the image modality samples in the training set and the learning of the hash function; the online retrieval process first utilizes the BOW algorithm or CNN to extract the features of the text modality samples in the test set or the semantic attribute representation of the image m...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a DSAH (discrete semantic alignment hashing) method based on semantic alignment for cross-modal retrieval. In the training process, a heterogeneous gap is reduced by the aid ofimage attributes and modal alignment semantic information. In order to reduce memory overhead and training time, a latent semantic space is learned by synergistic filtering, and the internal relationbetween a hash code and a label is directly built. Finally, in order to decrease quantization errors, a discrete optimization method is proposed to obtain a hash function with better performances. Inthe retrieval process, samples in a testing set are mapped to a binary space by the hash function, the Hamming distance between a binary code of a query sample and a heterogeneous sample to be retrieved is calculated, and front ranked samples are returned according to the sequence from small to large. Experimental results of two representative multi-modal data sets prove superior performances ofDSAH.

Description

Technical field: [0001] The invention belongs to the technical field of multimedia retrieval, and relates to a cross-modal hash retrieval method, in particular to a discrete-supervised cross-modal hash retrieval method based on semantic alignment. Background technique: [0002] In the era of Web 2.0, people can upload data of various modalities through the network, such as images, texts, and videos. Utilizing data in various modalities to represent information allows netizens to obtain the information they need more intuitively and easily. Although the information representations of these different modalities are different, they may contain the same semantic information. Therefore, for the search content submitted by the user, the data returned by the search engine in multiple modalities can describe the user's search intention more vividly and vividly. , which can improve the experience of network users. [0003] The representations of samples of different modalities are ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30
CPCG06F16/325G06F16/3344G06F16/338G06F16/583G06F16/5866
Inventor 姚涛孔祥维付海燕
Owner LUDONG UNIVERSITY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products