Supercharge Your Innovation With Domain-Expert AI Agents!

A Cross-modal Hash Retrieval Method Guided by Class Semantics

A cross-modal and modal technology, applied in the field of cross-modal hash retrieval based on class semantic guidance, can solve problems such as enhanced learning to hash codes, and achieve the effect of improving quality

Active Publication Date: 2021-03-26
DALIAN UNIV OF TECH
View PDF3 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Such a model not only enhances the discriminativeness of learned hash codes, but also enables the present invention to solve the retrieval problem of invisible domains

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A Cross-modal Hash Retrieval Method Guided by Class Semantics
  • A Cross-modal Hash Retrieval Method Guided by Class Semantics
  • A Cross-modal Hash Retrieval Method Guided by Class Semantics

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0021] Embodiments of the present invention will be further described below in conjunction with the accompanying drawings.

[0022] figure 1 A framework diagram for a cross-modal hash retrieval method guided by class semantics. For ease of explanation, the method designed in the present invention only considers the retrieval scenarios of the most common two modalities (ie, text and image).

[0023] A cross-modal hash retrieval method guided by class semantics, which mainly consists of two steps, namely, projection learning and hash code learning guided by class semantics. In the first step, firstly, according to the ready-made word2vec model, the class names are converted into word vectors, and the class semantic space is constructed. Then, discriminative projections are learned based on an encoder-decoder paradigm guided by class label semantics. In the second step, the raw data is first projected into the common latent semantic space using the projection learned in the pr...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

A cross -modal hash retrieval method based on semantic guidance belongs to the field of computer technology, including: 1) projection learning guided by semantic semantics; 2) hash code learning; 3) similarity between modes and modes between modalsKeep; 4) the construction and optimization of the total target function.The present invention mainly ignores the problem of class semantics in the supervision method. Considering the semantic correlation between the semantic semantic association through the semantic embedded space, and the semantic semantics as the middle layer, the encoder ‑ decoder paradigm is used for projection learning, further furtherGround generates a hash function with a hash code and a specific modal.In addition, because the semantic semantics has established visible and invisible classes, it also solves the problem of retrieval of the invisible domains at the same time.Experiments show that the present invention effectively capture the semantic correlation between categories, improves the quality of the hash code and the performance of cross -modular retrieval, and has the ability to deal with cross -modal retrieval tasks and zero sample cross -modal retrieval tasks.

Description

technical field [0001] The invention belongs to the technical field of computers and relates to a cross-modal hash retrieval method based on class semantic guidance. Background technique [0002] With the advent of the era of big data, multimedia data on the Internet, such as images, text, audio, etc., is growing exponentially. The diversification and multi-dimensional characteristics of these multimedia data make people's retrieval needs change from traditional single-modal data retrieval, such as image retrieval, text retrieval, etc., to mutual retrieval between multi-modal data, especially cross-modal data. search. Cross-modal retrieval is to use samples from one modality to obtain related results from another modality, such as using an image to retrieve text or video related to it. In recent years, many researchers have made unremitting efforts and produced a large number of research results. However, in large-scale retrieval tasks, cross-modal retrieval methods suffe...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/31G06F16/33G06F16/53G06F16/901G06F16/903G06K9/62
CPCG06F16/9014G06F16/903G06F16/53G06F16/325G06F16/33G06F16/3344G06F18/214
Inventor 陈志奎杜佳宁钟芳明
Owner DALIAN UNIV OF TECH
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More