Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Cross-modal hash retrieval method based on self-learning

A cross-modal, self-learning technology, applied in the computer field, can solve problems such as ignoring binary constraints, errors, and reducing the ability to distinguish binary codes, so as to reduce coding errors and improve quality

Active Publication Date: 2020-07-28
DALIAN UNIV OF TECH
View PDF2 Cites 13 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] However, both unsupervised hashing methods and supervised hashing methods have a common limitation: in the quantization stage, most of them ignore the binary constraints and adopt a simple threshold strategy to generate the final binary hash code, which will lead to a large number of quantization errors and reduce the discrimination ability of binary codes

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Cross-modal hash retrieval method based on self-learning
  • Cross-modal hash retrieval method based on self-learning
  • Cross-modal hash retrieval method based on self-learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0019] Embodiments of the present invention will be further described below in conjunction with the accompanying drawings.

[0020] figure 1 A framework diagram of a self-learning based cross-modal hash retrieval method. For simplicity of description, the present invention uses the two most common modalities (text and image) as samples for cross-modal retrieval. First, the co-matrix factorization technique is employed to project the feature data of different modalities into a common latent semantic space. Secondly, the common semantic space is rotated to minimize the variance of data of different dimensions through orthogonal transformation technology, so that the binary quantization loss is minimized, so that samples of the same category but not spatially related can be further converted into similar binary codes. Also, consider maintaining intra-modal and inter-modal similarities. For intra-modal similarity, local geometry is used for learning; for inter-modal similarity,...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

A cross-modal hash retrieval method based on self-learning belongs to the technical field of computers, and comprises the following steps: 1) learning potential public semantic features of different modals through a co-matrix decomposition technology; 2) learning a unified binary code with discrimination through orthogonal transformation and quantization processes; 3) maintaining and integrating the similarity in the modes and the similarity between the modes into a graph regularization item, and embedding the graph regularization item into a binary code generation process; 4) calculating andoptimizing a target function, and iteratively updating a plurality of matrix variables until a convergence condition is met; and 5) completing learning of a specific modal hash function by adopting aself-learning framework. Aiming at the problem of large quantization error caused by a threshold strategy, the binary coding loss of common representation of different modes is minimized, the similarity between the interiors of the modes and the similarity between the modes are embedded, and a self-learning hash scheme is introduced to learn a hash function with higher discrimination. Coding errors in a binary quantization stage can be effectively reduced, and the quality of hash codes and the performance of cross-modal retrieval are improved.

Description

technical field [0001] The invention belongs to the technical field of computers and relates to a cross-modal hash retrieval method based on self-learning. Background technique [0002] With the rapid development of information retrieval technology and the popularization of various digital devices, a large amount of multimedia data, such as text, images, and videos, appears in the Internet. These multimedia data not only have a considerable amount, but also contain a variety of modalities of different dimensions. Since data of different modalities usually describe the same object or event, how to use one of the modal data to retrieve the results of other modalities related to it has become an urgent problem to be solved. In recent years, many researchers have invested in the field of cross-modal retrieval and achieved great success. However, when the data dimension is high and the scale is large, the retrieval cost of most cross-modal retrieval methods can be very large. ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06K9/62G06F16/48G06F16/40
CPCG06F18/2411G06F18/24
Inventor 陈志奎钟芳明杜佳宁仇希如
Owner DALIAN UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products