Cross-modal hash retrieval method based on self-learning

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A cross-modal, self-learning technology, applied in the computer field, can solve problems such as ignoring binary constraints, errors, and reducing the ability to distinguish binary codes, so as to reduce coding errors and improve quality

Active Publication Date: 2020-07-28

DALIAN UNIV OF TECH

View PDF2 Cites 13 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0004] However, both unsupervised hashing methods and supervised hashing methods have a common limitation: in the quantization stage, most of them ignore the binary constraints and adopt a simple threshold strategy to generate the final binary hash code, which will lead to a large number of quantization errors and reduce the discrimination ability of binary codes

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0019] Embodiments of the present invention will be further described below in conjunction with the accompanying drawings.

[0020] figure 1 A framework diagram of a self-learning based cross-modal hash retrieval method. For simplicity of description, the present invention uses the two most common modalities (text and image) as samples for cross-modal retrieval. First, the co-matrix factorization technique is employed to project the feature data of different modalities into a common latent semantic space. Secondly, the common semantic space is rotated to minimize the variance of data of different dimensions through orthogonal transformation technology, so that the binary quantization loss is minimized, so that samples of the same category but not spatially related can be further converted into similar binary codes. Also, consider maintaining intra-modal and inter-modal similarities. For intra-modal similarity, local geometry is used for learning; for inter-modal similarity,...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

A cross-modal hash retrieval method based on self-learning belongs to the technical field of computers, and comprises the following steps: 1) learning potential public semantic features of different modals through a co-matrix decomposition technology; 2) learning a unified binary code with discrimination through orthogonal transformation and quantization processes; 3) maintaining and integrating the similarity in the modes and the similarity between the modes into a graph regularization item, and embedding the graph regularization item into a binary code generation process; 4) calculating andoptimizing a target function, and iteratively updating a plurality of matrix variables until a convergence condition is met; and 5) completing learning of a specific modal hash function by adopting aself-learning framework. Aiming at the problem of large quantization error caused by a threshold strategy, the binary coding loss of common representation of different modes is minimized, the similarity between the interiors of the modes and the similarity between the modes are embedded, and a self-learning hash scheme is introduced to learn a hash function with higher discrimination. Coding errors in a binary quantization stage can be effectively reduced, and the quality of hash codes and the performance of cross-modal retrieval are improved.

Description

technical field [0001] The invention belongs to the technical field of computers and relates to a cross-modal hash retrieval method based on self-learning. Background technique [0002] With the rapid development of information retrieval technology and the popularization of various digital devices, a large amount of multimedia data, such as text, images, and videos, appears in the Internet. These multimedia data not only have a considerable amount, but also contain a variety of modalities of different dimensions. Since data of different modalities usually describe the same object or event, how to use one of the modal data to retrieve the results of other modalities related to it has become an urgent problem to be solved. In recent years, many researchers have invested in the field of cross-modal retrieval and achieved great success. However, when the data dimension is high and the scale is large, the retrieval cost of most cross-modal retrieval methods can be very large. ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G06K9/62G06F16/48G06F16/40

CPCG06F18/2411G06F18/24

Inventor 陈志奎钟芳明杜佳宁仇希如

Owner DALIAN UNIV OF TECH

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Cross-modal hash retrieval method based on self-learning

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology