Cross-modal hash retrieval method based on supervision graph embedding

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A graph embedding, cross-modal technology, applied in digital data information retrieval, instrumentation, computing and other directions, can solve the problems of reduced hash code effectiveness, unsatisfactory retrieval results, quantization errors, etc., to enhance the ability to distinguish, The effect of improving retrieval performance and improving representation ability

Inactive Publication Date: 2022-04-15

LUDONG UNIVERSITY +1

View PDF0 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

However, the threshold operation will bring quantization error, which usually leads to the reduction of the effectiveness of the hash code; 3) most hashing methods based on graph embeddings fail to fully utilize the class labels of the samples during the training process, resulting in their retrieval results hard to satisfy

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0054] The specific embodiment of the present invention is described in detail below in conjunction with accompanying drawing:

[0055] Although the present invention specifies two modalities, image and text, the algorithm can be easily extended to other modalities and situations with more than two modalities. For convenience of description, the present invention only considers two modes of image and text.

[0056] Such as figure 1 As shown, a cross-modal hash retrieval method based on supervised graph embedding, which includes the following steps:

[0057] 1) Step S1, crawl image-text sample pairs from the webpage where images and text modals co-occur, construct image-text dataset, and randomly divide the dataset into training set and test set;

[0058] 2) Step S2, extracting the 512-dimensional GIST feature of all images in the training set and the test set and the 1000-dimensional BOW feature of the text;

[0059] 3) Step S3, designing the overall objective function of t...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a cross-modal hash retrieval method based on supervision graph embedding, and belongs to the technical field of multimedia retrieval. Constructing an image and text modal data set, and dividing the image and text modal data set into a training set and a test set; extracting features of all image and text samples of the training set and the test set, and mapping the extracted features to a nonlinear kernel space by using a radial basis kernel function to improve the characterization capability of the features; generating a pairwise similarity matrix of the samples by utilizing the category labels, and further generating a Laplacian matrix; generating a semantic subspace by utilizing the category label; learning a mapping matrix for image and text modalities by utilizing intra-modal similarity keeping based on an image embedding method and inter-modal similarity keeping based on a semantic subspace; learning an orthogonal rotation matrix to minimize a quantization error; and an efficient iterative discrete optimization algorithm is utilized to reduce the calculation complexity of the training process.

Description

technical field [0001] The invention relates to a cross-modal hash retrieval method based on supervision graph embedding, belonging to the technical field of multimedia retrieval. Background technique [0002] In recent years, graph embedding based hashing methods have attracted much attention due to their effectiveness in cross-modal retrieval. The hash method is to map similar samples in the original space into similar hash codes, and then calculate the Hamming distance between the hash codes by XOR operation, and use the Hamming distance to measure the similarity between samples, which can significantly reduce the Computational complexity and memory overhead. Inspired by this, researchers have proposed many hashing methods for large-scale retrieval tasks in recent years. However, most methods are only applied to unimodal data, i.e. the modality type of the retrieved sample is the same as the modality type of the query sample, e.g. using images to retrieve images. But t...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G06F16/432

Inventor 姚涛张林梁李朝霞彭守永李艺茹王丽丽张淑宁

Owner LUDONG UNIVERSITY

Features

Generate Ideas
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Cross-modal hash retrieval method based on supervision graph embedding

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology