Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Cross-modal hash retrieval method based on supervision graph embedding

A graph embedding, cross-modal technology, applied in digital data information retrieval, instrumentation, computing and other directions, can solve the problems of reduced hash code effectiveness, unsatisfactory retrieval results, quantization errors, etc., to enhance the ability to distinguish, The effect of improving retrieval performance and improving representation ability

Inactive Publication Date: 2022-04-15
LUDONG UNIVERSITY +1
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the threshold operation will bring quantization error, which usually leads to the reduction of the effectiveness of the hash code; 3) most hashing methods based on graph embeddings fail to fully utilize the class labels of the samples during the training process, resulting in their retrieval results hard to satisfy

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Cross-modal hash retrieval method based on supervision graph embedding
  • Cross-modal hash retrieval method based on supervision graph embedding
  • Cross-modal hash retrieval method based on supervision graph embedding

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0054] The specific embodiment of the present invention is described in detail below in conjunction with accompanying drawing:

[0055] Although the present invention specifies two modalities, image and text, the algorithm can be easily extended to other modalities and situations with more than two modalities. For convenience of description, the present invention only considers two modes of image and text.

[0056] Such as figure 1 As shown, a cross-modal hash retrieval method based on supervised graph embedding, which includes the following steps:

[0057] 1) Step S1, crawl image-text sample pairs from the webpage where images and text modals co-occur, construct image-text dataset, and randomly divide the dataset into training set and test set;

[0058] 2) Step S2, extracting the 512-dimensional GIST feature of all images in the training set and the test set and the 1000-dimensional BOW feature of the text;

[0059] 3) Step S3, designing the overall objective function of t...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a cross-modal hash retrieval method based on supervision graph embedding, and belongs to the technical field of multimedia retrieval. Constructing an image and text modal data set, and dividing the image and text modal data set into a training set and a test set; extracting features of all image and text samples of the training set and the test set, and mapping the extracted features to a nonlinear kernel space by using a radial basis kernel function to improve the characterization capability of the features; generating a pairwise similarity matrix of the samples by utilizing the category labels, and further generating a Laplacian matrix; generating a semantic subspace by utilizing the category label; learning a mapping matrix for image and text modalities by utilizing intra-modal similarity keeping based on an image embedding method and inter-modal similarity keeping based on a semantic subspace; learning an orthogonal rotation matrix to minimize a quantization error; and an efficient iterative discrete optimization algorithm is utilized to reduce the calculation complexity of the training process.

Description

technical field [0001] The invention relates to a cross-modal hash retrieval method based on supervision graph embedding, belonging to the technical field of multimedia retrieval. Background technique [0002] In recent years, graph embedding based hashing methods have attracted much attention due to their effectiveness in cross-modal retrieval. The hash method is to map similar samples in the original space into similar hash codes, and then calculate the Hamming distance between the hash codes by XOR operation, and use the Hamming distance to measure the similarity between samples, which can significantly reduce the Computational complexity and memory overhead. Inspired by this, researchers have proposed many hashing methods for large-scale retrieval tasks in recent years. However, most methods are only applied to unimodal data, i.e. the modality type of the retrieved sample is the same as the modality type of the query sample, e.g. using images to retrieve images. But t...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F16/432
Inventor 姚涛张林梁李朝霞彭守永李艺茹王丽丽张淑宁
Owner LUDONG UNIVERSITY
Features
  • Generate Ideas
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More