Discrete supervision cross-modal hash retrieval method based on semantic preservation

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A cross-modal, hashing technology, applied in the field of discrete-supervised cross-modal hash retrieval based on semantic preservation, can solve problems such as high computational complexity and memory overhead, difficult to solve, and loss of scalability.

Pending Publication Date: 2020-11-10

LUDONG UNIVERSITY

View PDF0 Cites 1 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

On the one hand, the similarity matrix between two samples is , in the era of big data The value of is very large, so this matrix usually leads to very high computational complexity and memory overhead, making it lose its scalability

On the other hand, using the Hamming distance between hash codes to approximate the similarity matrix between two samples is a decomposition problem of a symmetric matrix, which is difficult to solve

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment

[0140] This embodiment takes the public dataset Mirflickr25K as an example, which contains 25,000 image and text sample pairs, and all sample pairs are distributed in 24 categories. First remove the labels that appear less than 20 times, and then remove the sample pairs that do not contain any labels and categories. There are 20015 sample pairs left in the data set. The image and text samples in the data set were extracted with CaffeNet and BOW (Bag Of Words) algorithms, respectively, with 4096-dimensional CNN features and 500-dimensional BOW features, and the features were averaged. 75% (15011) of the sample pairs are randomly selected to constitute the training set, while the remaining 25% (5004) of the sample pairs constitute the test set. In order to objectively evaluate the retrieval performance of the method of the present invention, the average accuracy rate MPA@r is used as the evaluation standard, where r represents the number of returned samples, and r=100 in the exp...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a discrete supervision cross-modal hash retrieval method based on semantic preservation, which realizes semantic-based image and text cross-modal retrieval, and comprises the following steps of: firstly, collecting an image and text sample pair from a network, and respectively extracting features of an image and a text sample; dividing the data set into a training set and atest set; learning a shared Hamming space for the image and text modalities by keeping semantic consistency of samples in the training set, and learning hash functions for the image and text modalities respectively; taking the training set and the test set as a target set and a query set respectively, and calculating Hamming distances between samples in the query set and heterogeneous samples inthe target set; and sorting from small to large according to the Hamming distance, and returning the sample sorted in the front as a retrieval result. According to the method, images and text samplescan be mapped to a shared Hamming space for efficient retrieval, the training speed is high, the accuracy is high, and the method has a good application prospect.

Description

technical field [0001] The invention relates to the fields of multimedia retrieval and pattern recognition, in particular to a discrete-supervised cross-modal hash retrieval method based on semantic preservation. Background technique [0002] With the rapid development of information technology and personal social networks, the data on the network not only increases rapidly, but also presents a variety of data representations, such as images, texts, voices, etc. This brings two challenges to multimedia retrieval: 1) how to achieve real-time retrieval in massive data; 2) retrieval across modalities. On the one hand, the amount of data on the network is very large, and the representation dimension of samples is also very high, even reaching tens of thousands of dimensions, so the traditional nearest neighbor retrieval method is difficult to apply on large-scale data sets. Hash method is one of the most effective methods to solve large-scale applications. Its idea is to map hi...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): G06F16/532G06F16/583G06K9/62

CPCG06F16/532G06F16/5846G06F18/22

Inventor 姚涛于泓刘莉王增峰苏庆堂崔光海

Owner LUDONG UNIVERSITY

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Discrete supervision cross-modal hash retrieval method based on semantic preservation

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology