Unsupervised cross-modal hash retrieval method and system based on virtual label regression

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A virtual label and cross-modal technology, applied in the field of cross-modal retrieval, can solve problems such as large-scale errors, limited semantic information, and limited expression ability of hash codes, so as to reduce quantization errors, improve performance, and ensure semantic consistency Effect

Active Publication Date: 2020-01-10

SHANDONG NORMAL UNIV

View PDF8 Cites 15 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0005] The inventors of the present disclosure found in the research that although a variety of unsupervised cross-modal hash retrieval methods have been proposed, there are still the following problems: (1) most of the existing methods are based on shallow models, simply using linear or non-linear Linear mapping is used for hash learning, resulting in limited expression ability of the learned hash code; (2) Without the guidance of semantic labels, the semantic information contained in the learned hash code is limited, and the lack of semantic information will directly affect the retrieval accuracy ; (3) Most of them use the two-step optimization strategy of "relaxation + quantization" to solve the hash code, and there is a large quantization error in the solution process

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment 1

[0041] Such as Figure 1-2 As shown, Embodiment 1 of the present disclosure provides an unsupervised cross-modal hash retrieval method based on virtual tag regression, including:

[0042] S1: Obtain cross-modal retrieval datasets and divide them into training set, test set and database set, where each sample includes paired image and text data of two modalities.

[0043] The training set has n samples, and each sample includes paired image and text modal data. The image feature matrix of the sample is expressed as d 1 Represents the dimension of image features, and the text feature matrix of samples is expressed as d 2 represents the dimensionality of text features, and the goal is to learn a shared hash code B ∈ [-1,1] n×r , r represents the length of the hash code.

[0044] S2: Build a deep hash model and initialize network parameters.

[0045] A deep hash network consists of two parts. For the image modality, the VGG-16 model is adopted as the basic deep hashing n...

Embodiment 2

[0103] Embodiment 2 of the present disclosure provides an unsupervised deep cross-modal hash retrieval system based on virtual label regression, including:

[0104] The image preprocessing module is configured to: obtain a cross-modal retrieval data set, and divide them into a training set, a test set and a database set, wherein each sample includes paired image and text data;

[0105] The network model building module is configured to: build a deep hash model and initialize network parameters;

[0106] The depth feature matrix and deep network output acquisition module are configured to: respectively input the original data of the two modalities into the constructed deep hash network to obtain a depth feature matrix, and the depth feature matrix continues to be transmitted along the network, Get the output value of the deep hash network;

[0107] The objective function building module is configured to: according to the deep feature matrix of the training set, the virtual lab...

Embodiment 3

[0111] Embodiment 3 of the present disclosure provides a computer-readable storage medium on which a computer program is stored, which is characterized in that, when the program is executed by a processor, the unsupervised depth based on virtual label regression described in Embodiment 1 of the present disclosure is implemented. Cross-modal hash retrieval method.

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention provides an unsupervised cross-modal hash retrieval method and system based on virtual label regression, and the method comprises the steps: feature representation and hash function learning are integrated into a unified depth frame, and a shared hash code is learned through the cooperative matrix decomposition of multi-modal depth features, so as to guarantee that a plurality of modes share the same semanteme; on the basis, the concept of a virtual label is introduced, the virtual label is learned through non-negative spectrum analysis, and meanwhile, the learned virtual labelis returned to the hash code, so that the semantic consistency between the hash code and the virtual label is ensured; in the framework, collaborative matrix decomposition of the depth features and learning and regression of the virtual tags are beneficial to learning of depth feature representation and hash functions, and improved depth feature representation and hash models are beneficial to collaborative matrix decomposition and learning and regression of the virtual tags, and the collaborative matrix decomposition and the virtual tags promote each other; and meanwhile, through a new discrete optimization strategy, the deep hash function and the hash code are directly updated, the quantization error of a relaxation strategy in an existing method is effectively reduced, and the cross-modal retrieval performance is improved.

Description

technical field [0001] The present disclosure relates to the technical field of cross-modal retrieval, in particular to an unsupervised cross-modal hash retrieval method and system based on virtual tag regression. Background technique [0002] The statements in this section merely provide background information related to the present disclosure and may not necessarily constitute prior art. [0003] With the rapid development of the mobile Internet, multimodal data on the Internet is showing an explosive growth trend. In the field of information retrieval, the rapid growth of multimodal data has brought about a huge demand for cross-modal retrieval applications. Cross-modal retrieval is to model the relationship between different modalities to achieve retrieval between modalities. The modalities of the query data and the data to be retrieved do not have to be the same, such as retrieving images by text and retrieving text by images. Exploring new cross-modal retrieval modes ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): G06F16/51G06F16/53G06F16/583G06K9/46G06N3/04G06N3/08

CPCG06F16/51G06F16/53G06F16/583G06F16/5846G06N3/084G06V10/464G06N3/045

Inventor 朱磊王菲王彤

Owner SHANDONG NORMAL UNIV

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Unsupervised cross-modal hash retrieval method and system based on virtual label regression

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment 1

Embodiment 2

Embodiment 3

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology