Online cross-modal retrieval method and system using three-step strategy

A retrieval system and cross-modal technology, applied in digital data information retrieval, special data processing applications, instruments, etc., can solve problems such as inability to effectively update hash functions and ignore global information

Active Publication Date: 2021-08-31
SHANDONG UNIV
View PDF7 Cites 10 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Most existing cross-modal online methods only update hash functions based on newly arrived data or the correlation between new data and existing data, ignoring the global information
[0006] 2) How to enhance the adaptability of the model to adapt to the variable-length label space is a problem that needs to be solved. So far, most existing online methods implicitly assume that the label space is fixed, that is, all class labels are Should be given in the first data block
In practice, this assumption may be too restrictive, and these methods may not be effective in updating the hash function when new labels appear in newly arrived data blocks.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Online cross-modal retrieval method and system using three-step strategy
  • Online cross-modal retrieval method and system using three-step strategy
  • Online cross-modal retrieval method and system using three-step strategy

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0055] This embodiment discloses an online cross-modal retrieval method using a three-step strategy, which is a three-step online cross-modal hashing method (three-step online cross-modal hashing), referred to as THOR. THOR generates a representation of each class label by introducing a hadamard matrix (hadamard matrix), and uses it as global information to guide the learning of hash codes. It also maintains local similarity information, that is, the relationship between newly arrived data and existing data to learn more discriminative hash codes. Furthermore, based on learnable class label embeddings, THOR can be freely adapted to incremental label space problems.

[0056] In order to adapt to the online retrieval task, the training set is divided into the form of T rounds of data, which is used to simulate the arrival of streaming data.

[0057] Specifically, THOR is a three-step online cross-modal hashing method, which consists of three steps:

[0058] Step (1): By introd...

Embodiment 2

[0150] The purpose of this embodiment is to provide a cross-modal retrieval system based on online hashing, including:

[0151] The simulated stream data acquisition module is configured to: acquire simulated stream data composed of different modalities;

[0152] The hash code learning module is configured to: for the simulated stream data, generate a representation of each class label by introducing a hadamard matrix, and use the representation of each class label as global information for learning the hash code, and at the same time, The representation of each class label also maintains local similarity information, exploiting the correlation between newly arrived data and existing data in simulated streaming data to learn more discriminative hash codes;

[0153] Among them, the steps to generate the representation of each class label by introducing the hadamard matrix are:

[0154] Learning embedding representations of labels that simulate the first round of streaming data...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides an online cross-modal retrieval method and system using a three-step strategy. The online cross-modal retrieval method comprises the following steps: acquiring analog stream data formed by different modals; for analog stream data, generating the representation of each class label by introducing a hadamard matrix, the representation of each class label is used as global information for learning a hash code, and the representation of each class label also keeps local similarity information, using the correlation between newly arrived data in the analog stream data and existing data to learn a Hash code with more discriminative power; updating a hash function by using the learned hash code; and calculating a hash code of the to-be-retrieved sample by using the updated hash function, and calculating a Hamming distance of a binary sample based on the hash code, thereby returning a sample of another modal similar to the to-be-retrieved sample according to the Hamming distance. According to the method, the THOR can reserve more semantic information and learn more accurate hash codes.

Description

technical field [0001] The invention belongs to the technical field of cross-modal hash retrieval, and in particular relates to an online cross-modal retrieval method and system using a three-step strategy. Background technique [0002] The statements in this section merely provide background information related to the present invention and do not necessarily constitute prior art. [0003] With the explosive growth of data composed of many different modalities, it has become a major challenge to find similar information across modalities from massive data under the condition of a given query statement. In many cases, traditional nearest neighbor search methods do not lead to optimal performance due to high complexity in terms of time and storage space. In recent years, Approximate Nearest Neighbor Search (ANN), especially hash learning, has attracted extensive attention and gradually replaced traditional nearest neighbor search methods. Cross-modal hashing methods aim to m...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/242G06F40/30
CPCG06F16/2433G06F40/30
Inventor 罗昕詹雨薇刘家乐许信顺
Owner SHANDONG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products