Supercharge Your Innovation With Domain-Expert AI Agents!

Fast cross-modal retrieval method and system for incremental data carrying new categories

An incremental data and cross-modal technology, applied in the field of deep learning and cross-modal retrieval, can solve the problems of violation of fast and accurate retrieval, waste of computing resources and training time, and inability to adapt to incremental data of unknown category labels in time , to achieve the effect of improving performance and high efficiency

Active Publication Date: 2021-11-02
SHANDONG UNIV
View PDF8 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] However, the existing deep cross-modal hashing methods cannot solve the problem of incremental learning well
In other words, most deep cross-modal hashing methods cannot adapt to the incremental data of emerging unknown category labels in time, and need to be fed to the network at the same time as the original data to retrain the model
This mode has the following disadvantages: First, retraining all data means completely ignoring the hash code of the original data obtained through previous training, wasting computing resources and training time; in addition, as more and more unknown categories The generation of incremental data of tags, feeding the original data and incremental data to the network at the same time will increase the computational complexity of the model, which violates the demand for fast and accurate retrieval of large-scale multimedia data; and the current incremental hash learning Method cannot handle cross-modal retrieval issues

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Fast cross-modal retrieval method and system for incremental data carrying new categories
  • Fast cross-modal retrieval method and system for incremental data carrying new categories
  • Fast cross-modal retrieval method and system for incremental data carrying new categories

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0043] This embodiment discloses a fast cross-modal retrieval method for incremental data carrying new categories, which mainly includes two aspects:

[0044] 1) How to extract the information of different category labels from the hash code of the existing data to model the incremental category label space while keeping the original hash code unchanged, and then use the representation of the unknown category label to supervise the generation Hash code of incremental data, so as to avoid repeated training and improve model efficiency.

[0045] 2) How to further shorten the model training time while ensuring the quality of the hash code.

[0046] The overall idea is: first extract the binary representation of the known category label from the known hash code, and then obtain the binary representation of the unknown category label according to the similarity relationship between the existing category label and the unknown category label to supervise the increment The generation of...

Embodiment 2

[0132] The purpose of this embodiment is to provide a computing device, including a memory, a processor, and a computer program stored in the memory and operable on the processor, and the processor implements the steps of the above method when executing the program.

Embodiment 3

[0134] The purpose of this embodiment is to provide a fast cross-modal retrieval method for incremental data carrying new categories, including:

[0135] The incremental hash learning module is configured to: extract the binary representation of the known category label from the known hash code stored in the multimedia known category database, and then according to the similarity relationship between the existing category label and the unknown category label, Obtain the binary representation of the unknown category label, which is used to supervise the generation of the hash code of the incremental data in the incremental category database;

[0136] The hash function learning module is configured to: in the learning process of the hash function, obtain the anchor point set by sampling from the known category database and the incremental category database, and use an asymmetric strategy to update the depth network based on the anchor point set Parameters, learn the hash functio...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The present invention proposes a fast cross-modal retrieval method and system for incremental data carrying new categories, including: an incremental hash learning step: extracting known category labels from known hash codes stored in a multimedia known category database Then, according to the similarity relationship between the existing category label and the unknown category label, the binary representation of the unknown category label is obtained, which is used to supervise the generation of the hash code of the incremental data in the incremental category database; the hash function Learning steps: In the learning process of the hash function, the anchor point set is obtained by sampling from the known category database and the incremental category database, based on the anchor point set, an asymmetric strategy is used to update the parameters of the deep network, and the hash function is Learn to get the desired model. The hash code of unknown incremental data can be directly learned while keeping the hash code of the original data unchanged, so as to meet the mode requirements of fast training.

Description

technical field [0001] The invention belongs to the technical fields of cross-modal retrieval and deep learning, and in particular relates to a fast cross-modal retrieval method and system for incremental data carrying new categories. Background technique [0002] The statements in this section merely provide background information related to the present invention and do not necessarily constitute prior art. [0003] With the development of the Internet, multimedia data such as images, texts, videos, and audios generated by various smart terminal devices and websites show an explosive growth trend. Faced with these rich and massive multimedia data, how to quickly and accurately retrieve multi-modal data in the huge database according to user needs is a hot spot in the research of multimedia information retrieval. Therefore, cross-modal retrieval emerges as the times require, and its main purpose is to use query data of one modality to retrieve similar data of another modali...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/2453G06N3/04
CPCG06F16/2453G06N3/045
Inventor 罗昕孙钰詹雨薇许信顺
Owner SHANDONG UNIV
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More