Supercharge Your Innovation With Domain-Expert AI Agents!

Rapid cross-modal retrieval method and system for incremental data carrying new categories

An incremental data and cross-modal technology, applied in the field of deep learning and cross-modal retrieval, can solve problems such as violation of fast and accurate retrieval, waste of computing resources and training time, and inability to deal with cross-modal retrieval problems, achieving high efficiency , the effect of improving performance

Active Publication Date: 2021-08-31
SHANDONG UNIV
View PDF8 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] However, the existing deep cross-modal hashing methods cannot solve the problem of incremental learning well
In other words, most deep cross-modal hashing methods cannot adapt to the incremental data of emerging unknown category labels in time, and need to be fed to the network at the same time as the original data to retrain the model
This mode has the following disadvantages: First, retraining all data means completely ignoring the hash code of the original data obtained through previous training, wasting computing resources and training time; in addition, as more and more unknown categories The generation of incremental data of tags, feeding the original data and incremental data to the network at the same time will increase the computational complexity of the model, which violates the demand for fast and accurate retrieval of large-scale multimedia data; and the current incremental hash learning Method cannot handle cross-modal retrieval issues

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Rapid cross-modal retrieval method and system for incremental data carrying new categories
  • Rapid cross-modal retrieval method and system for incremental data carrying new categories
  • Rapid cross-modal retrieval method and system for incremental data carrying new categories

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0043] This embodiment discloses a fast cross-modal retrieval method for incremental data carrying new categories, which mainly includes two aspects:

[0044] 1) How to extract the information of different category labels from the hash code of the existing data to model the incremental category label space while keeping the original hash code unchanged, and then use the representation of the unknown category label to supervise the generation Hash code of incremental data, so as to avoid repeated training and improve model efficiency.

[0045] 2) How to further shorten the model training time while ensuring the quality of the hash code.

[0046] The overall idea is: first extract the binary representation of the known category label from the known hash code, and then obtain the binary representation of the unknown category label according to the similarity relationship between the existing category label and the unknown category label to supervise the increment The generation of...

Embodiment 2

[0132] The purpose of this embodiment is to provide a computing device, including a memory, a processor, and a computer program stored in the memory and operable on the processor, and the processor implements the steps of the above method when executing the program.

Embodiment 3

[0134] The purpose of this embodiment is to provide a fast cross-modal retrieval method for incremental data carrying new categories, including:

[0135] The incremental hash learning module is configured to: extract the binary representation of the known category label from the known hash code stored in the multimedia known category database, and then according to the similarity relationship between the existing category label and the unknown category label, Obtain the binary representation of the unknown category label, which is used to supervise the generation of the hash code of the incremental data in the incremental category database;

[0136] The hash function learning module is configured to: in the learning process of the hash function, obtain the anchor point set by sampling from the known category database and the incremental category database, and use an asymmetric strategy to update the depth network based on the anchor point set Parameters, learn the hash functio...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a rapid cross-modal retrieval method and system for incremental data carrying new categories, and the method comprises an incremental hash learning step: extracting a binary representation of a known category label from a known hash code stored in a multimedia known category database, and then according to a similarity relationship between the known category label and an unknown category label, obtaining a binary representation of an unknown category label to supervise the generation of a hash code of incremental data in an incremental category database; and a Hash function learning step: in a Hash function learning process, obtaining an anchor point set from the known category database and the incremental category database through sampling, updating parameters of the deep network by adopting an asymmetric strategy based on the anchor point set, and carrying out Hash function learning to obtain a required model. The method can directly learn the hash code of the unknown incremental data under the condition of keeping the hash code of the original data unchanged, so the mode requirement of rapid training can be met.

Description

technical field [0001] The invention belongs to the technical fields of cross-modal retrieval and deep learning, and in particular relates to a fast cross-modal retrieval method and system for incremental data carrying new categories. Background technique [0002] The statements in this section merely provide background information related to the present invention and do not necessarily constitute prior art. [0003] With the development of the Internet, multimedia data such as images, texts, videos, and audios generated by various smart terminal devices and websites show an explosive growth trend. Faced with these rich and massive multimedia data, how to quickly and accurately retrieve multi-modal data in the huge database according to user needs is a hot spot in the research of multimedia information retrieval. Therefore, cross-modal retrieval emerges as the times require, and its main purpose is to use query data of one modality to retrieve similar data of another modali...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/2453G06N3/04
CPCG06F16/2453G06N3/045
Inventor 罗昕孙钰詹雨薇许信顺
Owner SHANDONG UNIV
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More