Cross-modal generalized zero sample retrieval method based on dual learning generative adversarial network

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A cross-modal, network model technology, applied in the field of cross-modal retrieval, can solve the problems of information loss, semantic information loss, inability to encode, etc., and achieve the effect of reducing cumbersomeness and expensive cost and improving retrieval effect.

Active Publication Date: 2020-08-25

UNIV OF ELECTRONICS SCI & TECH OF CHINA

View PDF4 Cites 30 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0004] 1. There are "semantic gaps" and intra-class differences between different modal data, which lead to inconsistencies between data distribution and feature representation of different media types, so it is difficult to directly measure the similarity between multiple media data sex;

[0005] 2. The zero-sample retrieval problem involves transferring the knowledge learned from known classes to unknown classes. There are information loss and over-fitting problems in the learning and transfer process, so generalized zero-sample retrieval is often better than traditional zero-sample retrieval. Retrieval is more difficult

[0007] The main problem of existing methods is that there is a loss of semantic information when mapping from high-dimensional visual features to low-dimensional semantics, and the adversarial generative network is often unstable during training; The feature and semantic information are encoded definitely, so that the model will tend to the known classes that have appeared in the training process during the retrieval process.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0036] In order to make the purpose, technical solution and advantages of the present invention more clear, the specific implementation manners of the present invention will be further described in detail below in conjunction with the accompanying drawings, so that those skilled in the art can better understand the present invention. It should be noted that the described embodiments are some, not all, embodiments of the present invention, and are not intended to limit the scope of the claimed invention. All other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

[0037] Such as figure 1 As shown, in the present invention, the cross-modal generalization zero-shot retrieval method based on generative confrontation network includes the following steps:

[0038] Step 1: Select the training dataset. In this example, two large-scale datasets Sketchy Ext. and TU-Berlin Ext. were selected...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention provides a cross-modal generalized zero sample retrieval method based on a dual learning generative adversarial network. The method comprises: constructing a generative adversarial network based on dual learning; mapping the high-dimensional visual features of different modes to a common low-dimensional semantic embedding space; secondly, constructing multiple constraint mechanisms to perform cyclic consistency constraint, generative adversarial constraint and classifier constraint so as to maintain visual-semantic consistency and generated feature-source feature consistency, andperforming cross-modal retrieval after training of the whole network, so that the model is more powerful in performance in generalization of zero-sample retrieval. Meanwhile, in the whole training process, paired multimedia data pairs on the pixel level do not need to serve as training samples, only paired data on the category are needed, so that the complexity and expensive cost of data set collection are reduced, the retrieval effect is better, and performance improvement is more obvious in the zero-sample generalization retrieval problem.

Description

technical field [0001] The invention belongs to the technical field of cross-modal retrieval in computer vision, and in particular relates to a retrieval method for multimedia data, that is, a cross-modal generalization zero-sample retrieval method based on dual learning generative adversarial networks. Background technique [0002] Cross-media retrieval means that users can retrieve semantically related data in all media types by inputting query data of any media type. With the increasing amount of multimedia data such as text, images, and videos on the Internet, retrieval across different modalities has become a new trend in information retrieval. The goal of traditional cross-modal zero-shot retrieval is to perform cross-modal retrieval on unseen new category data; the test set of cross-modal generalization zero-shot retrieval includes unknown classes and some known classes, but its class labels Unknown to the model, it is more difficult to retrieve. [0003] Currently,...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): G06F16/48G06F16/45G06K9/62G06N3/04G06N3/08

CPCG06F16/48G06F16/45G06N3/08G06N3/045G06F18/214G06F18/24

Inventor 徐行朱佳文沈复民汪政杨阳申恒涛

Owner UNIV OF ELECTRONICS SCI & TECH OF CHINA

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Cross-modal generalized zero sample retrieval method based on dual learning generative adversarial network

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology