Method for training cross-modal retrieval model, cross-modal retrieval method and related device

A cross-modal and model technology, applied in the field of training cross-modal retrieval models, can solve the problems of labor-consuming and time-consuming, cross-modal retrieval model limitations, and high cross-modal retrieval model accuracy, and achieve high accuracy. Effect

Pending Publication Date: 2020-06-30
HUAWEI CLOUD COMPUTING TECH CO LTD
View PDF17 Cites 14 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Therefore, labeling large-scale data requires a lot of manpower and time
Therefore, training cross-modal retrieval models with supervised methods is limited in practical applications.
[0005] Although unsupervised methods do not require labeling when training cross-modal retrieval models, the accuracy of cross-modal retrieval models trained by unsupervised methods is usually not as high as that of cross-modal retrieval models trained by supervised methods

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for training cross-modal retrieval model, cross-modal retrieval method and related device
  • Method for training cross-modal retrieval model, cross-modal retrieval method and related device
  • Method for training cross-modal retrieval model, cross-modal retrieval method and related device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0048] The technical solution in this application will be described below with reference to the accompanying drawings.

[0049] In addition, in the embodiments of the present application, words such as "exemplary" and "for example" are used as examples, illustrations or explanations. Any embodiment or design described herein as "example" is not to be construed as preferred or advantageous over other embodiments or designs. Rather, the use of the word example is intended to present concepts in a concrete manner.

[0050] In the embodiments of the present application, "corresponding (corresponding, relevant)" and "corresponding (corresponding)" may sometimes be used interchangeably. It should be noted that when the difference is not emphasized, the meanings they intend to express are consistent.

[0051] In the embodiment of this application, sometimes subscripts such as W 1 There may be a typo in a non-subscripted form such as W1. When the difference is not emphasized, the me...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a method for training a cross-modal retrieval model, a cross-modal retrieval method and related devices, relating to the field of artificial intelligence. The method comprises the following steps: determining a reference model by utilizing unsupervised learning; performing knowledge distillation based on the reference model and the training data to obtain similar data of thetraining data; carrying out supervised learning by utilizing the similar data of the training data and the training data, so that a cross-modal retrieval model is obtained. According to the method and the device, relatively high accuracy of the trained cross-modal retrieval model can be ensured under the condition that the label of the training data used for supervised learning does not need to be manually labeled.

Description

technical field [0001] The present application relates to the field of artificial intelligence, and more specifically, to a method for training a cross-modal retrieval model, a method for cross-modal retrieval, and related devices. Background technique [0002] Early retrieval was based on text-to-text retrieval. For example, search engines (such as Google, Bing, etc.) are used to retrieve webpages through keywords. However, with the rapid growth of multimedia services, this kind of retrieval based on text-to-text in the same mode can no longer meet the demand. Cross-modal retrieval has gained increasing attention in the industry. Every kind of multimedia data such as text, pictures, and videos can be regarded as a modal. Using cross-modal retrieval can realize functions such as searching pictures by text, searching text by using pictures, or searching videos by text. [0003] At present, common methods for training cross-modal retrieval models can be divided into superv...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/903G06F16/907G06K9/62G06N3/04G06N3/08
CPCG06F16/90335G06F16/907G06N3/084G06N3/088G06N3/045G06F18/22
Inventor 杜泽伟胡恒通谢凌曦田奇
Owner HUAWEI CLOUD COMPUTING TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products