Cross-modal correlation learning method based on multi-granularity hierarchical network

A learning method and cross-modal technology, applied in neural learning methods, biological neural network models, multimedia data retrieval, etc., can solve the problems of ignoring supplementary effects, ignoring rich fine-grained information, and unable to fully balance the associated learning process, etc. The effect of improving accuracy

Active Publication Date: 2017-11-14
PEKING UNIV
View PDF3 Cites 23 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, there are three limitations in the existing methods. One is that in the first stage, the existing methods only model the association relationship within the modality, while ignoring the supplementary role of the inter-modal association for the separation of feature representation learning; the second is that in the In the second stage, the existing methods only use a single loss function to constrain, and cannot fully balance the association learning process between the modalities and the modalities; in addition, the existing methods only consider the original data of different modalities, while ignoring other modalities. The rich and fine-grained information provided by various internal parts cannot fully mine the cross-modal relationship

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Cross-modal correlation learning method based on multi-granularity hierarchical network
  • Cross-modal correlation learning method based on multi-granularity hierarchical network
  • Cross-modal correlation learning method based on multi-granularity hierarchical network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0022] The present invention will be further described in detail below in conjunction with the accompanying drawings and specific embodiments.

[0023] A cross-modal association learning method based on a multi-granularity hierarchical network of the present invention, its process is as follows figure 1 shown, including the following steps:

[0024] (1) Establish a cross-modal database containing multiple modal types, and divide the database into a training set, a verification set, and a test set, process the data of different modalities in the database, and extract all modal original The data and the eigenvectors of the partitioned data.

[0025] In this embodiment, the cross-modal database may include multiple modality types, and different block processing methods are used for different modality data to divide the original data into multiple parts. Taking images and texts as examples, the Selective Search algorithm is used to extract multiple candidate areas containing ric...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a cross-modal correlation learning method based on multi-granularity hierarchical network. The method comprises the following steps: 1, building a cross-modal database containing multiple modal types, dividing the data in the database into a training set, a validation set and a testing set, processing the different modal data in the database in a partitioning way and extracting all modal original modal data and feature vector of data after partitioning; 2, training a multi-granularity hierarchical structure by utilizing the original data and partitioned data and learning unified representation for different modal data; 3, obtaining the unified representation of different modal data by utilizing the trained multi-granularity hierarchical structure, and calculating the similarity of different modal data; 4, taking any modal type in the testing set as a query modal, taking another modal type as a target modal, calculating the similarity between a query sample and a query target and obtaining the related result list of the target modal data according to the similarity. According to the method of the invention, the accuracy rate of cross-modal searching can be improved.

Description

technical field [0001] The invention relates to the field of multimedia retrieval, in particular to a cross-modal association learning method based on a multi-granularity hierarchical network. Background technique [0002] In recent years, with the rapid development of computer technology, the acquisition and processing of information has changed from a single mode of text, image, audio, video, etc. to a form of integration of multiple modes. Multimodal retrieval has become an important issue in the field of information retrieval, with wide applications in search engines and big data management. The traditional retrieval method is mainly in the form of a single modality, that is, the user submits a modality type of data as a query, and the retrieval system returns the retrieval results of the same modality, such as image retrieval, text retrieval, etc. This retrieval method cannot directly measure the similarity between data of different modalities, such as the similarity b...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30G06N3/04G06N3/08
CPCG06F16/43G06N3/08G06N3/045
Inventor 彭宇新綦金玮
Owner PEKING UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products