Unlock instant, AI-driven research and patent intelligence for your innovation.

Multi-modal classification method based on graph convolutional neural network

A technology of convolutional neural network and classification method, applied in the field of artificial intelligence, can solve the problems that cannot be directly applied to multi-modal scenes, easy to ignore structural information, etc.

Active Publication Date: 2020-11-24
NANJING UNIV +1
View PDF6 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

On the other hand, the graph convolutional neural network can embed graph structure information into the neural network, and is suitable for processing large-scale data, but it cannot be directly applied to multi-modal scenarios. Objects in practical applications often have multi-modal information, but the traditional multi-modal method only trains the learner separately on multiple modalities and then integrates them. This way is easy to ignore the useful structural information in different modalities. Therefore, we propose a graph-based convolutional neural network The multimodal classification method of

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Multi-modal classification method based on graph convolutional neural network
  • Multi-modal classification method based on graph convolutional neural network
  • Multi-modal classification method based on graph convolutional neural network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0014] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0015] Step 1, create a The object library of information is used as the training object library, and a small number of objects in the object library are given a category mark by manual labeling. On behalf of class label for an object. For the two-category problem, for example, the military news webpage is the first category, and the entertainment news webpage is the second category. If the first The content contained in an object is military news, the...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a multi-modal classification method based on a graph convolutional neural network. The method comprises the following steps: 1, ; the method comprises the following steps: (1)firstly, a user needs to prepare an object library, each object comprises a mode, a category mark is provided for a small number of objects in the library through a manual marking method, the objectswith the category marks are called initial marked training data, and the objects with the category marks and a large number of remaining unmarked objects form a training data set together; according to the method, graph structure information of different modes is comprehensively considered through the innovative multi-mode graph convolutional neural network, and trainable weights are distributed in each layer of the multi-mode graph convolutional neural network, so that representation learned by each mode can gradually consider structure information of other modes.

Description

technical field [0001] The invention belongs to the technical field of artificial intelligence in computer science and technology, and specifically relates to a multimodal classification method based on graph convolutional neural network. Background technique [0002] In recent years, more and more multi-modal data have appeared in practical applications. For example, multimedia data in the Internet often contains multiple modal information: video, image, and text information appearing around; Status information: the text information of the web page itself and the hyperlink information linked to the web page. These multimodal data contain huge economic value, and using these multimodal data can often obtain better results than single modality data. For example, in user content recommendation based on information flow, different modal information (such as pictures and texts) in the information flow can be considered simultaneously to recommend content of interest to users. ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62G06N3/04G06N3/08
CPCG06N3/08G06N3/045G06F18/2411G06F18/214
Inventor 王魏李述
Owner NANJING UNIV