Multi-modal classification method based on graph convolutional neural network

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A technology of convolutional neural network and classification method, applied in the field of artificial intelligence, can solve the problems that cannot be directly applied to multi-modal scenes, easy to ignore structural information, etc.

Active Publication Date: 2020-11-24

NANJING UNIV +1

View PDF6 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

On the other hand, the graph convolutional neural network can embed graph structure information into the neural network, and is suitable for processing large-scale data, but it cannot be directly applied to multi-modal scenarios. Objects in practical applications often have multi-modal information, but the traditional multi-modal method only trains the learner separately on multiple modalities and then integrates them. This way is easy to ignore the useful structural information in different modalities. Therefore, we propose a graph-based convolutional neural network The multimodal classification method of

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0014] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0015] Step 1, create a The object library of information is used as the training object library, and a small number of objects in the object library are given a category mark by manual labeling. On behalf of class label for an object. For the two-category problem, for example, the military news webpage is the first category, and the entertainment news webpage is the second category. If the first The content contained in an object is military news, the...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a multi-modal classification method based on a graph convolutional neural network. The method comprises the following steps: 1, ; the method comprises the following steps: (1)firstly, a user needs to prepare an object library, each object comprises a mode, a category mark is provided for a small number of objects in the library through a manual marking method, the objectswith the category marks are called initial marked training data, and the objects with the category marks and a large number of remaining unmarked objects form a training data set together; according to the method, graph structure information of different modes is comprehensively considered through the innovative multi-mode graph convolutional neural network, and trainable weights are distributed in each layer of the multi-mode graph convolutional neural network, so that representation learned by each mode can gradually consider structure information of other modes.

Description

technical field [0001] The invention belongs to the technical field of artificial intelligence in computer science and technology, and specifically relates to a multimodal classification method based on graph convolutional neural network. Background technique [0002] In recent years, more and more multi-modal data have appeared in practical applications. For example, multimedia data in the Internet often contains multiple modal information: video, image, and text information appearing around; Status information: the text information of the web page itself and the hyperlink information linked to the web page. These multimodal data contain huge economic value, and using these multimodal data can often obtain better results than single modality data. For example, in user content recommendation based on information flow, different modal information (such as pictures and texts) in the information flow can be considered simultaneously to recommend content of interest to users. ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): G06K9/62G06N3/04G06N3/08

CPCG06N3/08G06N3/045G06F18/2411G06F18/214

Inventor 王魏李述

Owner NANJING UNIV

Multi-modal classification method based on graph convolutional neural network

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology