Adjacency matrix-based graph feature extraction system, graph classification system and methods

An adjacency matrix and feature extraction technology, applied in the field of artificial intelligence, can solve the problems of inability to feature representation, process sensitivity, low classification accuracy, etc., to solve the limitation of computational complexity and window size, and improve accuracy and speed. , the effect of reducing computational complexity and amount of computation

Pending Publication Date: 2018-05-22
ZHEJIANG UNIV
View PDF0 Cites 31 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, it still has some disadvantages
First of all, the selection of w central vertices will limit the number of subgraphs, so it is impossible to guarantee that all subgraph structures can be extracted; secondly, the PSCN method is still limited by the window size, and the selection of the neighborhood is from a window size smaller than 10 k decision, because a larger window size k will result in unacceptable time-consuming and memory usage; again, PSCN is not effective for deep learning when using a smaller window size k, because when the input graph has more than the default window size It loses the complex subgraph features when densely connected features of , and PSCN's classification results are sensitive to the labeling process, which is to sort the vertices in the domain; so their labeling method is suitable for a data set , but may fail on another dataset
[0012] To sum up, there are two main problems in the classification of graphs in the prior art methods: one is that when analyzing a graph (graph) as a whole object, it is impossible to select features that can contain both display topological information and deep hidden information. Represent graphs; secondly, when subgraphs are used as the features of graphs, the size of subgraphs is subject to the selection of window size (windowsize) k, which makes it difficult to capture large and complex subgraphs, making the classification of graphs Not very accurate
However, the methods in the prior art cannot effectively extract the larger subgraph structure in the graph, and thus cannot perform a good feature representation on the graph.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Adjacency matrix-based graph feature extraction system, graph classification system and methods
  • Adjacency matrix-based graph feature extraction system, graph classification system and methods
  • Adjacency matrix-based graph feature extraction system, graph classification system and methods

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0205] Taking a 6-vertex graph as an example, the graph feature extraction system based on the adjacency matrix in the computer environment of the present invention is described in detail. For this 6-vertex graph, each of its vertices is represented by a, b, c, d, e, f. In alphabetical order, the six sides are (a, b), (a, c), (b, e), (b, f), (e, f) and (e, d), its graph structure and its first adjacency matrix sorted according to this vertex are as follows Figure 9 shown.

[0206] The connection information regularization module reorders all vertices in the first adjacency matrix to obtain a second adjacency matrix, and the connection information elements in the second adjacency matrix are concentratedly distributed in the width of the second adjacency matrix is ​​n Diagonal area of ​​, wherein n is a positive integer, n>=2 and n is much smaller than |V|, where |V| is the number of rows (or columns) of the second adjacency matrix. The diagonal area with a width of n of the ...

Embodiment 2

[0226] This embodiment describes in detail the specific implementation of the adjacency matrix-based graph classification system in the computer environment of the present invention, and verifies the effect of this implementation by using a public dataset.

[0227] For datasets with irregularly sized graphs, it is necessary to find an appropriate window size n for it. When n is set too small, it may cause most graphs to lose connection information elements after passing through the connection information regularization module. In addition, too small n may lead to possible overfitting of the feature generation module, since fewer possible subgraph structural features are captured. First, we unify the size of the adjacency matrix of all graphs, and select the graph with the largest number of vertices |V| max The size (number of rows or columns) of the adjacency matrix as uniform. For vertices less than |V| max , such as a graph with 3 vertices, we use zero padding (append 0) ...

Embodiment 3

[0270] This embodiment mainly illustrates the important characteristic of the graph classification system based on the adjacency matrix proposed by the present invention: it can use a smaller window to capture a large multi-vertex subgraph structure.

[0271] Taking a graph consisting of ten vertices (|V|=10) as an example, Figure 28 The physical implications of using a feature generation module on this diagram are shown. It can be seen that the graph has two rings of size six vertices, and two vertices are shared by these two ring structures. To capture such ring-based graph patterns, existing methods usually require window sizes larger than 10. However, the method of the present invention is effective even when using only a window of size 6. consider Figure 28 In the figure on the upper left, we use the connection information regularization module to concentrate the connection information elements into the diagonal area of ​​n=6, and reorder the vertices. The upper righ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The present invention provides an adjacency matrix-based graph feature extraction system, an adjacency matrix-based graph feature extraction picture classification system and corresponding methods. According to the systems and methods, connection information elements in an adjacency matrix corresponding to a graph are concentrated at a specific diagonal region of the adjacency matrix, and non-connection information elements are reduced in advance; a filtering matrix is used to extract the subgraph structure of the graph along a diagonal direction; a stack convolutional neural network is used to extract a larger subgraph structure; and therefore, computational complexity and computational quantity are greatly reduced, the restrictions of computational complexity and the restrictions of window size can be eliminated; and a large multi-vertex subgraph structure and the deep features of implicit correlation structures from vertices and edges can be captured through a small window, and theaccuracy and speed of graph classification can be improved.

Description

technical field [0001] The invention belongs to the field of artificial intelligence, and in particular relates to a graph feature extraction system based on an adjacency matrix, a graph classification system and a method. Background technique [0002] A graph in graph theory is a graph composed of a number of given points and a line connecting two points. This graph is usually used to describe a certain relationship between certain things, using points to represent things, and using A line connecting two points indicates that there is a certain relationship between the corresponding two things. A graph (Graph) G in graph theory is an ordered binary (V, E), where V is called a vertex set (vertex set), that is, a set composed of all vertices in the graph, and E is called an edge set (edge ​​set). ), which is the set of edges between all vertices. Simply put, vertices represent things, and edges represent relationships between things. Graph is a kind of non-grid data. The c...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/46G06K9/62G06V10/426G06V10/764
CPCG06V10/44G06F18/2411G06F17/10G06F16/9024G06N3/084G06N3/105G06N5/022G06V10/426G06V10/764G06N5/01G06N3/048G06N3/045G06F16/9038G06F17/16G06N3/04G06N3/08G06F18/2163G06F18/24147
Inventor 尹建伟罗智凌吴朝晖邓水光李莹吴健
Owner ZHEJIANG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products