Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Filtering-based graph clustering method

A graph clustering and clustering technology, applied in the field of attribute graph clustering, can solve problems such as the influence of clustering results, the lack of consideration of graph structure information, and the existence of noise.

Inactive Publication Date: 2021-01-26
UNIV OF ELECTRONICS SCI & TECH OF CHINA
View PDF0 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the original subspace clustering method does not fully consider the characteristics of the data.
Generally speaking, there is noise in the data, which will have a great impact on the clustering results.
At the same time, the original subspace clustering method only mines the characteristic information of the graph, does not consider the structural information of the graph, and does not fully mine the known information of the graph

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Filtering-based graph clustering method
  • Filtering-based graph clustering method
  • Filtering-based graph clustering method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0034] In order to facilitate those skilled in the art to understand the technical content of the present invention and understand the key role of graph filtering in clustering algorithms, the content of the present invention will be further explained below in conjunction with the accompanying drawings.

[0035] The present invention adopts five data sets, respectively citing network Cora, Citeseer, Pubmed, LargeCora, webpage network Wiki.

[0036] First, preprocess the data sets of the present invention, and determine some parameter settings of each data set. The parameter settings of the present invention are mainly the order k of graph filtering, the model parameter α, and the order P of the adjacency matrix. The sample points of Cora, the number of data categories, and the data dimensions are: 2708, 7, and 1433; the sample points of Citeseer, the number of data categories, and the data dimensions are: 3327, 6, and 3703; the sample points of Pubmed, the number of data catego...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a filtering-based graph clustering method, which comprises the following steps of: firstly, carrying out filtering operation on original data characteristics by using a low-passfilter, and then, improving an optimization objective function based on a subspace clustering model, so that the optimization objective function excavates structural information and characteristic information of a graph at the same time, and calculates an affinity matrix; and finally, enabling the affinity matrix to be symmetrical, and obtaining a graph clustering result through spectral clustering. The method is very universal, a concise and efficient graph clustering algorithm model is provided based on the filtering technology, and compared with a method based on deep learning, the calculation process of a large number of parameters is avoided. Compared with the existing method, the method disclosed by the invention shows huge advantages on a plurality of widely applied data sets.

Description

technical field [0001] The invention belongs to the field of attribute graph clustering, in particular to a filter-based graph clustering method. Background technique [0002] Graph clustering is a long-standing problem in machine learning, data mining, and pattern recognition, which has countless application scenarios, such as community analysis, protein structure analysis, etc. The input data of graph clustering is generally an attribute graph, and the output result is the category of each node. The attribute graph is composed of each node's own characteristics and the edge set between nodes, which is a good representation method of Non-Euclidean Structure Data. Since graph clustering is an unsupervised learning task, it is very difficult to achieve good results on data. Its performance is easily affected by many factors, such as clustering methods, data noise, etc. Various graph clustering techniques, such as K-means and spectral clustering, are particularly popular. ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62G06T5/00G06N20/00
CPCG06N20/00G06F18/23G06T5/70
Inventor 康昭刘展宇林治平田玲罗光春
Owner UNIV OF ELECTRONICS SCI & TECH OF CHINA
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products