Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Later fusion multi-kernel clustering machine learning method and system based on proxy graph improvement

A machine learning and clustering technology, applied in the field of machine learning, can solve problems such as unsatisfactory clustering effect, difficulty in applying multi-core data, separation, etc., to achieve the effect of facilitating view fusion, excellent performance, and improved clustering effect

Pending Publication Date: 2021-09-24
ZHEJIANG NORMAL UNIVERSITY
View PDF0 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the existing post-fusion clustering algorithms still have the following shortcomings: First, the clustering process of the basic kernel and the post-fusion process of the basic partition are separated
In this case, the quality of the basic division has a great influence on the performance of the final clustering. If there are abnormal points and noise, the clustering effect will be unsatisfactory.
The second is that the existing methods simply regard the consistent partition as a linear transformation of the basic partition, making it difficult to apply to multi-core data in reality

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Later fusion multi-kernel clustering machine learning method and system based on proxy graph improvement
  • Later fusion multi-kernel clustering machine learning method and system based on proxy graph improvement
  • Later fusion multi-kernel clustering machine learning method and system based on proxy graph improvement

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0096] This embodiment provides an improved post-fusion multi-core clustering machine learning method based on the proxy graph, such as Figure 1-2 shown, including steps:

[0097] S1. Acquire clustering tasks and target data samples;

[0098] S2. Initialize the agent graph improvement matrix;

[0099] S3. Run k-means clustering and graph improvement on each view corresponding to the clustering task and the target data sample, and construct an objective function by combining kernel k-means clustering and graph improvement;

[0100] S4. Solving the objective function constructed in step S3 in a cyclic manner to obtain a graph matrix of fusion basic nuclear information;

[0101] S5. Perform spectral clustering on the obtained graph matrix to obtain a final clustering result.

[0102] In step S3, run k-means clustering and graph improvement on each view corresponding to the clustering task and the target data sample, and construct an objective function by combining kernel k-me...

Embodiment 2

[0146] The difference between the post-fusion multi-core clustering machine learning method based on agent graph improvement provided in this embodiment and Embodiment 1 is that:

[0147] In this embodiment, the clustering performance of the method of the present invention is tested on six MKL standard data sets.

[0148] The 6 MKL standard datasets include AR10P, YALE, Protein fold prediction, OxfordFlower17, Nonplant, Oxford Flower102. For information about the dataset, see Table 1.

[0149] Dataset Samples Kernels Clusters AR10P 130 6 10 YALE 165 5 15 Protein Fold 694 12 27 Flower17 1360 7 17 nonplant 2372 69 3 Flower102 8189 4 102

[0150] Table 1

[0151] For ProteinFold, this embodiment generates 12 benchmark kernel matrices, in which the first 10 feature sets use the second-order polynomial kernel, and the last two use the cosine inner product kernel. Kernel matrices for other datasets are available ...

Embodiment 3

[0162] This embodiment provides an improved post-fusion multi-core clustering machine learning system based on proxy graphs, including:

[0163] Obtaining module, used for obtaining clustering tasks and target data samples;

[0164] The initialization module is used to initialize the agent graph improvement matrix;

[0165] A building block for performing k-means clustering and graph improvement on each view corresponding to the clustering task and the target data sample, and constructing an objective function by combining kernel k-means clustering and graph improvement;

[0166] The solution module is used to solve the constructed objective function in a cyclic manner to obtain a graph matrix fused with basic kernel information;

[0167] The clustering module is used to perform spectral clustering on the obtained graph matrix to obtain the final clustering result.

[0168] Further, the objective function of kernel k-means clustering in the building block is expressed as:

...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a later fusion multi-kernel clustering machine learning method and system based on proxy graph improvement. The later fusion multi-kernel clustering machine learning method based on proxy graph improvement involved in the invention comprises the following steps: S1, acquiring a clustering task and a target data sample; S2, initializing a proxy graph improvement matrix; S3, performing k-means clustering and graph improvement on each view corresponding to the obtained clustering task and the target data sample, and constructing a target function in combination with a kernel k-means clustering and graph improvement method; S4, solving the objective function constructed in the step S3 by adopting a circulation mode to obtain a graph matrix fused with basic kernel information; and S5, performing spectral clustering on the obtained graph matrix to obtain a final clustering result. The optimized basic division not only has information of a single core, but also can obtain global information through the proxy graph, and fusion of views is more facilitated, so that the learned proxy graph can better fuse information of each core matrix, and the purpose of improving the clustering effect is achieved.

Description

technical field [0001] The present invention relates to the technical field of machine learning, in particular to a later-stage fusion multi-core clustering machine learning method and system based on agent graph improvement. Background technique [0002] Clustering plays an important role in machine learning and data analysis, and its goal is to divide unlabeled data into several unrelated classes. In the era of big data, data is collected from multiple sources, and this type of data is called multi-view data. Methods for clustering multi-view data are known as multi-view clustering algorithms. Multi-kernel clustering algorithm is an important branch of multi-view clustering, which aims to make full use of a series of predefined base kernels to improve clustering performance. [0003] The existing multi-kernel clustering algorithms can be roughly divided into two types: early fusion and late fusion according to the timing of fusion. Early fusion refers to the fusion of s...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06N20/00G06K9/62
CPCG06N20/00G06F18/2323G06F18/23213
Inventor 朱信忠徐慧英刘新旺李苗苗梁伟轩殷建平赵建民
Owner ZHEJIANG NORMAL UNIVERSITY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products