Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Optimal neighbor multi-kernel clustering method and system based on local kernel

A kernel clustering and neighbor technology, applied in the field of data analysis, can solve the problems of low reliability, neglect, and insufficient consideration of the local density representation ability of a single data sample, and achieve the effect of excellent performance and improved performance.

Pending Publication Date: 2021-08-17
ZHEJIANG NORMAL UNIVERSITY
View PDF0 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The above MKC algorithm has two problems: it does not fully consider the local density around a single data sample and excessively restricts the representation ability of learning the optimal kernel
Therefore, such local kernels cannot minimize unreliability due to ignoring the local density around a single data sample
At the same time, most multi-kernel clustering algorithms assume that the optimal kernel is a weighted combination of pre-specified kernels, while ignoring some more robust kernels in the complement of kernel combinations

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Optimal neighbor multi-kernel clustering method and system based on local kernel
  • Optimal neighbor multi-kernel clustering method and system based on local kernel
  • Optimal neighbor multi-kernel clustering method and system based on local kernel

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0074] A local kernel-based optimal neighbor multi-kernel clustering method provided in this embodiment, such as figure 1 shown, including:

[0075] S11. Obtain clustering tasks and target data samples;

[0076] S12. Calculate the kernel matrix of each view corresponding to the target data sample, and perform centralization and normalization processing on the kernel matrix to obtain the processed kernel matrix;

[0077] S13. According to the obtained processed kernel matrix, establish an optimal neighbor multi-kernel clustering objective function based on the local kernel;

[0078] S14. Solving the established objective function in a cyclic manner to obtain a partition matrix after view fusion;

[0079] S15. Perform k-means clustering on the obtained partition matrix to obtain a clustering result.

[0080] This embodiment proposes a new method based on local kernel-based optimal neighbor multi-kernel clustering. Compared with existing methods, it includes building an adapti...

Embodiment 2

[0150] A local kernel-based optimal neighbor multi-core clustering method provided in this embodiment is different from Embodiment 1 in that:

[0151] The main content of this embodiment includes the design of an adaptive local kernel for the current multi-kernel clustering algorithm that does not fully consider the local density of a single data sample and the representation ability of the optimal kernel learned by over-restriction, and from the pre-specified The optimal kernel is located in the neighborhood of the linear combination of kernels; the two techniques are utilized in a single multi-kernel cluster framework; the generalization range of the optimal neighborhood multi-kernel clustering algorithm based on adaptive local kernels is studied.

[0152] The above-mentioned adaptive local kernel is a sub-matrix of the kernel function, and its main function is to reflect the relationship between the sample and its neighbors. First, define the threshold ζ, and the correspond...

Embodiment 3

[0169] A local kernel-based optimal neighbor multi-core clustering method provided in this embodiment is different from Embodiment 1 in that:

[0170] This embodiment is compared with existing methods on multiple data sets to verify the effectiveness of the present invention.

[0171] data set:

[0172] Flower102: This dataset contains 8189 samples evenly distributed in 102 categories with 4 kernel matrices.

[0173] Digital: This dataset includes 2000 samples, evenly distributed in 10 categories, with 3 kernel matrices.

[0174] Caltech101: This dataset contains 1530 samples uniformly distributed in 102 categories with 25 kernel matrices.

[0175] Protein Fold: This dataset includes 694 samples, evenly distributed in 27 categories, with 12 kernel matrices.

[0176] The statistics of the above datasets are shown in Table 2.

[0177]

[0178] Table 2

[0179] Data preparation and parameter setting:

[0180] In the initialization phase, follow [C.Cortes, M.Mohri, and A....

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an optimal neighbor multi-kernel clustering method and system based on a local kernel, and the method comprises the steps: S11, obtaining a clustering task and a target data sample; S12, calculating a kernel matrix of each view corresponding to the target data sample, and carrying out centralization and normalization processing on the kernel matrix to obtain a processed kernel matrix; s13, according to the obtained processed kernel matrix, establishing an optimal neighbor multi-kernel clustering objective function based on a local kernel; s14, solving the established objective function by adopting a circulation mode to obtain a division matrix after view fusion; and S15, performing k-means clustering on the obtained division matrix to obtain a clustering result.

Description

technical field [0001] The invention relates to the technical field of data analysis, in particular to an optimal neighbor multi-kernel clustering method and system based on local kernels. Background technique [0002] Kernel clustering has been extensively explored in the current machine learning and data mining literature. It implicitly maps the original inseparable data to a high-dimensional Hilbert space, where the corresponding vertices have well-defined decision boundaries. Then, various clustering methods are applied, including k-means [K.Krishna and N.M.Murty, “Genetic k-means algorithm,” IEEE Transactions on Systems Man And Cybernetics-Part B: Cybernetics, vol.29, no.3, pp. 433–439, 1999], Fuzzy c-means [J.C. Bezdek, R. Ehrlich, and W. Full, “Fcm: The fuzzy c-means clustering algorithm,” Computers & Geosciences, vol.10, no.2-3, pp.191– 203, 1984], spectral clustering [A.Y.Ng, M.I.Jordan, and Y.Weiss, “Onspectral clustering: Analysis and an algorithm,” in Advances ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06K9/62G06F17/16
CPCG06F17/16G06F18/23G06F18/23213G06F18/213
Inventor 朱信忠徐慧英刘吉元刘新旺赵建民
Owner ZHEJIANG NORMAL UNIVERSITY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products