Data clustering method based on bivariant weighted kernel FCM algorithm

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A data clustering and variable weighting technology, applied in the field of data clustering, to achieve the effect of strong anti-noise, accurate clustering results, and increased robustness

Active Publication Date: 2018-11-06

HEFEI UNIV OF TECH

View PDF6 Cites 6 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0006] In order to overcome the shortcomings of the above-mentioned prior art, the present invention provides a data clustering method based on the bivariate weighted kernel FCM algorithm, in order to accurately avoid the fact that FCM is sensitive to noise points, and PCM is easy to produce consistent clustering problems, increase the accuracy of the algorithm, and more accurately mine the data structure information existing in the data set

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0044] see figure 1 , in this embodiment, the data clustering method based on the bivariate weighted kernel FCM algorithm is used to cluster customer information, and recommend products to customers according to the clustering results, as follows:

[0045] Step 1, data set X is customer information, X={x 1 ,x 2 ,...,x n},x j is the jth data point; j=1,2,...,n, n is the total number of data, for example: take 10,000 customer information, that is, n=10000; optimally divide the data set X, so that the formula ( The value of the objective function J in 1) is the minimum:

[0046]

[0047] In formula (1), i represents the i-th category, and c represents the number of categories divided, that is, the type of product, such as: c=10, 1≤i≤c, 2≤c≤n; u ij represents the jth data point x j The membership degree value of the i-th category; for u ij m power of , m is a weighted index indicating the degree of clustering fuzziness, m can take a value of 2, t ij represents the jth...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a data clustering method based on a bivariant weighted kernel FCM algorithm. The data clustering method comprises the steps that firstly, a data set is divided optimally to minimize an objective function; an initial membership degree matrix, a typical value matrix and an initial clustering center are obtained; the distances between data points in a multi-kernel high-dimensional space and the clustering center are obtained; iteration is conducted to obtain membership degree values and probability typical values; and a clustering result corresponding to the minimum valueobtained by the objective function is adopted as a final clustering result. According to the data clustering method, a kernel function guided by a combined kernel is adopted to replace an ordinary Euclidean distance function, and thus linear data and nonlinear data can be better divided; and the anti-noise property of the algorithm is improved through the typical value matrix, the accuracy of algorithm clustering is improved, the proportion of various kernels in the combined kernel can be automatically adjusted to meet the requirements of different datasets for different kernel functions, andthe problem of selection uncertainty of an ordinary kernel algorithm to the kernel functions is solved.

Description

technical field [0001] The invention relates to the technical field of data clustering, in particular to a data clustering method based on a bivariate weighted kernel FCM algorithm. Background technique [0002] Clustering is an important research content in the field of data mining and artificial intelligence, and it has played a great role in many fields such as big data, pattern recognition, image segmentation, and machine learning. Clustering is the process of dividing data according to the similarity rules of the data. The result of the division is determined by the rules. The divided groups or sets are often called clusters. Fuzzy c-means clustering algorithm (FCM) is the most basic method of fuzzy clustering, which lays the foundation for clustering algorithms based on objective functions, but this method is not only sensitive to the initialization of cluster centers, but also susceptible to noise points. In order to improve the noise immunity of the algorithm, Krish...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G06F17/30

Inventor 唐益明胡相慧丰刚永华丹阳任福继张有成宋小成

Owner HEFEI UNIV OF TECH

Data clustering method based on bivariant weighted kernel FCM algorithm

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology