A data clustering method and device
A data clustering technology, applied in the field of data processing, that addresses the problems of reduced clustering accuracy and of not considering the impact of clustering, and achieves the effect of improving accuracy.
Embodiment Construction
[0060] Currently, the data clustering problem is to find, among the data sets C_j (j from 1 to K), a data set C in which the distance between the data is minimized, where each data set C_j is characterized by its similarity-based mean c_j (which can be regarded as the preset initial centroid of data set C_j). Minimizing the distance between data in the same data set can also be regarded as minimizing the distance between each piece of data in that data set and the preset initial centroid of that data set.
[0061] The applicant studies a clustering algorithm suitable for uncertain data, starting from the hard clustering algorithm K-means clustering (K-means). The purpose of the K-means algorithm is to find a data set C from the K data sets that minimizes the sum of squared errors (SSE). The formula for calculating the sum of squared errors is as follows:
[0062] $$\mathrm{SSE} = \sum_{j=1}^{K} \sum_{x_i \in C_j} \lVert x_i - c_j \rVert^2$$
[0063] ||.|| represents the distance between a data item x_i and the preset initial centroid c_j of its data set. For example, Euclidean dis...
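As an illustration of the sum of squared errors described in [0061] to [0063], the following is a minimal Python sketch, not the applicant's implementation. It assumes the Euclidean distance as the distance measure, and the names sum_squared_errors, clusters and centroids, as well as the sample values, are hypothetical and chosen only for this example.

```python
from math import dist  # Euclidean distance between two points (Python 3.8+)


def sum_squared_errors(clusters, centroids):
    """Sum, over every data set C_j and every x_i in C_j, of ||x_i - c_j||^2.

    Assumption: ||.|| is the Euclidean distance.
    """
    total = 0.0
    for points, centroid in zip(clusters, centroids):
        for x in points:
            total += dist(x, centroid) ** 2
    return total


# Hypothetical example: K = 2 data sets C_1, C_2 with centroids c_1, c_2.
clusters = [
    [(0.0, 0.0), (2.0, 0.0)],
    [(5.0, 5.0), (5.0, 7.0)],
]
centroids = [(1.0, 0.0), (5.0, 6.0)]

sse = sum_squared_errors(clusters, centroids)
print(sse)
```

Each of the four points in this example lies at distance 1 from its centroid, so each contributes 1 to the sum. A smaller SSE indicates that the data in each data set lie closer to that data set's centroid.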