Check patentability & draft patents in minutes with Patsnap Eureka AI!

Method for clustering non-spherical data by using IK-means algorithm

A data clustering and non-spherical technology, applied in computing, computer components, instruments, etc., can solve problems such as reducing clustering efficiency, high computing costs, and difficulty in selecting initial parameters

Pending Publication Date: 2022-01-11
LIANYUNGANG TECHN COLLEGE
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

And those algorithms that can cluster non-spherical data, such as DBSCAN, spectral clustering, but they have some shortcomings in common: high computational cost, insurmountable difficulties in selecting initial parameters; high time complexity, for massive data The analysis is relatively slow, and it is difficult to meet the real-time requirements
However, the model selection of the kernel function is a difficult problem, and different kernel functions have different clustering results.
In addition, the kernel function increases the calculation cost and reduces the clustering efficiency
It cannot afford the timely processing of massive data streams generated in the current Internet era

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for clustering non-spherical data by using IK-means algorithm
  • Method for clustering non-spherical data by using IK-means algorithm
  • Method for clustering non-spherical data by using IK-means algorithm

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0024] In order to make the content of the present invention more clearly understood, the present invention will be further described in detail below based on specific embodiments.

[0025] A method for clustering non-spherical data using the IK-means algorithm, characterized in that,

[0026] The method steps include:

[0027] Use the classic K-means algorithm to perform initial clustering on the original data set to obtain K sub a sub-cluster;

[0028] The connectivity between each sub-cluster is calculated, and the sub-clusters whose connectivity is greater than the connectivity threshold are merged to obtain the final clustering result of the data set.

[0029] In this example, K sub The acquisition methods include:

[0030] Step S11: Analyze the data distribution using the density grid, obtain the number of clusters and the approximate initial cluster center according to the data distribution, and calculate the M value of the grid division;

[0031] Step S12: Calcula...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a method for clustering non-spherical data by using an IK-means algorithm, and the method comprises the steps: carrying out the initial clustering of an original data set through employing a classical K-means algorithm, and obtaining a Ksub sub-cluster; and calculating the connectivity among the sub-clusters, and merging the sub-clusters of which the connectivity is greater than a connectivity threshold to obtain a final clustering result of the data set. According to the method, data of any shape can be clustered, the clustering effect is good, and particularly, the method has more advantages in clustering of non-spherical data.

Description

technical field [0001] The invention relates to a method for realizing clustering of non-spherical data by using IK-means algorithm. Background technique [0002] The rapid development of Internet technology has generated a large amount of data. To analyze and process these data in a timely manner, some tools are needed. Clustering algorithm is one of the important ones. It is an unsupervised learning method that can be processed for data mining. There are many clustering algorithms, which can be roughly divided into several categories according to their working mechanism: layer-based, such as BIRCH algorithm, CURE algorithm; grid-based, such as STING algorithm, CLIQUE algorithm; partition-based, such as K-means algorithm; Based on graph theory, such as spectral clustering algorithm; density-based, such as DBSCAN algorithm, OPTICS algorithm; model-based, such as Gaussian mixture model (GMM) algorithm; fuzzy-based clustering, such as FCM algorithm. Each clustering algorit...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06K9/62
CPCG06F18/23213
Inventor 何洪磊
Owner LIANYUNGANG TECHN COLLEGE
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More