Self-adaptive K-means clustering method based on local density and ball hash

A technology of local density and clustering method, which is applied to computer parts, instruments, characters and pattern recognition to improve the clustering effect, reduce the number of clustering iterations, and improve the convergence speed.

Inactive Publication Date: 2019-12-03
SOUTHWEST JIAOTONG UNIV
View PDF0 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] Aiming at the above-mentioned deficiencies in the prior art, an adaptive K-means clustering method based on local density and spherical hashing provided by the present invention solves the optimization of the number of categories k and the initial cluster center existing in the K-means clustering process The selection problem, this method can adaptively select the number of categories k and the initial cluster center for different data sets, and improve the adaptability and clustering effect of K-means clustering

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Self-adaptive K-means clustering method based on local density and ball hash
  • Self-adaptive K-means clustering method based on local density and ball hash
  • Self-adaptive K-means clustering method based on local density and ball hash

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0032] The specific embodiments of the present invention are described below so that those skilled in the art can understand the present invention, but it should be clear that the present invention is not limited to the scope of the specific embodiments. For those of ordinary skill in the art, as long as various changes Within the spirit and scope of the present invention defined and determined by the appended claims, these changes are obvious, and all inventions and creations using the concept of the present invention are included in the protection list.

[0033] Such as figure 1 As shown, in one embodiment of the present invention, an adaptive K-means clustering method based on local density and spherical hashing comprises the following steps:

[0034] S1. Perform normalization processing on the data set D′ to be clustered to obtain a normalized data set D={x 1 , x 2 ,...,x N}, where x i is the i-th M-dimensional data sample in the data set, i is an integer in the closed...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a self-adaptive K-means clustering method based on local density and ball hash. Firstly, a target data set is normalized, the local density and median of samples in the data set are calculated to determine the clustering category number k, then k optimized initial clustering centers are obtained according to the quartile and the ball hash value under the local density of the adjacent samples, and finally clustering of the data set is completed through a K-means clustering algorithm. According to the method, the category number k and the initial clustering center of thedata set are automatically determined, the defects that the category number needs to be set in advance and the initial clustering center needs to be set at will in a traditional K-means clustering algorithm are overcome, and K-means clustering of the data set can be carried out in a self-adaptive mode.

Description

technical field [0001] The invention relates to the field of data mining, in particular to an adaptive K-means clustering method based on local density and spherical hashing. Background technique [0002] In the definition of the field of data mining, a data set is a collection of data, clustering is a process of classifying and organizing data members that are similar in some aspects in a data set, and clustering is a process of discovering such data. Intrinsically structured techniques, clustering techniques are often referred to as unsupervised learning. [0003] Cluster analysis is an effective data mining method, which can express the internal structure characteristics of data, and the scatter diagram generated by the clustering results is also an effective means of data visualization. K-means clustering algorithm (K-means clustering algorithm) is a cluster analysis algorithm for iterative solution. Its steps are to randomly select k objects as initial cluster centers,...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62
CPCG06F18/23213G06F18/24147G06F18/22
Inventor 王小敏张文芳何卓兵
Owner SOUTHWEST JIAOTONG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products