Density peak clustering algorithm based on density adaptive distance

A density peak clustering and self-adaptive technology, applied in the field of cluster analysis, can solve the problems of large differences in density, inability to select, and cluster centers are easily selected by mistake, so as to achieve the effect of reducing and amplifying differences.

Inactive Publication Date: 2016-09-07
JIANGNAN UNIV
View PDF0 Cites 38 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, for those data sets with complex structures, due to the large density difference between different clusters, or there are multiple high-density areas in the same cluster, or the density distribution of the same cluster is relatively uniform, t

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Density peak clustering algorithm based on density adaptive distance
  • Density peak clustering algorithm based on density adaptive distance
  • Density peak clustering algorithm based on density adaptive distance

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0044] In order to clarify the purpose, technical solutions and advantages of the present invention, the present invention will be further described in detail below in conjunction with specific embodiments and accompanying drawings.

[0045] refer to figure 1 , the specific implementation process of the present invention comprises the following steps:

[0046] (1) Input data set X={x 1 ,x 2 ,...,x n}∈R D , the proportion value p of the total number of neighbor points of the data point to the total number of samples in the data set, and the distance adjustment factor α; where, n represents the number of samples, and D represents the dimension of the sample.

[0047] (2) First calculate the data point x i with x j The Euclidean distance between:

[0048] d ( x i , x j ) = Σ ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a density peak clustering algorithm based on the density adaptive distance, and aims at solving the problem that a density peak clustering algorithm based on the Euclidean distance is incapable of processing a data set of complex structure effectively. The density peak clustering algorithm based on the density adaptive distance is realized by that (1) the density adaptive distance is calculated according to the Euclidean distance and the adaptive similarity, so that a data space distribution structure is described in a better way; (2) an input parameter, namely the cutoff distance, of the algorithm is calculated according to the proportion of the total number of neighbor points of data points to the total number of a data set sample on the basis of the density adaptive distance; (3) according to the cutoff distance and the density adaptive distance, the local density of each data point as well as the shortest distance from the data point to a point of higher local density are calculated, a decision diagram is drafted, and a clustering center is selected; and (4) each residual point is distributed to a cluster to which the nearest neighbor point of the higher local density belongs, and a clustering result is obtained. Experiments on artificial data sets and UCI real data sets show that the density peak clustering algorithm based on the density adaptive distance, compared with the density peak clustering algorithm based on the Euclidean distance, can handle the data set of complex structure and is higher in accuracy.

Description

technical field [0001] The invention belongs to the technical field of cluster analysis, and mainly relates to the improvement and optimization of a density peak clustering algorithm. Specifically, it is a clustering algorithm based on density-adaptive distance, which can be applied in the fields of pattern recognition, data mining and image processing. Background technique [0002] As an important unsupervised data analysis method, clustering can be used not only as an independent tool to discover hidden information in data, but also as a preprocessing step of other data analysis algorithms. , image processing and other fields have been widely researched and applied. [0003] Clustering is to divide the unknown classification data set into different classes or clusters according to the similarity of data objects, so that the data objects in the same cluster have the maximum similarity, and the data objects between different clusters have the minimum similarity. At present...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06K9/62
CPCG06F18/23211
Inventor 葛洪伟李涛李莉朱嘉钢
Owner JIANGNAN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products