Check patentability & draft patents in minutes with Patsnap Eureka AI!

Clustering algorithm for quickly searching and finding density peak value for complex data

A technology of peak density and complex data, applied in the field of big data analysis, it can solve the problems associated with errors, unable to find the correct peak density point and distribution method, etc., and achieve the effect of improving clustering accuracy.

Inactive Publication Date: 2020-03-27
HOHAI UNIV
View PDF0 Cites 11 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] Purpose of the invention: for the problems existing in the prior art, the present invention provides a clustering algorithm (A Fast Clustering Algorithm for Searching and Finding Density Peaks for Complex Data, FCA-SFDPCD) for complex data-oriented fast searching and finding density peaks, To solve the inability to find the correct density peak point and the distribution method are prone to errors and joint problems when dealing with complex data sets of multi-scale, cross-winding and flow patterns

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Clustering algorithm for quickly searching and finding density peak value for complex data
  • Clustering algorithm for quickly searching and finding density peak value for complex data
  • Clustering algorithm for quickly searching and finding density peak value for complex data

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0048] The technical solutions of the present invention will be further illustrated below in conjunction with specific embodiments.

[0049] Density peak clustering algorithm is a density-based clustering algorithm, which can not only find the density peak points of each cluster, cluster each cluster, but also eliminate outliers. And the algorithm is not affected by the shape and size of micro-clusters. The DPC algorithm is based on the assumption that (1) the cluster center is surrounded by other data points with lower density in the cluster; (2) the relative distance between the cluster centers is large. Therefore, in order to find the density peak point of each cluster, the DPC algorithm introduces two concepts: (1) The local density ρ of data point i i ; (2) Relative distance δ of data point i i .

[0050] For the local density of data point i, the DPC algorithm gives two measurement methods: truncated kernel and Gaussian kernel. The truncated kernel metric is given by...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a clustering algorithm for quickly searching and finding a density peak value for complex data. The method includes: defining the eccentricity of the sample through the distance between the sample and the k-nearest neighbor sample based on the local density definition mode of the accumulated neighborhood, determining the local density of the sample through the comparison ofthe eccentricity of the sample and the k-nearest neighbor sample, and enabling the local information of the sample to be fully utilized through the local density; according to the micro-cluster merging and distributing strategy based on graphic degree connection, firstly, using a density peak clustering and distributing strategyto divide samples into a plurality of micro-clusters, then calculating the weighted proximity between the samples, determining the similarity degree between the micro-clusters, merging the micro-clusters with the highest similarity degree in sequence, and forming a final cluste. Experimental results show that the density peak point can be correctly found in a multi-scale, cross-wound and flow pattern complex data set, other samples can be correctly distributed, andthe clustering precision is greatly improved.

Description

technical field [0001] The invention relates to a clustering algorithm in the field of big data analysis, in particular to a clustering algorithm for fast searching and finding density peaks for complex data. Background technique [0002] The density peak clustering algorithm (Clustering by fast search and find of density peaks, DPC) was proposed by Alex Rodriguez and Alessandro Laio in 2014, and the paper was published in Science. Because of its simple algorithm principle, high-efficiency operation, fast finding of density peak points (cluster centers) without iterative calculation of the objective function, and suitable for cluster analysis of large-scale data sets, it has attracted the attention of scholars since it was proposed. It has been widely used in image processing, community network discovery, gene sequence recombination, travel agency problems, etc., but it is difficult for the DPC algorithm to correctly find the density peak point when processing complex data s...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06K9/62
CPCG06F18/23213
Inventor 徐立中赵嘉黄晶郝振纯陈哲许叶军
Owner HOHAI UNIV
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More