Density peak-core fusion-based adaptive clustering method

An adaptive clustering and density peak technology, applied in the field of pattern recognition, can solve the problems of the influence of the clustering effect, the urgent need to improve the accuracy and efficiency of the clustering method, and the limited application effect.

Active Publication Date: 2019-06-04
NANJING UNIV OF AERONAUTICS & ASTRONAUTICS
View PDF10 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Although the CFSFDP method performs adaptive clustering on data sets with arbitrary shape distributions, this method also has some shortcomings.
First, the clustering effect of the CFSFDP method is easily affected by the density estimation results
Secondly, the process of artificially determining the class center in the CFSFDP method limits its application effect in automated tasks
The most important point is that there may be multiple density peak points in a class. The CFSFDP method will...

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Density peak-core fusion-based adaptive clustering method
  • Density peak-core fusion-based adaptive clustering method
  • Density peak-core fusion-based adaptive clustering method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0062] The invention will be described in further detail below in conjunction with the accompanying drawings.

[0063] In this embodiment, a two-dimensional actual medical industry data set is taken as an example to specifically illustrate the implementation process of the method of the present invention. The distribution of two-dimensional actual medical industry datasets is as follows: figure 1 shown. The data set actually contains two classes, the distance between these two classes is very close and the dividing line is not obvious; secondly, the distribution within the class is complex, figure 1 Middle class 2 contains multiple density peak points. The two-dimensional actual medical industry data set contains 240 data points in total, so n=240, and the dimension d=2.

[0064] Whole method flow process of the present invention is as figure 2 shown.

[0065] 1. Calculate the distance between data points in the data set and calculate the cut-off distance d c , computin...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a density peak-core fusion-based adaptive clustering method, and belongs to the field of pattern recognition. The method comprises the following steps: (1) firstly, carrying outdensity neighbor clustering based on a density peak value: calculating the density of each data point in the to-be-clustered data set by using a core density-k-nearest neighbor density estimation method, determining density peak points through an adaptive threshold value, and performing density nearest neighbor clustering by taking the density peak points as a class center to obtain an initial clustering result; And (2) performing core fusion operation based on intra-class divergence: adaptively determining a core point in each initial class, fusing the two initial classes if the core pointsof the two initial classes are adjacent and the intra-class divergence can be reduced after fusion, and fusing all the initial classes to be fused to obtain a final clustering result. The method is simple, accurate and efficient, and can perform self-adaptive clustering on the data set with any shape and density distribution.

Description

technical field [0001] The invention relates to an adaptive clustering method based on density peak-core fusion, which belongs to the field of pattern recognition. Background technique [0002] Clustering method is an important technology in the field of pattern recognition and machine learning, and is widely used in face recognition, search engineering, image partitioning and other fields. Clustering is the process of dividing a data set into classes or clusters according to the similarity between data points. Data points belonging to the same class have a greater similarity, while data points belonging to different classes should be as dissimilar as possible. . The similarity between data points can be measured by distance, and the most common distance is Euclidean distance. Due to the diversity of data sources, properties, and distributions, as well as the needs of automated industrial processes, clustering methods are required in many fields to automatically determine ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06K9/62
Inventor 邱雷房芳袁慎芳任元强
Owner NANJING UNIV OF AERONAUTICS & ASTRONAUTICS
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products