Minimum spanning tree clustering algorithm and system based on density core

A clustering algorithm and tree-spanning technology, applied in the computer field, can solve problems such as inability to process data sets and high time complexity

Inactive Publication Date: 2021-02-12
CHONGQING UNIV
View PDF0 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The density peak clustering algorithm can effectively cluster data sets, but it needs to manually select the cluster center, cannot handle data sets with complex shapes, and has disadvantages such as high time complexity

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Minimum spanning tree clustering algorithm and system based on density core
  • Minimum spanning tree clustering algorithm and system based on density core
  • Minimum spanning tree clustering algorithm and system based on density core

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0039] Embodiments of the present invention are described in detail below, examples of which are shown in the drawings, wherein the same or similar reference numerals designate the same or similar elements or elements having the same or similar functions throughout. The embodiments described below by referring to the figures are exemplary only for explaining the present invention and should not be construed as limiting the present invention.

[0040]In the description of the present invention, unless otherwise specified and limited, it should be noted that the terms "installation", "connection" and "connection" should be understood in a broad sense, for example, it can be mechanical connection or electrical connection, or two The internal communication of each element may be directly connected or indirectly connected through an intermediary. Those skilled in the art can understand the specific meanings of the above terms according to specific situations.

[0041] Such as figu...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a minimum spanning tree clustering algorithm and system based on a density core. The algorithm comprises the following steps: constructing a KD tree; obtaining reverse neighborinformation and natural characteristic values of the data points by adopting a natural neighbor method, and counting the number of reverse neighbors of each data point; taking the data points of whichthe reverse neighbor number is not less than the natural characteristic value as core points, and forming a density core point set by the core points; establishing a minimum spanning tree according to the density core set to obtain a set of weights of each edge in the minimum spanning tree; calculating a trimming threshold value according to the set of the weights of the edges in the minimum spanning tree, and cutting off the edges connected with different clusters in the minimum spanning tree according to the trimming threshold value to obtain a minimum spanning sub-tree of each sub-cluster;generating a sub-tree clustering density core according to the obtained sub-cluster minimum; and distributing the non-density core points to the cluster of the density core closest to the non-densitycore points to complete clustering. According to the algorithm, the approximate shape and structure of the cluster can be well reserved, so that the algorithm can adapt to a data set with a complex shape.

Description

technical field [0001] The invention relates to the field of computers, in particular to a minimum spanning tree clustering algorithm and system based on a density kernel. Background technique [0002] In recent years, data information has shown an explosive growth trend, forming huge and complex massive data. These valuable data cannot be analyzed and understood by humans without the help of automatic analysis methods. In order to extract effective information in a timely manner from huge and complex data, effective data analysis tools are needed, and data mining technology has emerged along with the trend. [0003] Data mining can be divided into the following three steps: preprocessing, data analysis and knowledge evaluation. After obtaining the original data from the data source, use data preprocessing methods such as data cleaning, data conversion, feature selection, and feature extraction for preprocessing, and then select data analysis algorithms to analyze the proc...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62G06F16/2458
CPCG06F16/2465G06F18/23G06F18/24323
Inventor 高强高琴琴熊忠阳张玉芳
Owner CHONGQING UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products