Characteristic vector group's best selected spectrum clustering method based on density self-adaptation

An eigenvector and self-adaptive technology, applied in the field of spectral clustering algorithm, can solve problems such as unsatisfactory multi-scale data sets, difficulty in selecting eigenvector groups, and manually determining the number of clusters.

Inactive Publication Date: 2017-10-10
ZHEJIANG UNIV OF TECH
View PDF0 Cites 15 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] In order to overcome the existing spectral clustering methods, some custom parameters need to be set manually, the effect is not ideal when dealing with multi-scale data sets, the number of clusters needs to be manua...

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Characteristic vector group's best selected spectrum clustering method based on density self-adaptation
  • Characteristic vector group's best selected spectrum clustering method based on density self-adaptation
  • Characteristic vector group's best selected spectrum clustering method based on density self-adaptation

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0070] The present invention will be further described below in conjunction with the accompanying drawings.

[0071] refer to Figure 1 to Figure 5 , a density adaptive eigenvector group optimal selection spectral clustering method, including the following steps:

[0072] 1) Data preprocessing:

[0073] 1.1) Initialize the settings, determine the iteration range of the number of adjacent points and the initial number of adjacent points, set the initial Fitness function value to Fitness_g=0, and divide the interval number block;

[0074] 1.2) Analyzing the actual data set, we can see that some dimensions of some data sets are much larger than other dimensions of the data set, and the difference between the values ​​of these dimensions is large, which may lead to the weakening of the importance of other dimensions even ignored. Without obtaining the weight information of each dimension of the data set, the minimum-maximum method is used to normalize the data so that it falls ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention proposes a characteristic vector group's best selected spectrum clustering method based on density self-adaptation. The method comprises the following steps: 1) first, performing pre-treatment; 2) calculating the sparse similarity matrix based on density self-adaptation; 3) calling the automatically determining clustering central algorithm; 4) decomposing the characteristics and seeking the characteristic vector group; 5) performing the standardizing treatment to all elements in the data set in the mapped characteristic vector group in the characteristic space, followed by the K-means clustering to obtain the clustering result; and 6) calculating the Fitness function values; performing iterations constantly; and selecting and outputting the clustering result of the number of the best clear-cut points corresponding to the highest Fitness function value. According to the invention, it is possible to introduce the data point density information to a similar function so as to improve the time gathering effect of multi-scale data of the algorithm processing, to select the best characteristic vector group and to utilize the Fitness function to realize the parameter self-adaptation of the number of clear-cut points.

Description

technical field [0001] The invention belongs to the field of spectral clustering algorithms, in particular to a spectral clustering method for optimal selection of feature vector groups based on density self-adaptation. Background technique [0002] Clustering is one of the important research contents in data mining, pattern recognition, machine learning and other fields. Clustering is to gather samples with higher similarity together and put samples with lower similarity in different clusters. With the development of clustering technology, scholars have proposed a variety of clustering algorithms, such as the most classic K-means algorithm in cluster analysis, which is relatively simple and can be used for clustering of various types of data. But when dealing with non-convex data sets, the K-means algorithm tends to fall into a local optimal solution. In order to solve the problem that K-means algorithm cannot cluster on non-convex data sets, researchers began to use spec...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06K9/62
CPCG06F18/211G06F18/23213
Inventor 陈晋音吴洋洋林翔郑海斌
Owner ZHEJIANG UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products