Two-stage hybrid particle swarm optimization clustering method

A technology of hybrid particle swarm and clustering methods, applied in the field of two-stage hybrid particle swarm optimization clustering, can solve problems such as inability to change, achieve the effect of improving accuracy and reducing computational complexity

Inactive Publication Date: 2012-09-12
XIDIAN UNIV
View PDF1 Cites 60 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Use the first-stage clustering hierarchical agglomerative clustering to obtain the initial cluster center sub-cluster set with high accuracy as the search space for the initial cluster center of the particle swarm optimization K-means clustering algorithm for the second-stage clustering, and the unselected The sub-clusters used as the clustering center are all broken up and re-divided int...

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Two-stage hybrid particle swarm optimization clustering method
  • Two-stage hybrid particle swarm optimization clustering method
  • Two-stage hybrid particle swarm optimization clustering method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0043] The present invention will be described in detail below in conjunction with the accompanying drawings.

[0044] refer to figure 1 , the concrete steps that the present invention realizes are as follows:

[0045] Step 1. Test the data set from the UCI database: randomly select a data set from Iris, Wine and Glass, and the computer reads all sample data of this data set and loads it into the memory. Each piece of sample data consists of several dimensions. Read in the value of the number of clusters K.

[0046] Step 2. Statistical dimension information

[0047] The computer traverses all the samples in the data set, counts the value range information of each dimension of the data set, and obtains the maximum and minimum values ​​of the value ranges of the attributes of each dimension in the data set.

[0048] Step 3, dimension normalization

[0049] The normalization formula is used to process the attribute values ​​of each dimension of the data sample, and the attrib...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a two-stage hybrid particle swarm optimization clustering method, which is mainly used for solving the problems of greater time consumption and low accuracy of the conventional particle swarm optimization K-mean clustering method when the number of dimensions of samples is higher. The technical scheme disclosed by the invention comprises the following steps: (1) reading a data set and the number K of clusters; (2) taking statistics on information of dimensionality; (3) standardizing the dimensionality; (4) calculating a similarity matrix; (5) generating a candidate initial clustering center; (6) performing particle swarm K-mean partitional clustering; and (7) outputting a particle swarm optimal fitness value and a corresponding data set class cluster partition result. According to the two-stage hybrid particle swarm optimization clustering method disclosed by the invention, the first-stage clustering is firstly performed by adopting agglomerative hierarchical clustering, a simplified particle encoding way is provided, the second-stage clustering is performed on data by particle swarm optimization K-mean clustering, the advantages of hierarchical agglomeration, K-mean and particle swarm optimization methods are integrated, the clustering speed is accelerated, and the global convergence ability and the accuracy of the clustering result of the method are improved.

Description

technical field [0001] The invention belongs to the field of computer technology, and further relates to a two-stage hybrid particle swarm optimization clustering method in the field of data mining technology. The invention can be widely used in data compression, information retrieval, character recognition, image segmentation and text clustering, etc., and at the same time It can have a wide range of applications in biology, marketing, and abnormal data detection. Background technique [0002] Cluster analysis is an important means and method of data division or grouping processing in data mining. It does not require any prior knowledge, and through a certain similarity measurement criterion, similar samples are classified into a cluster. The purpose of clustering is to make the similarity of samples in the same cluster larger, and the similarity of samples between different clusters smaller. Small. In biology, cluster analysis can be used to cluster the genes of organism...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30
Inventor 刘志镜王纵虎王韦桦陈东辉屈鉴铭贺文骅王静姚勇熊静唐国良袁通刘慧
Owner XIDIAN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products