Data clustering analysis method based on genetic algorithm

A genetic algorithm and data clustering technology, applied in the field of genetic algorithm, can solve problems such as high fitness and low fitness, achieve effective classification and overcome locality

Inactive Publication Date: 2019-08-30
SHANDONG LANGCHAO YUNTOU INFORMATION TECH CO LTD
View PDF0 Cites 12 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] (2) How to construct a fitness function to measure the adaptability of each individual to the clustering problem, that is, if the code of an individual represents a good clustering result, its fitness is high; otherwise, its fitness is low
[0009] In view of the global optimization of the genetic algorithm, aiming at the shortcomings of the most widely used K-means method, the present invention proposes a K-means clustering algorithm based on the genetic algorithm to overcome the locality and the limitation of the initial K-means algorithm. Sensitivity of cluster centers

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Data clustering analysis method based on genetic algorithm
  • Data clustering analysis method based on genetic algorithm
  • Data clustering analysis method based on genetic algorithm

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0050] In order to make the technical problems, technical solutions and beneficial effects to be solved by the present invention clearer, the present invention will be described in detail below in conjunction with the embodiments. It should be noted that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0051] The data clustering analysis method based on genetic algorithm comprises the following steps:

[0052] (A) First select the initial population from the sample set to be clustered;

[0053] (B) Execute the genetic algorithm on the selected initial population;

[0054] (C) Execute the K-means operation on the new population generated after the genetic algorithm is executed;

[0055] (D) Step (A)-step (C) is repeated until the optimal solution of the clustering problem is found.

[0056] In the step (A), the initial population is randomly generated, and the specific steps are as follows:

[0057...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention particularly relates to a data clustering analysis method based on a genetic algorithm. The data clustering analysis method based on a genetic algorithm comprises the following steps: firstly, selecting an initial population from a sample set to be clustered; executing a genetic algorithm on the selected initial population; executing k-means operation on the new population generatedafter the genetic algorithm is executed; and repeating step (A) to step (C) until the optimal solution of the clustering problem is found. In the data clustering analysis method based on a genetic algorithm, local optimization of the K-means algorithm and global optimization of the genetic algorithm are combined, the optimal clustering number and the initial centroid set are finally obtained through multiple times of selection, intersection and variation genetic operation, the locality and sensitivity to the initial clustering center of a traditional K-means algorithm are overcome, and effective classification of data is achieved.

Description

technical field [0001] The invention relates to the technical field of genetic algorithms, in particular to a data clustering analysis method based on genetic algorithms. Background technique [0002] Clustering analysis is an unsupervised learning process, which refers to clustering things into clusters according to certain attributes of things, so that the similarity between clusters is as small as possible, and the similarity within clusters is as large as possible, so as to realize the classification of data. Cluster analysis is an important part of data mining technology. It can be used as an independent data mining tool to obtain the distribution of data in the database, and it can also be used as a preprocessing step for other data mining algorithms. Clustering analysis has become the main research field of data mining, and has been widely used in the fields of pattern recognition, image processing, data analysis and customer relationship management. [0003] K-means...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62G06N3/12
CPCG06N3/126G06F18/23213
Inventor 王利鑫
Owner SHANDONG LANGCHAO YUNTOU INFORMATION TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products