Method for automatically identifying optimal K value of elbow rules

An automatic recognition and optimal technology, applied in character and pattern recognition, computer parts, instruments, etc., can solve the problems of low clustering quality, inability to determine the number of clusters in advance, affecting the use and popularization, and reduce dependence. Effect

Inactive Publication Date: 2018-04-06
KUNMING UNIV OF SCI & TECH
View PDF0 Cites 9 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] However, the K-means clustering algorithm also inevitably has shortcomings: the appropriate number of clusters cannot be determined in advance, resulting in low cluster quality
However, the elbow method often needs to roughly estimate the relatively reasonable number of clusters through observation, and the elbow method needs to use observation to find the optimal number of clusters, which affects its use in automated systems. and its promotion

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for automatically identifying optimal K value of elbow rules
  • Method for automatically identifying optimal K value of elbow rules
  • Method for automatically identifying optimal K value of elbow rules

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0035] Embodiment 1: as Figure 1-5 As shown, a method for automatically identifying the optimal K value in the elbow rule includes the following steps:

[0036] (1) Set the range [1, M] to find the optimal K value in the K-means clustering algorithm;

[0037] (2) Calculate the average degree of distortion corresponding to the number of clusters in the set range;

[0038] (3) Carry out a regularization conversion of 0-10 to the calculated average degree of distortion;

[0039] (4) Encapsulate the average degree of distortion and the range [1, M] after regularization of 0-10 into a data pair;

[0040] (5) Utilize the law of cosines to seek the included angle between the three consecutive data pairs that above-mentioned encapsulation forms;

[0041] (6) Find the smallest included angle;

[0042] (7) Use the smallest included angle to get the optimal K value.

[0043] The specific steps of a method for automatically identifying the optimal K value in the elbow rule are as fo...

example 1

[0057] Example 1: The specific steps of the method for automatically identifying the optimal K value in the elbow rule are as follows:

[0058] Step1. Set the range of the optimal K value to be found in the K-means clustering algorithm Range: [1, 2, ..., M]; specifically:

[0059] Set the range Range to find the optimal K value in the K-means clustering algorithm as: [1, 2, 3, 4, 5, 6, 7, 8, 9, 10];

[0060] Step2, initialize k=1, and generate an average distortion list MDL whose length is M and all elements are 0; specifically:

[0061] Initialize k to 1, that is, k=1; generate an average distortion list MDL whose length is 10 and all elements are 0: MDL=[0,0,0,0,0,0,0,0,0, 0];

[0062] Step3, if k∈Range, execute Step4; if Then skip Step4-Step7 and execute Step8; specifically:

[0063] For example, when k=1, 1 ∈ [1, 2, 3, 4, 5, 6, 7, 8, 9, 10], namely k ∈ Range, execute Step4 (when k=2, 3, 4, 5, 6, 7, 8, 9, 10 are similar to the situation when k=1); for example, when k=...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a method for automatically identifying an optimal K value of elbow rules and belongs to the technical field of unsupervised clustering learning in machine learning. The methodcomprises steps that the to-be-searched optimal K value range [1, M] of the K-means clustering algorithm is set; the average distortion degree corresponding to the set range cluster number is calculated; 0-10 regularized conversion of the average distortion degree acquired through calculation is carried out; the average distortion degree after 0-10 regularized conversion and the range [1,M] are encapsulated into data pairs; the Cosine's theorem is utilized to solve angles among consecutive three encapsulated data pairs; the minimum angle is acquired; the minimum angle is utilized to acquire the optimal K value. The method is advantaged in that on the basis of K-means algorithm and the elbow rules, through 0-10 regularized conversion of the average distortion degree acquired through the elbow rules and further calculation carried out through utilizing the Cosine's theorem, the optimal K value in the designated range is lastly acquired.

Description

technical field [0001] The invention relates to a method for automatically identifying the optimal K value in the elbow rule, in particular to a method for combining and utilizing the K-Means clustering algorithm commonly used in unsupervised learning in machine learning, the elbow rule and the rule of passing 0-10 Regularize the average degree of distortion, package the 0-10 regularized average degree of distortion and the corresponding number of clusters into data pairs, and use the law of cosines to find the angle between three consecutive data points for automatic identification The method of the optimal K value estimated in the elbow rule belongs to the field of unsupervised clustering learning in machine learning. Background technique [0002] With the development of information technology and more and more data generated by people, human beings have gradually entered the era of big data. People use machine learning to study big data, and then acquire new knowledge or...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62
CPCG06F18/23213
Inventor 石聪明王锋邓辉戴伟张晓丽杨秋萍卫守林
Owner KUNMING UNIV OF SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products