Anomaly detection method based on information entropy clustering

A technology of anomaly detection and information entropy, applied in the direction of instruments, character and pattern recognition, computer components, etc., can solve the problem that the clustering effect is easily affected by the initial clustering center, and achieve the effect of avoiding falling into local optimum

Inactive Publication Date: 2019-03-26
NANJING UNIV OF AERONAUTICS & ASTRONAUTICS
View PDF0 Cites 15 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] In the present invention, we propose a dynamic clustering method based on information entropy and k-means clustering algorithm for the problem that the clustering effect of the traditional clustering algorithm is easily affected by the initial clustering center. The value method is used to modify the distance function between objects by assigning weights to the clustering objects, using the weighting function value of the initial clustering to select a high-quality initial clustering center, optimizing the initialization process of the algorithm, and based on this Anomaly Detection Algorithm

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Anomaly detection method based on information entropy clustering
  • Anomaly detection method based on information entropy clustering
  • Anomaly detection method based on information entropy clustering

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0033] The following describes in detail an anomaly detection algorithm based on entropy clustering proposed by the present invention with reference to the accompanying drawings.

[0034] Such as figure 1 As shown, the anomaly detection algorithm based on information entropy clustering proposed in the present invention includes the following steps:

[0035] Step 1) Determine the number of initial cluster centers K and the accuracy of clustering function ε

[0036] Step 2) Set the initial clustering criterion function value J 0 = 0, the initial abnormality Abn of each data point x in the data set x =0;

[0037] Step 3) Divide the data objects into k evenly 1 (k 1 >k) subsets, randomly select a data object from each subset, and use it as the clustering seed center, scan the data set, according to its similarity with each cluster center (weighted Euclidean distance), Group it into its most similar cluster, forming k 1 Initial clusters;

[0038] Step 4) Calculate k 1 Σ of clusters i , And...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an anomaly detection method based on information entropy clustering, which belongs to the field of machine learning and data mining. The anomaly detection algorithm of the invention is based on the idea of the clustering algorithm, and overcomes the shortcomings of the traditional K. Means clustering algorithm randomly selects the initial clustering centers, which easily leads to the clustering results into the local optimal problem. A method based on information entropy to select the initial clustering centers is proposed. The invention provides a method for dividing adata set into data blocks with more than K value, Then the entropy method is used to get the target value function of each data block, and the centroid corresponding to the first k data blocks with the smallest value of the target value function is selected as the initial clustering center. The entropy method is used to ensure the efficiency of selecting the initial clustering center, and the function of anomaly detection is realized in the iterative process of the algorithm. Compared with the traditional K-based clustering algorithm of means, the clustering effect and the anomaly detection ability of the algorithm proposed by the invention are higher than those of the traditional K. Means clustering algorithm. It has certain practical significance.

Description

Technical field [0001] The invention relates to the technical field of machine learning and data mining, in particular to an anomaly detection algorithm based on information entropy clustering. Background technique [0002] With the rapid development of information technology in modern times, some special data that are different from most data in many fields have received widespread attention. These special data are called abnormal data. Anomalies are data that are unique in the data set, which makes people suspect that these data are not biases, but are caused by completely different mechanisms. Commonly used methods of anomaly detection include: anomaly detection method based on statistics, anomaly detection method based on data flow algorithm, and machine learning method based on unsupervised learning. The application of data mining and machine learning in anomaly detection has received widespread attention. Data mining refers to the process of searching for hidden informati...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62
CPCG06F18/23213
Inventor 方锡谭文安赵璐
Owner NANJING UNIV OF AERONAUTICS & ASTRONAUTICS
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products