Supercharge Your Innovation With Domain-Expert AI Agents!

Clustering-based anomaly detection method for a large-scale irregular KPI time sequence

An anomaly detection, time series technology, applied in character and pattern recognition, instruments, computing, etc., can solve the problems of low accuracy of anomaly detection results, affecting clustering effect, confusing information, etc.

Active Publication Date: 2021-09-10
CHANGSHA UNIVERSITY OF SCIENCE AND TECHNOLOGY
View PDF5 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, truncation or interpolation greatly affects the clustering effect
Although MF (matrices fraction, matrix decomposition) can be used to align irregular KPIs, MF confuses all KPI information, reduces the differences between classes in KPI, and the clustering error rate is higher than truncation, resulting in the final The accuracy of anomaly detection results is very low

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Clustering-based anomaly detection method for a large-scale irregular KPI time sequence
  • Clustering-based anomaly detection method for a large-scale irregular KPI time sequence
  • Clustering-based anomaly detection method for a large-scale irregular KPI time sequence

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0045] In order to solve the above problems, the embodiment of the present invention provides a cluster-based large-scale irregular KPI time series anomaly detection method, refer to figure 1 , including the following steps:

[0046] Step S1: multiple irregular KPIs are preprocessed, and a regular KPI filling matrix is ​​synthesized.

[0047] refer to figure 2 , the irregular key performance indicators (key performance indicator, KPI) generated under different sampling strategies can be mainly divided into four types: equal interval and unequal amount, unequal interval, equal duration and unequal interval, and segmental irregularity. Equal intervals use the same sampling frequency but the number of samples is different; the intervals between the samples are different, so the time represented by each sample point is not necessarily the same; Sampling at intervals; segment irregularity is the use of different sampling frequencies at different times.

[0048] For different irre...

Embodiment 2

[0062] In order to verify the impact of irregular KPIs on clustering performance, the present invention uses irregular sampling methods to sample irregular KPIs in the existing data sets to obtain irregular KPIs, and cluster and anomaly them respectively. Detection analysis. The present invention samples three public time series data sets irregularly from the UCR time series archive. The general similarity measure of KNN cannot handle KPIs of different lengths, so longer KPIs are truncated and aligned to shorter KPIs to achieve the same length. After the distance measurement, the clustering performance is detected according to the clustering mode of KNN. KNN's clustering mode description: for the currently clustered KPI calculate x i (t i ) to the distance of all KPIs in the training set, and then count the distance x i (t i ) among the K KPIs with the shortest distance, the cluster that belongs to the most clusters is used as x i (t i ) clustering.

[0063] The impa...

Embodiment 3

[0100] In order to better illustrate the innovation of the present invention, the present invention uses two real data sets from the KPI anomaly detection competition in the AIOPS challenge. These two real-world datasets contain service KPI and device KPI data from the same INTERNET WEB service system. DS1 is from the preliminaries and DS2 is from the finals.

[0101] These data sets consist of a large number of time series of different types. In order to make these data applicable to the invention of this paper, the present invention needs to artificially introduce irregular types and abnormalities. The experiment conducted in this chapter is the invention of MF and K-MEANS cyclic iterative clustering performance. The irregular data used is UIEQ KPI. The irregular import method of this type of KPI is as follows: by selecting a sampling interval for each KPI, and then The segment KPI is sampled according to its interval to obtain the generated data. The interval ratio select...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a clustering-based anomaly detection method for a large-scale irregular KPI (Key Performance Indicator) time sequence, which comprises the following steps of: preprocessing a large-scale irregular KPI, forming a regular KPI filling matrix containing all KPI elements by using a plurality of KPIs with different lengths, and clustering the regular KPI filling matrix to obtain a plurality of sub-clusters; if the distance between the clustering center points is not smaller than a preset threshold value and the number of iterations is not larger than a preset threshold value, carrying out iteration, carrying out MF filling on the KPI in each sub-cluster to generate a sub-rule KPI filling matrix, using all the sub-rule KPI filling matrixes to synthesize a rule KPI filling matrix to serve as input of next clustering, achieving a circulation system of clustering, filling, clustering and filling; and if the distance between the clustering center points is smaller than a preset threshold value or the number of iterations is larger than a preset threshold value, considering that the whole loop iteration is completed. The large-scale irregular KPI is divided into a plurality of clustering clusters, and the same anomaly detection model is applied to each clustering cluster, so that the overhead can be effectively reduced, and high-efficiency anomaly detection is realized.

Description

technical field [0001] The invention relates to the technical field of computer network operation and maintenance, in particular to a cluster-based large-scale irregular KPI time series anomaly detection method. Background technique [0002] In the environment of large-scale computers and communication networks, in order to ensure reliable and efficient services to a large number of users, the operation and maintenance personnel of Internet services usually use some key performance indicators to monitor the service performance of these applications. For example, the number of times an application service is accessed per unit time, the transaction volume per unit time, flashbacks, network bandwidth, memory capacity, etc. These indicators are called KPI indicators (key performance indicator, KPI). [0003] KPI can be divided into two categories: regular KPI and irregular KPI. Among them, for regular KPIs, existing anomaly detection models have good detection results, but for ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06K9/62
CPCG06F18/23G06F18/214
Inventor 何施茗李卓宙王进徐超熊兵邝利丹
Owner CHANGSHA UNIVERSITY OF SCIENCE AND TECHNOLOGY
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More