Unlock instant, AI-driven research and patent intelligence for your innovation.
A method for generating KPI curves and labeling band features based on log event relationships
What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
An event relationship and log technology, applied in the field of artificial intelligence, can solve problems such as time-consuming and labor-intensive, too academic, and unable to solve challenges at the same time.
Active Publication Date: 2022-06-24
THREE GORGES INTELLIGENT CONTROL TECH CO LTD
View PDF3 Cites 0 Cited by
Summary
Abstract
Description
Claims
Application Information
AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology
Problems solved by technology
[0004] Real-time anomaly detection by setting thresholds on KPI data is very common, but real-time anomaly detection for system logs has not been publicly reported
[0005] In order to pursue effectiveness, traditional machine learning mostly adopts supervised learning methods. In practice, it is difficult to obtain abnormal labels in batches. The accuracy of model output is improved through massive labeled data samples. Therefore, a large number of business experts are required to manually label KPI curves, which often requires Repeated adjustments and corrections are time-consuming and labor-intensive. In practice, it may be necessary to start monitoring millions or tens of millions of KPIs at the same time. Therefore, it is often impossible to find a certain algorithm that can meet the above requirements at the same time in the actual anomaly detection practice. Unable to solve the above challenges at the same time; non-supervised learning commonly used clustering and other technologies are mainly used in feature discovery, data exploration and other scenarios, because of the lack of annotation, the results need to be interpreted by data scientists to be abstractly mapped to business models, and cannot be directly Functional results; in the specific implementation of weak supervision, because of the introduction of non-supervised / supervised methods in stages, the accuracy of circular recursion is too academic, and it is difficult to implement. On the other hand, in order to integrate specific methods, vector expressions need to be used to unify different The representation between methods, the result is not easy for the application personnel to understand
Method used
the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more
Image
Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
Click on the blue label to locate the original text in one second.
Reading with bidirectional positioning of images and text.
Smart Image
Examples
Experimental program
Comparison scheme
Effect test
Embodiment 1
[0068] A method for generating KPIs based on log keyword clustering, the steps of which include:
[0069] R1 collects the fault logs obtained by the industrial control equipment in the industrial control system network of the same power station based on monitoring indicators, constructs event tuples based on the fault logs, and processes the fault logs with the snowball algorithm to construct event relationships.
[0070] Methods for building event tuples:
[0071] F1 sets a training sentence set composed of training sentences, extracts corpus from the fault log and forms a pair of sentences to be processed with each training sentence, and performs word segmentation on the sentences in the sentence pair based on the pre-built corpus. The pre-built corpus includes Industry corpus and general corpus;
[0072] F2 converts each feature word of the sentence after word segmentation into word vector, and uses the cosine similarity to calculate the similarity of each senten...
Embodiment 2
[0106] The method for marking band features based on the log KPI curve obtained in Embodiment 1 includes the following steps:
[0107] Step A1. Extract the data point set of each minute in all log KPI curves into the same curve set L, and divide the curve set L into several log KPI curve data sets with a time width of s minutes. M i , i is the segment number;
[0108] Step A2. Use the dbscan algorithm to calculate the Euclidean distance between each segment of the data set according to the attributes of each segment of the log KPI curve data set, cluster the log KPI curve data set of the i segment, and obtain k clusters and abnormal items, Each cluster is a grouped dataset, and each grouped dataset has j segments of log KPI curve dataset F j ;
[0109] Step A3. Calculate the arithmetic mean of the j-segment log KPI curve data set in each grouped data set, Σ F j / j , as the fundamental wave of the group;
[0110] Step A4. Use the NCC algorithm to calcul...
the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More
PUM
Login to View More
Abstract
The present invention discloses a method for generating KPI curves based on log event relationships and marking band characteristics. First, log KPI curves are generated according to the relationship of events in logs, and then the KPI curves are divided into several equal-length bands, which are aggregated according to the non-time dimension of the bands. Classify into multiple clusters, extract the fundamental wave of each cluster, compare the similarity between each band data of each cluster and the fundamental wave, find out the grouping boundary line of each cluster, group each band data of each cluster, and extract the continuous The total time length of similar bands, take the maximum value of the total time length as the sliding window width. This window is used to divide the KPI curve, so that the bands in each divided window are easy to cluster and classify, which is beneficial to quickly convert the entire KPI curve into a band chain composed of different types of bands, and then cycle the KPI curves of individual monitoring indicators Detection and type detection mark labels, and then use this window to segment a separate KPI curve, and use the bands in the fundamental KPI curve for grouping and labeling.
Description
technical field [0001] The invention relates to the technical field of artificial intelligence, in particular to a method for generating KPI curves and marking band features based on log event relationships. Background technique [0002] Outlier detection (also known as outlier detection) is a detection process to find objects that behave differently than expected, and these objects are called outliers or outliers. Anomaly detection methods usually include statistical-based models, distance-based models, linear transformation models, nonlinear transformation models, and machine learning models. [0003] KPI (key performance indicators) refers to the monitoring indicators of services, systems and other objects (such as latency, throughput, etc. in the network). Its storage form is a sequence arranged in the order of time of its occurrence, which is what we usually call a time series. Anomaly detection of time series is to check whether the current data deviates significantl...
Claims
the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More
Application Information
Patent Timeline
Application Date:The date an application was filed.
Publication Date:The date a patent or application was officially published.
First Publication Date:The earliest publication date of a patent with the same application number.
Issue Date:Publication date of the patent grant document.
PCT Entry Date:The Entry date of PCT National Phase.
Estimated Expiry Date:The statutory expiry date of a patent right according to the Patent Law, and it is the longest term of protection that the patent right can achieve without the termination of the patent right due to other reasons(Term extension factor has been taken into account ).
Invalid Date:Actual expiry date is based on effective date or publication date of legal transaction data of invalid patent.