Unlock instant, AI-driven research and patent intelligence for your innovation.

Algorithm model for intelligently identifying quantitative relationship and anomaly thereof in log data

An algorithm model and log technology, applied in digital data processing, character and pattern recognition, calculation, etc., can solve the problems of low accuracy of quantitative relationship, low efficiency, and no automatic generation of log fragments, etc., to improve efficiency and solve problems The effect of automatic division of questions

Pending Publication Date: 2022-05-13
北京云集智造科技有限公司
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] (1) First, it is necessary to provide multiple log fragments to generate the log mode count matrix, and there is no effective automatic generation of a reasonable log fragment from the original large log data.
[0005] (2) Using the method of singular value decomposition, the method of counting the matrix from the extracted log pattern is too inefficient. When the number of log patterns or samples is large, the required calculation time is unacceptable; the second is to dig out Quantitative relationships are less precise

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Algorithm model for intelligently identifying quantitative relationship and anomaly thereof in log data
  • Algorithm model for intelligently identifying quantitative relationship and anomaly thereof in log data
  • Algorithm model for intelligently identifying quantitative relationship and anomaly thereof in log data

Examples

Experimental program
Comparison scheme
Effect test

specific Embodiment

[0035] 1. The original log sample corresponding to the log mode represented by end 2019-05-21T11:36:09.984+0800I-[conn340768]end connection 10.120.117.126:29090(1972connections now open)

[0036] 2. The original log sample corresponding to the log mode represented by connect 2019-05-21T11:38:20.440+0800I NETWORK[thread2] connection accepted from 10.120.117.151:63824#341554(1819connections now open)

[0037] Reason for the exception: connect is the number of connections within one minute (one log mode), and end is the number of disconnected connections within one minute (another log mode). Therefore, the exception on 10.28 is caused by too many connections that are not closed in time Yes, this can also explain why the manual shutdown is required later.

[0038] Abnormal data: (each point in the graph represents the number of logs corresponding to this log mode in the corresponding 5-minute time window) as shown in Figure 1;

[0039] The historical data is as follows Figure 2...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an algorithm model for intelligently identifying a quantitative relationship and an anomaly thereof in log data. The algorithm model comprises training data preprocessing, log analysis, feature extraction, log quantitative relationship mining and anomaly detection. According to the method, an effective pruning method is provided, a complex multivariate log mode quantitative relation is converted into a binary log mode quantitative relation mining problem, and compared with an existing solution, the mining efficiency is greatly improved (from several hours to less than 10 minutes), and the algorithm effect is effectively improved. The problem of automatic division of the log fragments can be effectively solved by utilizing the fixed time window to divide the log fragments, and the window division in the time dimension is closer to the research that the log number corresponding to a log mode with a quantitative relationship in a short time meets the quantitative relationship of the log mode. Therefore, the windows in the time dimension instead of the windows in the number dimension are used.

Description

technical field [0001] The invention relates to the field of log anomaly detection, in particular to an algorithm model for intelligently identifying quantitative relationships and anomalies in log data. Background technique [0002] Log anomaly detection, that is, fault diagnosis based on log data, refers to analyzing log data generated during system operation by intelligent means to automatically discover system anomalies and diagnose system faults. Existing log anomaly detection systems are usually divided into three parts. The first part is log parsing, which converts unstructured logs into structured log patterns. Log text can be regarded as composed of constants and variables. Log parsing The purpose is to retain the constant part of the log text and replace the variable part with wildcards to generate a log pattern; the second part is feature extraction. After parsing the first part of the log, the original log text sequence can be converted into a log pattern sequenc...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F11/30G06K9/62
CPCG06F11/3072G06F18/23G06F18/214
Inventor 沈国鹏朱品燕
Owner 北京云集智造科技有限公司