Distributed system fault root cause tracing method based on knowledge graph technology

A distributed system and knowledge graph technology, applied in the field of distributed system fault root cause tracing, can solve problems such as different and incomplete explanation of event-level fault trigger paths

Pending Publication Date: 2021-09-10
SOUTHEAST UNIV
View PDF0 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, these methods have two limitations: 1. They have not studied how to use the hidden explicit knowledge in historical data to guide the current root cause analysis; 2. They cannot fully explain the fault tr

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Distributed system fault root cause tracing method based on knowledge graph technology
  • Distributed system fault root cause tracing method based on knowledge graph technology
  • Distributed system fault root cause tracing method based on knowledge graph technology

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0052] The present invention will be further explained below in conjunction with the accompanying drawings and specific embodiments. It should be understood that the following specific embodiments are only used to illustrate the present invention and are not intended to limit the scope of the present invention.

[0053] The present invention is a distributed system root cause tracing method based on knowledge graph, which includes the following five steps:

[0054] Step 1): Starting from collecting historical fault data of large-scale distributed system Kubernetes, events are generated from these historical fault data. The events found in this process will be used in the construction of the subsequent fault propagation diagram. The detailed steps are as follows:

[0055] (1) Generate events from log data;

[0056] For log data, on the one hand, a clustering algorithm is used to discover common templates in the logs, and then the newly discovered templates are added to the tem...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a distributed system fault root cause tracing method based on a knowledge graph technology. The method is used for solving the problem that large distributed system fault root cause tracing is difficult. The distributed system fault root cause tracing task is used for finding out a root cause causing a system fault. According to the method, a fault knowledge graph is constructed for each type of faults in a distributed system, events are generated from historical fault data by using a template technology in the construction process, then a fault propagation graph is constructed by using a relationship between learning events of a machine learning model, and finally, a merging algorithm is used to extract a common structure of the fault propagation graphs of the same type of faults to generate a fault knowledge graph. When a fault occurs, the fault knowledge graph most similar to the real-time fault propagation graph is obtained by constructing and calculating the real-time fault propagation graph and the similarity between the real-time fault propagation graph and the fault knowledge graph, so that the root cause of the system fault is obtained according to the fault root cause marked by the fault knowledge graph.

Description

technical field [0001] The invention belongs to the field of knowledge graphs, and in particular relates to a root cause tracing method for distributed system faults based on knowledge graph technology. Background technique [0002] With the rapid development of virtualization technology, distributed systems are becoming larger and more complex. Due to the complex network topology of the distributed system, the urgent time to repair faults, and the scarcity of high-level distributed system operation and maintenance personnel, when the system fails, it is difficult for the operation and maintenance personnel to find the root cause in a short time, and the system will be in an unstable state. stable state. It is an urgent problem to find out the root cause of large-scale distributed system failures in time and ensure the safe and stable operation of the system. In recent years, more and more researchers have begun to pay attention to these problems. The methods based on depe...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F11/07G06K9/62G06N5/02
CPCG06F11/079G06F11/0709G06N5/02G06F18/24
Inventor 吴天星罗安源漆桂林方苏东
Owner SOUTHEAST UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products