System fault management method, device and system

A system fault management method, applied in transmission systems, digital transmission systems, data exchange networks, and similar fields, that addresses problems such as lost orders, lengthy recovery times, and adverse social impact.

Pending Publication Date: 2020-10-27
上海燕汐软件信息科技有限公司

AI Technical Summary

Problems solved by technology

In the conventional online fault management process, the full chain from fault identification to fault recovery takes too long, and if the root cause of the fault cannot be identified and repaired quickly, the total fault duration risks multiplying.
Business interruption caused by a system failure is often unacceptable for an enterprise: it may mean the loss of a large number of orders or of customers, and in extreme cases it can cause adverse social impact.


Examples

Embodiment 1

[0067] As shown in Figure 1, this embodiment provides a system fault management method that includes at least the following steps:

[0068] S1. Identify the system fault according to the received fault prompt information and trigger a fault work order of the corresponding dimension.

[0069] The fault prompt information includes at least one of multi-dimensional monitoring alarm information triggered by the alarm platform and manual alarm information.

[0070] In this embodiment, faults are divided into dimensions according to fault type. The alarm platform monitors the fault dimensions with a high trigger probability and sends monitoring alarm information for the corresponding dimension. When a user encounters a fault outside the dimensions covered by the alarm platform, the system fault is reported by generating manual alarm information.
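Step S1 can be sketched roughly as follows. This is a minimal illustration, not the patented implementation: the dimension names, the `FaultPrompt` and `WorkOrder` types, and the `trigger_work_order` function are all hypothetical, since the source does not enumerate the dimensions or define any data structures.

```python
from dataclasses import dataclass
from itertools import count

# Hypothetical dimensions watched by the alarm platform (not listed in the source).
MONITORED_DIMENSIONS = {"network", "database", "application"}

_order_ids = count(1)  # simple in-memory work order numbering


@dataclass
class FaultPrompt:
    source: str     # "platform" for a monitoring alarm, "manual" for a manual alarm
    dimension: str  # fault dimension the prompt belongs to
    message: str


@dataclass
class WorkOrder:
    order_id: int
    dimension: str
    source: str
    message: str


def trigger_work_order(prompt: FaultPrompt) -> WorkOrder:
    """Identify the fault from a prompt and open a work order in its dimension."""
    if prompt.source not in {"platform", "manual"}:
        raise ValueError(f"unknown prompt source: {prompt.source}")
    # Platform alarms only exist for monitored dimensions; anything else
    # is expected to arrive as a manual alarm.
    if prompt.source == "platform" and prompt.dimension not in MONITORED_DIMENSIONS:
        raise ValueError(f"platform does not monitor dimension: {prompt.dimension}")
    return WorkOrder(next(_order_ids), prompt.dimension, prompt.source, prompt.message)
```

Under these assumptions, a platform alarm for a monitored dimension and a manual alarm for any other fault both end up as work orders tagged with their dimension, which is what later steps use to decide who investigates.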

[0071] Triggering fault work orders through multi-dimensional monitoring can greatly impr...

Embodiment 2

[0122] To implement the system fault management method of Embodiment 1, this embodiment provides a corresponding system fault management device. As shown in Figure 3, the device includes at least:

[0123] Fault work order triggering module 1, used to identify system faults and trigger fault work orders of corresponding dimensions according to the received fault prompt information;

[0124] The fault location module 2 is used to generate parallel troubleshooting tasks in the corresponding dimension according to the fault work order and push them to the corresponding troubleshooting personnel, and locate the fault point according to the received troubleshooting results corresponding to each troubleshooting task;

[0125] The emergency plan module 3 is configured to search for a restoration plan that matches the fault point in the preset restoration plan matching relationship, and push the restoration plans to the fault handling personnel after they a...
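The parallel troubleshooting performed by the fault location module might look something like the sketch below. In the source, tasks are pushed to human troubleshooting personnel; here each investigator is modeled as a plain callable purely for illustration. The `Check` type and the `locate_fault_point` function are hypothetical names, not part of the described device.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Optional

# A check returns a description of the fault it found, or None if its area is healthy.
Check = Callable[[], Optional[str]]


def locate_fault_point(checks: dict[str, Check]) -> list[str]:
    """Run every troubleshooting task in parallel and collect reported fault points."""
    with ThreadPoolExecutor(max_workers=max(1, len(checks))) as pool:
        # Submit all tasks first so they run concurrently, then gather the results.
        futures = {name: pool.submit(check) for name, check in checks.items()}
        results = {name: future.result() for name, future in futures.items()}
    # Keep only the tasks that actually reported a fault.
    return [f"{name}: {finding}" for name, finding in results.items() if finding]
```

The point of running the tasks side by side is that the time to locate the fault is bounded by the slowest single investigation rather than by the sum of all of them.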

Embodiment 3

[0153] Corresponding to the above method and device, Embodiment 3 of the present application provides a computer system, including:

[0154] one or more processors; and

[0155] a memory associated with the one or more processors, the memory being used to store program instructions that, when read and executed by the one or more processors, perform the following operations:

[0156] Identifying a system fault and triggering a fault work order according to the received fault prompt information, where the fault prompt information includes at least one of multi-dimensional monitoring alarm information and manual alarm information;

[0157] Generate parallel troubleshooting tasks according to the fault work order and push them to the corresponding troubleshooting personnel, and locate the fault point according to the received troubleshooting results corresponding to each troubleshooting task;

[0158] Find the recovery plan that matches the failure point ...
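The recovery step described in the abstract (look up plans matching the fault point, let the fault handling personnel select one, then execute it) could be sketched as below. The `RECOVERY_PLANS` table, its keys and plan names, and the `find_recovery_plans` and `repair` functions are assumptions made for illustration; the source only states that a preset matching relationship exists.

```python
from typing import Callable, Optional

# Hypothetical preset matching relationship: fault point -> candidate recovery plans.
RECOVERY_PLANS: dict[str, list[str]] = {
    "db_connection_pool_exhausted": ["restart_connection_pool", "fail_over_to_replica"],
    "disk_full": ["purge_old_logs"],
}


def find_recovery_plans(fault_point: str) -> list[str]:
    """Return the preset plans matching a fault point (empty if none match)."""
    return list(RECOVERY_PLANS.get(fault_point, []))


def repair(
    fault_point: str,
    choose: Callable[[list[str]], str],
    execute: Callable[[str], None],
) -> Optional[str]:
    """Offer matching plans to a handler, then execute the plan the handler selects."""
    plans = find_recovery_plans(fault_point)
    if not plans:
        return None  # no preset plan; the fault needs manual handling
    selected = choose(plans)
    if selected not in plans:
        raise ValueError(f"selected plan was not offered: {selected}")
    execute(selected)
    return selected
```

Passing `choose` and `execute` in as callables keeps the human selection step explicit: nothing runs until a handler has picked one of the offered plans.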

Abstract

The invention discloses a system fault management method, device and system. The method comprises: identifying a system fault according to received fault prompt information and triggering a fault work order of the corresponding dimension; generating parallel troubleshooting tasks in the corresponding dimensions and pushing them to the corresponding troubleshooting personnel; locating a fault point according to the received troubleshooting results; searching for a recovery plan matched with the fault point and pushing the recovery plan to the fault handling personnel; and executing the recovery plan selected by the fault handling personnel so as to repair the system fault. By adopting a multi-dimensional, parallel troubleshooting mode, the method shortens troubleshooting time, improves troubleshooting accuracy, and improves fault management efficiency.

Description

Technical field

[0001] The invention relates to the technical field of information system operation and maintenance, in particular to a system fault management method, device and system.

Background

[0002] Online fault management of IT systems is particularly important in daily system operation and maintenance. It tests not only technical skill but also timeliness.

[0003] The online fault management process tests the ability of technical personnel and technical teams to respond, judge, and organize. When a sudden production failure occurs, quickly locating the problem, finding a recovery plan, and implementing that plan is not an easy task. In the conventional online fault management process, the full chain from fault identification to fault recovery takes too long, and if the root cause of the fault cannot be identified and repaired quickly, the total fault duration risks multiplying...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): H04L12/24
CPC: H04L41/06, H04L41/0631, H04L41/0677
Inventor: 何俊敏杨微易玉凤马兴孟波
Owner: 上海燕汐软件信息科技有限公司