Computer IO HUNG event early warning method and device, equipment and medium

An early warning device and computer technology, applied in the field of computers, can solve problems such as difficult to locate faults, complicated operations, and difficulty in getting started, and achieve the effect of avoiding production accidents and achieving remarkable results.

Active Publication Date: 2021-03-12
中国农业银行股份有限公司福建省分行
View PDF4 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

When this kind of failure occurs, the operating system level of the machine has actually stopped working, and it is basically impossible to log in, but the machine is connected by pinging the machine at the network level, which makes the traditional monitoring system have the illusion that the machine is normal, so that The situation expanded and even caused production accidents
On March 3, 2019, an IO HUNG event occurred in the server of a North China cloud computing data center of a domestic IT giant. Due to the difficulty in locating the fault for a while, it affected the customer's various businesses for several hours, causing huge economic losses and reputational impact
[0005] Traditional BMC monitoring has some disadvantages: first, the software is huge and there are several CDs; second, the deployment is complicated, not only the server needs to install the console, configuration management, knowledge base module, forwarding service, etc., but also the client needs to install 680M software; The third is that the operation is complicated, there are many menus, and it is not easy to get started; the fourth is that the reliability is poor, and various client alarms often appear on the server, most of which are caused by the expiration of the password of the patrol user on the client. In this case, the patrol needs to be reconfigured on the server. The new password can continue to monitor, and there is a gap in monitoring during this outage period; fifth, the BMC client program consumes more system resources, and even affects the operation of other client applications; sixth, the copyright of BMC software is strict and expensive

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Computer IO HUNG event early warning method and device, equipment and medium
  • Computer IO HUNG event early warning method and device, equipment and medium
  • Computer IO HUNG event early warning method and device, equipment and medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0072] Such as Figure 4 As shown, the present embodiment provides an early warning method for a computer IO HUNG event, including:

[0073] Step 1. Deploy the collector on each virtual machine;

[0074] Step 2, the collection machine regularly collects the data on the virtual machine, and writes it into the monitoring message file;

[0075] Step 3, if writing to disk is successful, then send status information message; No, then do not send; Described status information message includes acquisition time, and described status information message also includes CPU usage idle rate and IO waiting time; Used for Analysis after the IO HUNG incident;

[0076] Step 4, the early warning machine regularly checks the message of the server; and reads the acquisition time from a message closest to the current time, compares it with the standard time of the machine, and if the difference reaches the set deviation value, it performs an early warning;

[0077] Step 5: The early warning mac...

Embodiment 2

[0080] Such as Figure 5 As shown, a kind of early warning device of computer IO HUNG event is provided in the present embodiment, comprises:

[0081] The deployment module deploys the collection machine on each virtual machine;

[0082] The acquisition module, the acquisition machine regularly collects the data on the virtual machine, and writes it into the monitoring message file;

[0083] Sending module, if the write disk is successful, then send status information message; Analysis after the IO HUNG incident;

[0084] In the early warning module, the early warning machine regularly checks the messages of the server; and reads the acquisition time from a message closest to the current time, compares it with the standard time of the machine, and if the difference reaches the set deviation value, an early warning will be issued.

[0085] In the matching module, the early warning machine matches the physical host corresponding to the virtual machine, and stores the early wa...

Embodiment 3

[0089] This embodiment provides an electronic device, including a memory, a processor, and a computer program stored in the memory and operable on the processor. When the processor executes the computer program, any implementation manner in Embodiment 1 can be implemented.

[0090] Since the electronic device introduced in this embodiment is the device used to implement the method in Embodiment 1 of this application, based on the method described in Embodiment 1 of this application, those skilled in the art can understand the electronic device of this embodiment. Specific implementation methods and various variations thereof, so how the electronic device implements the method in the embodiment of the present application will not be described in detail here. As long as a person skilled in the art implements the equipment used by the method in the embodiment of the present application, it all belongs to the protection scope of the present application.

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a computer IO HUNG event early warning method, device and equipment, and a medium. The method comprises the steps that collecting machines are deployed on all virtual machines;the acquisition machine acquires data on the virtual machine at regular time and writes the data into the monitoring message file; if the disk writing succeeds, a state information message is sent; ifnot, sending is not performed, wherein the state information message comprises acquisition time; the early warning aircraft regularly checks the message of the server, reads the acquisition time fromthe message closest to the current time, compares the acquisition time with the local standard time, and if the difference reaches a set deviation value, carries out early warning; the invention notonly is effective for known and unknown IO HUNG problems of various computers, but also is suitable for part of traditional faults, and is beneficial to discovering operating system level problems such as timing task failure, user password expiration and clock skew. Besides, through the message data collected in real time, a performance report of the client can be generated and used for regular analysis.

Description

technical field [0001] The present invention relates to the field of computer technology, in particular to an early warning method, device, equipment and medium for computer IO HUNG events. Background technique [0002] In today's era of turbulent IT technology, the general trend of cloud computing is unstoppable like the roaring Yangtze River. At present, various businesses of enterprises have increasingly strong demand for IT, and data centers are continuously integrated intensively. IT managers are increasingly aware of the operation and maintenance challenges brought about by new data centers, especially in the daily operation and maintenance of many failures. Most of the faults can be found by the conventional monitoring system, but some faults, such as IO HUNG causing the computer to "lose connection", are difficult to capture in time. IO HUNG is a kind of extremely weird fault, and it is a big problem in computer system monitoring, and traditional monitoring (such as...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F11/07G06F11/30
CPCG06F11/0766G06F11/079G06F11/0712G06F11/302G06F11/3089
Inventor 张松坚陈长钦杨超沈书航
Owner 中国农业银行股份有限公司福建省分行
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products