Fault processing method of computer cluster system

A fault handling method and computer cluster technology, applied in transmission systems, digital transmission systems, electrical components, etc., can solve the problems of high cost, low work efficiency, increase the number and workload of maintenance personnel, and improve fault tolerance, The effect of reducing workload

Inactive Publication Date: 2014-02-26
EISOO SOFTWARE
View PDF4 Cites 58 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] For dealing with failures in the computer cluster system, the usual method is for maintenance personnel to enter the computer room to find the faulty machine among multiple nodes in the computer cluster system, then determine the cause of the machine's failure, and then perform maintenance work. When the number of nodes increases, it may be necessary Increase the number and workload of maintenance personnel, not only the cost is high, but also the work efficiency is very low

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Fault processing method of computer cluster system
  • Fault processing method of computer cluster system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0026] Aiming at the problems existing in the prior art, this application provides a fault handling method for a computer cluster system, which uses a message mechanism to report a fault in the computer cluster system, and handles the fault by a specific node, thereby realizing the fault without manual intervention. The automatic processing function of the computer cluster system failure ensures that the computer cluster system nodes can be used normally after failure, reduces the workload of maintenance personnel, and improves the fault tolerance of the computer cluster system.

[0027] The main design idea of ​​the technical solution of this application is: use message middleware and single-node monitoring program to form a monitoring network covering the nodes of the entire computer cluster system, and monitor the service status and network status of each node in real time. The monitoring program on the node reports the fault information to the management center for unified ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a fault processing method of a computer cluster system. The method comprises the following steps: (A) at least two nodes in the computer cluster system are selected and are set as management nodes which bear the fault processing and the management of the computer cluster system, one node in the management nodes is taken as a main node, and other nodes are taken as standby nodes, (B) a bottom monitoring service module of each node in the computer cluster system monitors the operation state of the node and software and hardware loads and judges whether a fault appears or not, and if so, the bottom monitoring service module notifies a message middleware service module to send a fault massage to a management center service module of the main node; and (C) the management center service module of the main node carries out fault processing according to the fault message. According to the technical scheme of the invention, in the condition that human intervention is not needed, the automatic processing function of the cluster computer system fault can be realized.

Description

technical field [0001] The present application relates to computer technology, in particular to a computer cluster system, and in particular to a method for troubleshooting a computer cluster system. Background technique [0002] With the advancement of information technology, both enterprises and other organizations are increasingly dependent on computer systems. With the rapid expansion of data volume, a single computer can no longer meet its needs, and the use of supercomputers will greatly increase the cost of computers. In this case, computer cluster technology emerged as the times require. [0003] A computer cluster system is connected by a group of loosely integrated computer software or hardware, and highly closely cooperates to complete computing work. Multiple computer devices forming a computer cluster system can be regarded as a computer logically. A single computer in a computer cluster system is usually called a node, and the computer cluster system can be c...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): H04L12/24
Inventor 陈浩赵亚萍
Owner EISOO SOFTWARE
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products