High-availability monitoring and management device and redundant switching method for high-density blade server

A monitoring and management technology for blade servers, applied to hardware redundancy for data error detection, hardware monitoring, and instrumentation. It addresses problems such as fragmentation and system confusion, ensuring normal operation and maintenance and reducing operation and maintenance risk.

Active Publication Date: 2022-07-05
NAT UNIV OF DEFENSE TECH

AI Technical Summary

Problems solved by technology

In addition, if redundancy detection relies on a single heartbeat line, disconnection of that line causes both the master and the slave server to conclude that they should take over the service, so they compete for shared resources. The result is system chaos, known as the "split brain" phenomenon, which is also a problem that must be avoided during redundant switching.
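The failure mode described above can be illustrated with a minimal sketch. It is not taken from the patent: the `Node` class, the `on_tick` rule, and the timeout values are hypothetical, and the rule is deliberately naive so that the problem is visible.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass
class Node:
    """Hypothetical server with a single heartbeat line to its peer."""
    name: str
    active: bool
    last_heartbeat: float  # time at which the peer's heartbeat was last received

    def on_tick(self, now: float, timeout: float) -> None:
        # With one heartbeat line, silence is ambiguous: the peer may have
        # failed, or only the line may be cut. This naive rule assumes failure.
        if now - self.last_heartbeat > timeout:
            self.active = True


def simulate_line_cut(timeout: float = 3.0) -> Tuple[bool, bool]:
    """Cut the heartbeat line at t=0 while both nodes stay healthy."""
    master = Node("master", active=True, last_heartbeat=0.0)
    slave = Node("slave", active=False, last_heartbeat=0.0)
    for now in (1.0, 2.0, 3.0, 4.0):
        # No heartbeat arrives on either side because the line is cut.
        master.on_tick(now, timeout)
        slave.on_tick(now, timeout)
    return master.active, slave.active
```

Once the timeout has elapsed, `simulate_line_cut()` reports both nodes as active even though neither has failed, which is the split-brain condition.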



Embodiment Construction

[0035] As shown in Figure 1, the high-availability monitoring and management device for a high-density blade server in this embodiment includes two redundantly arranged chassis management units, a master CMU and a slave CMU, with two communication links between them. The two communication links are a first communication link for sending heartbeat messages containing equipment status information and a second communication link for sending remedial heartbeat messages containing equipment status information. Both the master CMU and the slave CMU have Ethernet interfaces for connecting to the BMUs in each computing blade and switching blade of the high-density blade server, as well as connecting terminals for connecting to the power modules and cooling modules of each chassis in the high-density blade server.
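The two-link arrangement can be sketched as follows. This is an illustration under stated assumptions, not the switching method claimed in the patent: the `Heartbeat` fields, the `PeerMonitor` class, the timeout, and the takeover rule are all hypothetical, and the equipment status is reduced to a single `healthy` flag.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import Dict


class Link(Enum):
    PRIMARY = "heartbeat"            # first communication link
    REMEDIAL = "remedial_heartbeat"  # second communication link


@dataclass
class Heartbeat:
    sender: str
    healthy: bool   # hypothetical one-bit summary of the equipment status
    sent_at: float


@dataclass
class PeerMonitor:
    """Tracks the most recent message received from the peer on each link."""
    timeout: float
    last_seen: Dict[Link, Heartbeat] = field(default_factory=dict)

    def receive(self, link: Link, msg: Heartbeat) -> None:
        self.last_seen[link] = msg

    def should_take_over(self, now: float) -> bool:
        fresh = [m for m in self.last_seen.values()
                 if now - m.sent_at <= self.timeout]
        if not fresh:
            # Assumption: silence on BOTH links is treated as peer failure.
            return True
        # At least one link still delivers: trust the newest reported status.
        latest = max(fresh, key=lambda m: m.sent_at)
        return not latest.healthy
```

For example, if the slave's monitor stops receiving on `Link.PRIMARY` but still receives a healthy message on `Link.REMEDIAL`, `should_take_over` returns `False`, so a cut in one link alone does not trigger a switch.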

[0036] As shown in Figure 1, both the master CMU and the slave CMU include a shelf switching module 1, a shelf management...


Abstract

The invention discloses a high-availability monitoring and management device and a redundant switching method for high-density blade servers. The device includes two redundantly arranged chassis management units, a master CMU and a slave CMU, with two communication links between them: a first communication link for sending heartbeat messages containing equipment status information, and a second communication link for sending remedial heartbeat messages containing equipment status information. The invention ensures that all components can be remotely monitored and managed, ensures normal operation and maintenance of high-density blade servers, greatly reduces operation and maintenance risk, and resolves the "split brain" phenomenon. Because equipment status is transmitted in both the heartbeat messages and the remedial heartbeat messages, the status of the master CMU and the slave CMU can be judged comprehensively, avoiding both failing to switch when a switch is needed and switching when none is needed. This greatly improves the availability of the monitoring and management system for high-density blade servers.

Description

Technical field

[0001] The invention relates to high-availability technology for servers, in particular to a high-availability monitoring and management device and a redundancy switching method for high-density blade servers.

Background

[0002] A supercomputing center or data center generally deploys a large number of high-density blade servers. The chassis of each high-density blade server contains dozens of computing motherboards, several switching motherboards (service data network), one monitoring motherboard, and several chassis components such as power modules and cooling modules (fans). Computing motherboards and switching motherboards usually integrate a board-level management unit (BMU, Base Management Unit), in the form of a daughter card, to implement board monitoring and management. The monitoring motherboard acts as a chassis management unit (CMU, Chassis Management Unit) to collect BMU monitoring and management information and impleme...


Application Information

Patent Type & Authority: Patents (China)
IPC(8): G06F11/30, G06F11/16
CPC: G06F11/3006, G06F11/16, Y02D30/50
Inventors: 袁远, 邢建英, 李世杰, 王俊, 蒋句平, 黎铁军, 宋振龙, 李琼, 魏登萍, 谢徐超, 任静
Owner: NAT UNIV OF DEFENSE TECH