
Multipath server system fault diagnosis device, system and method

A fault diagnosis technology for multi-channel server systems. It addresses the problems that existing diagnosis is difficult to generalize and to refine to the socket level, and that auxiliary circuits increase the difficulty of board-level design. Its effects are reduced implementation difficulty and cost and easier expansion.

Pending Publication Date: 2022-07-05
PHYTIUM TECH CO LTD

AI Technical Summary

Problems solved by technology

[0004] 1. Acquiring diagnostic information requires software support, which makes the diagnostic process depend heavily on the software system being in a normal state. Once the software program is abnormal, fault diagnosis of the entire system cannot be realized.
[0005] 2. Because the internal resources of each CPU in a multi-channel server are shared, even if the software diagnosis method detects a fault, it is difficult to locate the number of the socket where the fault occurred. In addition, limited by the physical channel used for information reporting, the fault information cannot be reported independently. That is, the software diagnosis method has difficulty refining fault diagnosis to socket granularity.
[0006] In the prior art there are few studies on fault diagnosis of multi-channel server systems. Fault diagnosis is usually realized by analyzing fault log information, no attention is paid to fault discovery in the CPU small system, and diagnosis cannot distinguish between different sockets.
In the prior art, fault discovery in the CPU small system has to rely on auxiliary external circuits. As shown in Figure 1, an external circuit is used to capture the fault signal and analyze the fault information, but this not only increases the difficulty of board-level design but also makes the system difficult to generalize.


Examples


Embodiment 1

[0043] As shown in Figure 2, the fault diagnosis device for the multi-channel server system in this embodiment includes a socket fault monitoring module 1 and a storage module 2 that are connected to each other. The socket fault monitoring module 1 monitors the information of the CPUs in each socket of the multi-channel server system and judges the type of the monitored information; when it judges the information to be fault information, it controls the storing of the fault information in the storage module 2. With the socket fault monitoring module 1 in place, the socket is the minimum range of fault monitoring. Fault information is collected within the socket by the CPU itself, that is, internally by the CPU, and is then stored uniformly in the storage module 2. This effectively reduces the scope of fault diagnosis to the socket level and realizes fault detection in a small CPU system without additional hardware circuits, which great...
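The relationship between the two modules in this embodiment can be illustrated with a small behavioral model. The sketch below is written in Python purely for illustration; the patent describes a hardware device, and every name here (`InfoType`, `CpuInfo`, `StorageModule`, `SocketFaultMonitor`, the `detail` field) is a hypothetical stand-in rather than anything taken from the source.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import Dict, List


class InfoType(Enum):
    """Assumed two-way classification of monitored CPU information."""
    NORMAL = "normal"
    FAULT = "fault"


@dataclass
class CpuInfo:
    socket_id: int        # socket that the reporting CPU belongs to
    cpu_id: int           # CPU index within that socket
    info_type: InfoType   # result of judging the monitored information
    detail: str = ""      # free-form payload (hypothetical field)


@dataclass
class StorageModule:
    """Stands in for storage module 2: fault records keyed by socket."""
    records: Dict[int, List[CpuInfo]] = field(default_factory=dict)

    def store(self, info: CpuInfo) -> None:
        self.records.setdefault(info.socket_id, []).append(info)


@dataclass
class SocketFaultMonitor:
    """Stands in for socket fault monitoring module 1."""
    storage: StorageModule

    def observe(self, info: CpuInfo) -> bool:
        """Judge the information type and store it only if it is a fault."""
        if info.info_type is InfoType.FAULT:
            self.storage.store(info)
            return True
        return False


storage = StorageModule()
monitor = SocketFaultMonitor(storage)
monitor.observe(CpuInfo(0, 0, InfoType.NORMAL))
monitor.observe(CpuInfo(1, 0, InfoType.FAULT, "uncorrectable error"))

# Sockets that have at least one stored fault record.
faulty_sockets = sorted(storage.records)
print(faulty_sockets)
```

Keying the stored records by socket is what gives the model socket-level granularity: reading the storage back identifies which socket raised a fault without consulting any external capture circuit.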

Embodiment 2

[0060] As shown in Figure 4, this embodiment is basically the same as Embodiment 1, except that the Socket fault monitoring module 1 includes a plurality of Socket fault monitoring units 101, and two or more Sockets are jointly connected to one Socket fault monitoring unit 101. That is, one Socket fault monitoring unit 101 monitors the CPU fault information inside two or more Sockets at the same time. The storage module 2 includes a plurality of storage units 201, and two or more Sockets are correspondingly connected to one storage unit 201. That is, one storage unit 201 jointly stores the CPU fault information of two or more Sockets.

[0061] Compared with Embodiment 1, this embodiment cannot monitor the faults of each Socket independently, but the number of Socket fault monitoring units 101 and storage units 201 can be reduced to further red...
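The trade-off in this embodiment, fewer units in exchange for coarser localization, can be made concrete with a short sketch. It is an illustrative Python model only: the in-order grouping rule, the function names `assign_units` and `candidate_sockets`, and the four-socket example are all assumptions, since the source does not say how Sockets are assigned to shared units.

```python
from collections import defaultdict
from typing import Dict, List


def assign_units(socket_ids: List[int], sockets_per_unit: int) -> Dict[int, int]:
    """Map each socket to a shared unit index, grouping sockets in order.

    The grouping rule is an assumption made for illustration.
    """
    if sockets_per_unit < 1:
        raise ValueError("sockets_per_unit must be at least 1")
    return {sid: i // sockets_per_unit for i, sid in enumerate(socket_ids)}


def candidate_sockets(unit_of: Dict[int, int], unit_id: int) -> List[int]:
    """Sockets that share a unit; a fault stored there could be from any of them."""
    return sorted(sid for sid, unit in unit_of.items() if unit == unit_id)


# Four sockets, two per shared monitoring/storage unit.
unit_of = assign_units([0, 1, 2, 3], sockets_per_unit=2)
storage_units = defaultdict(list)

# A fault raised somewhere in socket 3 lands in the unit that socket 3 shares.
storage_units[unit_of[3]].append("cpu fault")

unit_count = len(set(unit_of.values()))
suspects = candidate_sockets(unit_of, unit_of[3])
print(unit_count, suspects)
```

With `sockets_per_unit=1` the same model degenerates to one unit per Socket, which corresponds to the independent per-Socket monitoring of Embodiment 1.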

Embodiment 3

[0063] As shown in Figure 5, the fault diagnosis method of this embodiment is applied to a multi-channel server system. The multi-channel server system includes a plurality of Sockets, and each Socket includes more than one CPU. The multi-channel server system also includes a Socket fault monitoring module 1 and a storage module 2 that are connected to each other, and each Socket is connected to the Socket fault monitoring module 1. The steps of the method include:

[0064] Step S01. The Socket fault monitoring module 1 monitors the information of the CPUs in each Socket of the multi-channel server system and judges the type of the monitored information;

[0065] Step S02. When the Socket fault monitoring module 1 determines that the monitored information is fault information, it controls the storing of the monitored fault information in the preconfigured storage module 2.

[0066] As in the embodiment illustrated in Figure 2, ...
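Steps S01 and S02 can be read as a simple per-Socket loop. The following Python sketch is one possible reading, not the patent's implementation: the `(cpu_id, status_code)` reading format, the rule that a nonzero status code means a fault, and the names `is_fault` and `run_diagnosis` are all hypothetical.

```python
from typing import Dict, List, Tuple

# (cpu_id, status_code); the encoding is an assumption for illustration.
Reading = Tuple[int, int]


def is_fault(status_code: int) -> bool:
    """Judge the information type. Assumed rule: nonzero means fault."""
    return status_code != 0


def run_diagnosis(
    readings_by_socket: Dict[int, List[Reading]],
    storage: Dict[int, List[Reading]],
) -> List[int]:
    """Apply S01 and S02 to every socket; return sockets with new faults."""
    faulty: List[int] = []
    for socket_id in sorted(readings_by_socket):
        found = False
        for cpu_id, status in readings_by_socket[socket_id]:
            # Step S01: monitor the CPU information and judge its type.
            if is_fault(status):
                # Step S02: store fault information in the preconfigured storage.
                storage.setdefault(socket_id, []).append((cpu_id, status))
                found = True
        if found:
            faulty.append(socket_id)
    return faulty


readings = {0: [(0, 0), (1, 0)], 1: [(0, 0), (1, 7)]}
storage: Dict[int, List[Reading]] = {}
faulty = run_diagnosis(readings, storage)
print(faulty, storage)
```

In this example only Socket 1 reports a nonzero status, so it is the only Socket that ends up with an entry in the storage.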


Abstract

The invention provides a multipath server system fault diagnosis device, system and method. The device comprises a Socket fault monitoring module and a storage module which are connected with each other. The Socket fault monitoring module monitors the fault information of the CPU in each Socket of a multipath server system, and the monitored fault information is stored in the storage module. The invention is suitable for multipath server systems, realizes fault diagnosis at Socket granularity, and has the advantages of simple structure, low complexity and cost, and good expansibility.

Description

Technical field

[0001] The invention relates to the technical field of multi-channel server systems, and in particular to a fault diagnosis device, system and method for a multi-channel server system.

Background art

[0002] The internal structure of a multi-channel server system is relatively complex. The system contains multiple CPU chips, which are interconnected through an interconnection channel (FIT) for data interaction. The internal resources of each CPU are shared, and if cross-channel access occurs, the data has to be exchanged through FIT. Therefore, in a multi-channel server system, internal fault diagnosis is very important for ensuring stable and reliable operation of the system.

[0003] Most of the fault diagnosis methods in the prior art are aimed at single-channel systems and are usually implemented with a software diagnosis method, that is, a software program is used to detect whether there is a fault in th...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F11/07
CPC: G06F11/0721, G06F11/0787, G06F11/079
Inventor: 杨有桂陈才刘付东
Owner: PHYTIUM TECH CO LTD