GPU fault diagnosis system, diagnosis method, equipment and readable storage medium
A fault diagnosis system and diagnosis method applied in the field of GPU fault diagnosis. The technology addresses problems such as low fault diagnosis accuracy, difficult fault location, and incomplete log collection, with the effect of improving the efficiency and accuracy of fault diagnosis and reducing technical requirements.
Embodiment Construction
[0044] The present invention will be further described below in conjunction with the accompanying drawings and embodiments.
[0045] This embodiment provides a GPU fault diagnosis system for x86 servers (referred to in this embodiment as AI EasyCfg, short for Artificial Intelligence Easy Configure). It is suitable for NVIDIA GPU status detection and functional testing, and can improve the work efficiency of on-site engineers and the accuracy of GPU fault judgment. Its functions include user-friendly interaction, one-click log collection, fault log diagnosis, real-time GPU status detection and stress testing, and fault handling suggestions.
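The real-time GPU status detection function can be illustrated with a minimal Python sketch. It is not the implementation of this embodiment, which the text does not disclose. It assumes that the nvidia-smi command-line tool is installed and queries it in CSV form. The helper names parse_gpu_status, flag_suspect_gpus and query_gpu_status, and the 85 degree Celsius threshold, are hypothetical and chosen only for illustration.

```python
import csv
import io
import subprocess

# Properties requested from nvidia-smi via --query-gpu.
FIELDS = ["index", "name", "temperature.gpu", "utilization.gpu",
          "memory.used", "memory.total"]


def parse_gpu_status(csv_text):
    """Parse nvidia-smi CSV output (noheader, nounits) into a list of dicts."""
    rows = []
    for record in csv.reader(io.StringIO(csv_text), skipinitialspace=True):
        if not record:  # skip blank lines
            continue
        rows.append(dict(zip(FIELDS, (value.strip() for value in record))))
    return rows


def flag_suspect_gpus(rows, max_temp_c=85):
    """Hypothetical rule: flag a GPU if its temperature is unreadable
    or above max_temp_c (an assumed threshold, not from the source)."""
    suspects = []
    for row in rows:
        temp = row.get("temperature.gpu", "")
        if not temp.isdigit() or int(temp) > max_temp_c:
            suspects.append(row["index"])
    return suspects


def query_gpu_status():
    """Run nvidia-smi once and return the parsed status of every GPU."""
    cmd = ["nvidia-smi",
           "--query-gpu=" + ",".join(FIELDS),
           "--format=csv,noheader,nounits"]
    result = subprocess.run(cmd, capture_output=True, text=True, check=True)
    return parse_gpu_status(result.stdout)
```

On a server with the NVIDIA driver installed, flag_suspect_gpus(query_gpu_status()) would return the indices of GPUs that this simple rule considers suspect. Keeping the parsing separate from the subprocess call allows the same rule to be applied to previously collected log text.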
[0046] In this embodiment, as shown in Figure 1, the GPU fault diagnosis system is built on a Linux OS. The computer system adopts CUDA (Compute Unified Device Architecture), a computing platform launched by graphics card manufacturer NVIDIA. It is a general-purpose parallel computing architecture that enables the GPU to solve...