
PCIe error self-repairing method, device and apparatus and readable storage medium

A self-repair technology for PCIe equipment, applied in the fields of non-redundancy-based fault handling and error response generation. It addresses problems such as obstructed system operation, increased cost, and PCIe error reporting, with the effects of optimizing system operation, achieving fault self-repair, and reducing implementation costs.

Active Publication Date: 2021-07-27
SHANDONG YINGXIN COMP TECH CO LTD

AI Technical Summary

Problems solved by technology

[0003] During use, long-term operation and aging of devices, together with the combined influence of multiple devices on a complex PCIe link, can cause PCIe errors.
At present, a server system provides an automatic error-reporting mechanism once it is running. When a PCIe error occurs, some minor problems can be repaired automatically, but most cannot. These block system operation and may even cause the machine to shut down or restart, so that the equipment cannot operate normally. Moreover, most PCIe errors are diagnosed and resolved manually by operation and maintenance personnel, which consumes their time and increases the cost of equipment replacement.



Examples


Embodiment Construction

[0052] The core of the present invention is to provide a PCIe fault self-recovery method, which can reduce the impact of PCIe error reporting on system operation and reduce PCIe operation and maintenance costs.

[0053] To enable those skilled in the art to better understand the solution of the present invention, the invention is described in further detail below with reference to the accompanying drawings and specific embodiments. Obviously, the described embodiments are only some of the embodiments of the present invention, not all of them. All other embodiments obtained by persons of ordinary skill in the art based on the embodiments of the present invention, without creative effort, fall within the protection scope of the present invention.

[0054] Please refer to Figure 1, which is a flow chart of a PCIe fault self-repair method in an embodiment of the present invention. The method comprises the following steps:

[0055] S101. ...


Abstract

The invention discloses a PCIe error self-repairing method. While a PCIe link in a system is running, the running state of the system is monitored by acquiring the number of CE (correctable error) reports and the number of UCE (uncorrectable error) reports on the PCIe link. If the number of CE reports or the number of UCE reports reaches its corresponding reporting threshold, the error-reporting device is removed from the system, which avoids the adverse effect of that device on the continued operation of the system. The SI parameter register of the error-reporting device is then modified according to pre-stored adjustable parameters of all PCIe devices in the server, so that the SI parameters of the device are optimized automatically. Once the PCIe error has been repaired, the device is reconnected to the system, achieving error self-repair. This reduces the implementation cost incurred when operation and maintenance personnel and server customer service personnel take part in equipment replacement. The invention further discloses a PCIe error self-repairing device and apparatus and a readable storage medium, which have corresponding technical effects.
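The control flow described in the abstract can be illustrated with a small simulation. This is only a sketch under stated assumptions, not the patented implementation: the threshold values, the `PcieDevice` class, the `tx_preset` parameter name, and the reset of the counters after repair are all hypothetical, and no real hardware registers are touched.

```python
from dataclasses import dataclass, field
from typing import Dict

# Hypothetical thresholds; the source does not state concrete values.
CE_THRESHOLD = 10
UCE_THRESHOLD = 1


@dataclass
class PcieDevice:
    """In-memory stand-in for a PCIe device (hypothetical model)."""
    name: str
    ce_count: int = 0    # number of CE reports seen so far
    uce_count: int = 0   # number of UCE reports seen so far
    si_params: Dict[str, int] = field(default_factory=dict)
    attached: bool = True


def needs_repair(dev: PcieDevice) -> bool:
    """True when either report count has reached its threshold."""
    return dev.ce_count >= CE_THRESHOLD or dev.uce_count >= UCE_THRESHOLD


def self_repair(dev: PcieDevice,
                adjustable_params: Dict[str, Dict[str, int]]) -> bool:
    """Remove the device, apply pre-stored SI parameters, re-attach it.

    Returns True if a repair cycle was run, False if no threshold was hit.
    """
    if not needs_repair(dev):
        return False
    dev.attached = False                                   # remove from the system
    dev.si_params.update(adjustable_params.get(dev.name, {}))  # modify SI parameters
    dev.ce_count = 0                                       # assumption: counters restart
    dev.uce_count = 0
    dev.attached = True                                    # reconnect to the system
    return True


if __name__ == "__main__":
    nic = PcieDevice("nic0", ce_count=10, si_params={"tx_preset": 4})
    stored = {"nic0": {"tx_preset": 7}}  # hypothetical pre-stored parameters
    print(self_repair(nic, stored), nic.si_params, nic.attached)
```

In a real system the removal, register write, and re-attachment would go through platform-specific mechanisms that the abstract does not detail; the sketch only shows the order of the steps.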

Description

Technical field

[0001] The present invention relates to the technical field of equipment operation and maintenance, in particular to a PCIe fault self-recovery method, device, equipment and readable storage medium.

Background technique

[0002] PCIe (Peripheral Component Interconnect Express, a high-speed serial computer expansion bus standard) devices are an indispensable part of a server. The performance, computation, and functions of the server all depend on PCIe devices, which play an important role in computing (such as GPU, FPGA), storage (such as SAS HBA, NVMe SSD), networking (NIC), and so on.

[0003] During use, due to the long-term operation and aging of the device, and the joint influence of multiple devices in the complex PCIe link, PCIe errors may occur. At present, after the system is running, the server system will have an automatic error reporting mechanism. When a PCIe error occurs in the system, some minor pr...


Application Information

IPC(8): G06F11/07
CPC: G06F11/076, G06F11/0793, G06F11/0745, G06F2201/81
Inventor: 王培培
Owner: SHANDONG YINGXIN COMP TECH CO LTD