PCIE link stability test method and device, computer equipment and medium

A technology of stability testing and computer program, applied in fault hardware testing method, detection of faulty computer hardware, calculation, etc., can solve problems such as the inability to accurately reflect the overall performance of PCIE link, and extend data transmission time and improve data The effect of transmission efficiency

Pending Publication Date: 2021-02-26
宁畅信息产业(北京)有限公司
View PDF5 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] However, the above test results for each individual link cannot accurately reflect the overall performance of the PCIE link

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • PCIE link stability test method and device, computer equipment and medium
  • PCIE link stability test method and device, computer equipment and medium
  • PCIE link stability test method and device, computer equipment and medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0062] In order to make the purpose, technical solution and advantages of the present application clearer, the present application will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present application, and are not intended to limit the present application.

[0063] In deep learning training, multiple GPU parallel training methods are often used for network training. Specifically, the same deep learning network is arranged on each GPU, and then the CPU is connected to multiple GPUs, and the parallel training of multiple GPUs is called and controlled by the CPU. Its training process includes: each GPU sends its training iteration result of each round to the CPU, and then sends it to other GPUs through the CPU to realize multi-GPU data synchronization. However, with the increasing complexity of deep learning networks, more and m...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a PCIE link stability test method and device, computer equipment and a storage medium, and relates to the technical field of computer application. The method comprises the following steps: constructing a GPU communication ring according to a topological structure of a GPU in a PCIE link to be tested; sending a test instruction to a target GPU in the GPU communication ring,controlling the target GPU to generate a test data block according to the size of the test data block in the test instruction, and controlling each GPU included in the GPU communication ring to sequentially transmit the test data block; and in the process of transmitting the test data block, obtaining state data of each GPU included in the GPU communication ring, and determining a test result ofthe PCIE link to be tested according to each state data. According to the embodiment of the invention, all communication nodes on the to-be-tested PCIE link are connected based on the GPU communication ring, and the GPU included in the GPU communication ring is controlled to sequentially transmit the test data blocks to achieve overall pressurization of the to-be-tested PCIE link, so that the overall performance of the PCIE link can be tested, and the test result can better reflect the overall performance of the to-be-tested PCIE link.

Description

technical field [0001] The present application relates to the field of computer application technology, in particular to a stability test method, device, computer equipment and media of a PCIE link. Background technique [0002] In deep learning training, multiple GPU parallel training methods are often used for network training. As the complexity of deep learning networks increases, more and more GPUs are required for network training. However, due to the limited number of channels on the CPU, the number of GPUs that can be directly connected to the CPU is limited. For this reason, a solution of connecting a PCIE switch chip to the CPU and then connecting multiple GPUs to the PCIE switch chip is currently proposed. Among them, a link composed of a CPU, a PCIE switch chip, and multiple GPUs is called a PCIE link. In practical applications, after the PCIE link is constructed, it is necessary to perform a stress test on the PCIE link to detect the performance of the PCIE li...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F11/22
CPCG06F11/221G06F11/2273
Inventor 许飞魏冰清王永懿张珅秦晓宁
Owner 宁畅信息产业(北京)有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products