Check patentability & draft patents in minutes with Patsnap Eureka AI!

Estimating performance of GPU application for different GPU-link performance ratio

a technology of gpu-link and performance ratio, applied in the field of information processing, can solve problems such as difficult to obtain and determine, and achieve the effect of increasing gpu-link performance ratio and higher gpu-link performance ratio

Active Publication Date: 2019-10-24
INT BUSINESS MASCH CORP
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Benefits of technology

This approach allows for accurate estimation of GPU application performance on machines with increased GPU-link performance ratios without knowing the internal structure, enabling effective resource allocation and optimization for improved computational performance.

Problems solved by technology

Currently, performance estimation in such a situation needs information about how GPU computation and CPU-GPU communication depend on each other in the target GPU application, which is not so easy to obtain and determine.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Estimating performance of GPU application for different GPU-link performance ratio
  • Estimating performance of GPU application for different GPU-link performance ratio
  • Estimating performance of GPU application for different GPU-link performance ratio

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0017]The present invention is directed to estimating the performance of a Graphical Processing Unit (GPU) application for a different GPU-link performance ratio. The present invention can be used to estimate the performance of one or more GPU applications for one or more different GPU-link performance ratios.

[0018]As used herein, the term “GPU-link performance ratio” relates to the performance of GPU execution versus the performance of CPU-GPU communication link. Hence, for increasing GPU-link performance, the GPU execution performance increases relative to the CPU-GPU communication. In contrast, for decreasing GPU-link performance, the GPU performance decreases relative to the CPU-GPU communication.

[0019]Moreover, as used herein, the term “CPU-GPU communication” can refer to any of a communication from the CPU to the GPU and / or from the GPU to the CPU.

[0020]In an embodiment, the present invention can estimate the performance of a target GPU application for a higher GPU-link perfor...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

A computer-implemented method is provided for estimating the performance of a GPU application on a new computing machine having an increased GPU-link performance ratio relative to a current computing machine having a current GPU-link performance ratio. The method includes adding a delay to CPU-GPU communication on the current computing machine to simulate a delayed-communication environment on the current computing machine. The method further includes executing the target GPU application in the delayed-communication environment. The method also includes measuring the performance of the target GPU application in the delayed-communication environment. The method additionally includes estimating the performance of the new computing machine having the increased higher GPU-link performance ratio, based on the measured performance of the target GPU application in the delayed-communication environment.

Description

BACKGROUNDTechnical Field[0001]The present invention relates generally to information processing and, in particular, to estimating the performance of a Graphical Processing Unit (GPU) application for a different GPU-link performance ratio.Description of the Related Art[0002]Currently, a supercomputer node often includes a CPU and a GPU. On such computing machines, application execution mainly involves “GPU computation” and “CPU-GPU communication”. That is, before executing a kernel on a GPU, the data used by the kernel has to be transferred from CPU memory to GPU memory. After execution, the data produced by the kernel on the GPU is typically transferred back to the CPU memory. In the preceding scenario, the execution relates to “GPU computation”, while the transferring of data relates to “CPU-GPU communication”[0003]Performance estimation for future computing machines, especially for machines with faster GPUs, is very important both for application tuning and for sales. However, th...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(United States)
IPC IPC(8): G06T1/20G06F9/38
CPCG06T1/20G06F9/3881G06F11/3024G06F11/3433G06F11/3457G06F11/3447
Inventor KAWACHIYA, KIYOKUNINEGISHI, YASUSHIDOI, JUN
Owner INT BUSINESS MASCH CORP
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More