A traceability method for gpu computing performance problems

A computing performance and performance technology, applied in computing, instrumentation, error detection/correction, etc., can solve problems such as complex GPU structure, impossibility to achieve full optimization, and difficulty in tracing the source of performance problems, so as to shorten the optimization cycle and improve accuracy Effect

Active Publication Date: 2019-11-26
BEIJING WUZI UNIVERSITY
View PDF6 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

With this execution mode, it is more difficult to use the behavior description method to analyze performance and trace the source of performance problems
[0004] In addition, calculating metrics with statistical meaning is currently the main method for internal analysis of kernel functions. However, the structure of the GPU is complex, and there are usually many metrics for computing performance, including memory operations, computing operations, branches, and synchronization operations. These are all related to performance. More or less relationship, if each item is optimized separately, the workload will be very large, these metrics are usually interrelated, and it is often impossible to achieve full optimization

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A traceability method for gpu computing performance problems
  • A traceability method for gpu computing performance problems
  • A traceability method for gpu computing performance problems

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0025] In order to make the gist of the present invention more obvious and understandable, the present invention will be further described below in conjunction with the accompanying drawings and examples. Many details and specific examples are set forth in the following description, and these examples are provided to enable a more thorough understanding of the present invention and to fully convey the present invention to those skilled in the art. Although the present invention can be implemented in many other ways different from this description, those skilled in the art can make corresponding promotions without violating the connotation of the present invention, so the present invention is not limited by the specific examples and specific accompanying drawings disclosed below. restricted.

[0026] see figure 1 , a source-tracing method for GPU computing performance problems, including the following steps:

[0027] (1) Using the CUDA task parallel model CTPM to describe the...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a tracing method for a GPU computing performance problem. The method comprises the following steps: (1) describing behavior characteristics of a CUDA program kernel function level by using a CUDA task parallel model CTPM, thereby identifying a performance low-efficiency behavior of the CUDA program kernel function level, and expanding kernel function level performance optimization work; and (2) for the internal data parallel part of the CUDA program kernel function, using a hierarchical iteration attribute reduction algorithm LIAR to identify the key metric, performing deep performance mining on the CUDA program, and optimizing the internal performance problem of the CUDA kernel function. According to the method, by analyzing the CUDA development operation environment and the programming model based on the GPU hardware characteristics, an effective performance problem tracing method is found, the accuracy of performance bottleneck positioning is improved, and thepurpose of shortening the optimization period is achieved.

Description

technical field [0001] The invention relates to the field of performance monitoring and analysis in the field of information technology, in particular to a source tracing method for GPU computing performance problems. Background technique [0002] In recent years, the development of high-performance computers is changing with each passing day. The traditional single computing structure is gradually replaced by a hybrid computing structure, in order to give full play to the strengths of various components and play an effective role in improving computing capabilities. In the hybrid structure, GPU plays an increasingly important role as the acceleration part of the current hybrid structure. At the same time, the mixed computing mode increases the complexity of programming. [0003] Currently, NVIDIA's GPU and its development and operating environment and programming model CUDA are the most mature and widely used. Usually, the CUDAkernel function-level program analysis method...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F11/34
CPCG06F11/3466
Inventor 丁毅周丽靳军唐恒亮
Owner BEIJING WUZI UNIVERSITY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products