Performance evaluation method and system for GPU applications in CPU-GPU heterogeneous environment

A technology for evaluating system and performance, applied in the field of GPU performance evaluation, can solve the problems of incompetence, time-consuming, and difficult to use models, and achieve the effect of ensuring universality and good scalability.

Active Publication Date: 2018-04-13
HUAZHONG UNIV OF SCI & TECH
View PDF5 Cites 8 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] However, there are certain defects in the above method
Although the traditional performance analysis model has high accuracy, it requires a detailed understanding of hardware architecture knowledge. The methods of data acquisition and modeling are very complicated, often time-consuming, and the model is difficult to use; or for a specific Built for architecture or application, not universal
Although the method based on machine learning is simple and easy to use, its accuracy strongly depends on the training data set and the selection of feature values ​​by the model itself; and there are few studies on performance evaluation using this method, which is mainly used for performance prediction. Cannot reflect performance bottlenecks and guide application optimization

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Performance evaluation method and system for GPU applications in CPU-GPU heterogeneous environment
  • Performance evaluation method and system for GPU applications in CPU-GPU heterogeneous environment

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0043] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific implementations described here are only used to explain the present invention, not to limit the present invention. In addition, the technical features involved in the various embodiments of the present invention described below may be combined with each other as long as they do not constitute a conflict with each other.

[0044] Such as figure 1 As shown, the performance evaluation method for GPU applications in a CPU-GPU heterogeneous environment includes offline decision tree construction and online performance evaluation:

[0045] The offline decision tree construction part includes the following steps:

[0046] (S1) Extract multiple sample monitoring records of different GPU applications during the runnin...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a performance evaluation method and system for GPU applications in a CPU-GPU heterogeneous environment and belongs to the field of GPU performance evaluation. Specifically, themethod comprises the steps of learning performance condition of various applications operated on GPU architecture based on a decision-making tree algorithm in machine learning and establishing a decision-making tree model; acquiring monitoring characteristics of which influences on application performance time are the highest in sequence within a decision-making tree matching process, namely ordering the important degree of the characteristics; carrying out correspondence on screened characteristic sets and four common problems of the applications in sequence, wherein the four common problemsmainly indicate computing correlation, memory correlation, occupation rate correlation and synchronization correlation, thereby preliminarily obtaining problem directions in which performance bottlenecks of to-be-analyzed applications are located. According to the method and the system, through utilization of the method of combining the decision-making tree model with analysis modeling, the universal, relatively accurate, rapid, simple and easy-to-use method for carrying out the performance evaluation on resources and applications on a GPU is provided.

Description

technical field [0001] The invention belongs to the field of GPU performance evaluation, and more specifically relates to a method and system for performance evaluation of resources and applications on the GPU in combination with machine learning and analytical modeling in a CPU+GPU hybrid heterogeneous environment. Background technique [0002] With the continuous development of science and technology, all aspects put forward higher requirements for high-performance computing. GPU has powerful computing power, high memory bandwidth, low power consumption, and good programmability, but it does not handle logic well, making CPU-GPU heterogeneity an inevitable trend. However, although CPU-GPU heterogeneous computing nodes can achieve high performance, the actual performance is often not ideal, and computing resources and storage bandwidth cannot be effectively utilized. The reason for performance degradation lies in many aspects, including load imbalance caused by uneven task...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F11/34
CPCG06F11/3447G06F11/3476
Inventor 廖小飞郑然胡清月金海
Owner HUAZHONG UNIV OF SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products