Method, device and system for acquiring hardware performance data

A technology of hardware performance and performance, applied in the computer field, can solve the problems of sampling method error, unable to express information in the middle of the program, difficult to obtain performance data in a targeted manner, and achieve the effect of high degree of freedom and flexibility

Pending Publication Date: 2022-06-03
CAMBRICON TECH CO LTD
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, there are many drawbacks in obtaining performance data based on sampling
First of all, for code fragments with low execution frequency or short execution time, there may be errors in this sampling method
Secondly, such a sampling method cannot use the information expressed in the middle of the program during the compilation phase, and it is difficult to obtain the performance data of a specific segment of the program in a targeted manner, such as obtaining the performance data information of a specific function call

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method, device and system for acquiring hardware performance data
  • Method, device and system for acquiring hardware performance data
  • Method, device and system for acquiring hardware performance data

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0022] At least for the problem of extremely inefficient acquisition of hardware performance data in the prior art, the solution disclosed in this disclosure proposes to insert a performance reading instruction into the target code (or user program to be tested), and combine the target code and the performance reading Instructions are fetched and compiled together to obtain an executable program for accurately obtaining hardware performance data. In an implementation scenario, when the solution of the present disclosure is applied to a heterogeneous system composed of a host ("host") and a device ("device"), the above-mentioned executable program is obtained by executing a compiling process on the host side, and The executable program is executed on the device side, and the solution disclosed in the present disclosure can accurately obtain relevant hardware performance data when the target code is running on the device side. Thus, software developers can make a preliminary jud...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a method, equipment and a system for acquiring hardware performance data. The apparatus may be included in a computing processing device of a combined processing device, which may include one or more data processing devices. The combined processing device can also comprise an interface device and other processing devices. And the computing processing device interacts with other processing devices to jointly complete computing operation specified by a user. The combined processing device can further comprise a storage device, and the storage device is connected with the equipment and the other processing devices and used for storing data of the equipment and the other processing devices. By utilizing the scheme disclosed by the invention, the hardware performance data related to execution of the target code can be effectively obtained.

Description

technical field [0001] The present disclosure relates generally to the field of computing. More specifically, the present disclosure relates to methods, devices, compilers, heterogeneous systems, and computer-readable storage media for obtaining hardware performance data of object codes. Background technique [0002] In the process of software development, performance is usually one of the most concerned indicators. When implementing the same function, there are great differences in performance between different implementations. Therefore, in the process of programming, software developers often use performance analysis tools to view the performance data of the code during the actual operation of the hardware, such as cache misses ("cache miss") or branch misses ("branch miss") etc., as a reference for optimizing program performance. [0003] Currently, common performance analysis tools support sampling-based acquisition of hardware performance data. However, there are m...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F8/41G06F9/445
CPCG06F8/443G06F9/4451G06F9/445G06F8/41
Inventor 不公告发明人
Owner CAMBRICON TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products