Performance data acquisition method for large-scale parallel program

A data acquisition and large-scale technology, applied in the direction of multi-program devices, etc., can solve the problems of increased conflict domain, reduced file writing efficiency, and decreased collection efficiency, so as to improve use efficiency, reduce conflict domain, and expand scalability strong effect

Inactive Publication Date: 2011-05-18
BEIHANG UNIV
View PDF2 Cites 24 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The advantage of this data acquisition model is that no communication protocol is required and the implementation method is simple. The disadvantage is that the architecture is not scalable, and its acquisition efficiency will decrease significantly with the increase in the number of computing nodes.
If the parallel program runs on a supercomputer with a large number of nodes, the disadvantage of this data acquisition model will appear: the parallel program needs to write ...

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Performance data acquisition method for large-scale parallel program
  • Performance data acquisition method for large-scale parallel program
  • Performance data acquisition method for large-scale parallel program

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0049] The method is a method for collecting large-scale parallel program performance data based on load balancing. see image 3 As shown, this method divides the nodes in the high-performance computer into three types, namely computing nodes, acquisition nodes and control nodes. In a high-performance computer system, nodes with strong computing capabilities can be used as computing nodes, nodes with high IO throughput can be used as collection nodes, and consoles with computing capabilities in high-performance computer systems can be used as control nodes. If the computing and IO throughput rates of each node are roughly the same, they can be allocated arbitrarily under the premise that the number of collection nodes does not exceed the number of computing nodes. Performance data needs to be transmitted between computing nodes and collection nodes, so the network communication between the two needs to be realized through reliable transmission protocols, such as TCP protocol,...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a load balancing based performance data acquisition method for a large-scale parallel program. The method is used for performing distributed acquisition on performance data generated by a large-scale parallel program in a high-performance computer and saving the data in an external memory. Nodes in the high-performance computer are divided into three classes of compute nodes, acquisition nodes and control nodes with the method comprising the following steps of: dynamically selecting the needed acquisition nodes with a load balancing algorithm; instrumenting and committing the parallel program on the control nodes through tools; collecting the performance data of the parallel program and sending the performance data to the selected acquisition nodes in a distributedway; receiving and caching the performance data from all the compute nodes in a distributed way and writing the performance data in the external memory. The efficiency and the expandability of the acquisition of the performance data of the parallel program can be improved with the method.

Description

technical field [0001] The invention can be applied to the field of large-scale parallel program data acquisition in high-performance computer systems. Background technique [0002] With the high-performance computing technology becoming more and more mature, parallel programs have been paid more and more attention to and used by more and more people. At present, the actual utilization rate of high-performance computers is very low. The main reason is that the task scheduling in parallel programs is not reasonable enough and the load is unbalanced, so that most processors are in an idle state. Without the necessary performance monitoring and visualization tools, it is difficult for programmers to monitor the execution status of parallel programs, communication status, processor utilization, etc., so that it cannot be accurately determined as the performance bottleneck of the algorithm. In this case, the development of parallel program performance evaluation tools has become...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F9/46
Inventor 李云春王金磊李巍
Owner BEIHANG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products