Method for quickly collecting large-scale parallel program performance data based on layer tree network

A large-scale, tree network technology, applied in the computer field, can solve problems such as waste of resources and affect transmission efficiency, and achieve the effect of rapid collection

Active Publication Date: 2021-03-05
JIANGNAN INST OF COMPUTING TECH
View PDF2 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

When there is a lot of data to be collected, the collection nodes often need to be deployed on dedicated servers, and the amount of performance data collected on a single node is small. If the above scheme is adopted, it will cause a certain degree of waste of resources and will Affect transmission efficiency

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for quickly collecting large-scale parallel program performance data based on layer tree network
  • Method for quickly collecting large-scale parallel program performance data based on layer tree network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0026] EXAMPLES: A large-scale parallel program performance data of a layered tree network, based on the following configuration:

[0027] Several acquisition nodes for collecting performance data;

[0028] Several sub-collection nodes are used to manage collection nodes or to manage sub-collection nodes, but also for screening, collection, and processing of collected performance data;

[0029] The root node is used to manage the collection nodes and is also used for the collection and processing of the acquired performance data;

[0030] The root node is disposed on the service node, the sub-collection node is disposed on the serial number of the service node or the compute node, the acquisition node is set to the primary core of the compute node, the root node, sub-collection node and acquisition Node laminated to form a tree network;

[0031] Include the following steps:

[0032] S1, based on the maximum number of port numbers that can be used up to the calculation node, determ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a method for quickly collecting large-scale parallel program performance data based on a layer tree network, which comprises the following steps of: S1, determining the numberm of sub-nodes needing to be responsible for sub-collection nodes according to the number limit M of port numbers which can be used at most by a process started by a calculation node; S2, according tothe m value obtained in S1, dividing the total process number N of the performance data needing to be collected into a topological network with the same parallelism degree of each tree layer; s3, analyzing a data transmission efficiency ratio of 1: eta 1, eta 2,..., eta<h-1> of each tree layer, wherein h is the depth of the tree; S4, optimizing the topological network according to the data transmission efficiency ratio of each tree layer in S3; and S5, establishing a layer tree collection network through socket to realize collection of performance data. Aiming at the structural characteristics of the heterogeneous many-core, in combination with the characteristics of the heterogeneous many-core processor, part of the collection processes sink to the computing nodes, and topological optimization is performed on the tree-type collection network in combination with the data transmission efficiency of each tree layer, so that quick collection of large-scale parallel program performance data can be realized.

Description

Technical field [0001] The present invention relates to a rapid collection method based on a large-scale parallel program performance data based on a layer tree network, which belongs to the technical field of computer. Background technique [0002] In order to obtain higher computing power, the parallel scale of the high-performance computing platform is increasing, and the supercomputer system with tens of millions of calculated core has begun to serve research applications, and performs large-scale application running performance on such platforms. Monitor analysis, the rapid processing of a large distributed performance data generated during the period faces challenges. [0003] The existing performance data collection method can be divided into two, one is based on the collection method of the forward process, generates each process or thread performance data file, and the client is responsible for subsequent processing; the other is based on the program Collection method, t...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F11/30
CPCG06F11/3089G06F11/302G06F11/3051
Inventor 冯赟龙刘勇何王全陈华蓉宋佳伟王敬宇彭达佳孙川张威罗威梁艳
Owner JIANGNAN INST OF COMPUTING TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products