Task and data collaborative scheduling method for wide-area high-performance computing environment

A high-performance computing and data collaboration technology, applied in the computer field, can solve problems such as unbalanced global resource utilization, incomplete matching of data layout and computing task distribution, and no consideration of data migration, so as to meet the needs of efficient computing, accurate and efficient The effect of coordinated scheduling of tasks and data

Active Publication Date: 2020-10-16
BEIHANG UNIV
View PDF2 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] The main problems of traditional wide-area scheduling methods are: most task scheduling methods do not consider data migration, but data migration costs account for a large proportion of task completion time, which leads to a large amount of global data migration and task completion time. and long waiting times for results
In addition, most of the existing data scheduling methods focus on the efficient access and migration of data, and seldom consider the correlation between data and computing tasks, which leads to the incomplete matching of data layout and computing task distribution, which leads to relatively Low global resource utilization and long task completion times
However, the existing collaborative scheduling methods only focus on optimizing the scheduling process through one aspect of data layout or task scheduling. The relatively single consideration of scheduling factors leads to unbalanced global resource utilization, so that the system still has a relatively high task completion rate. time and waiting time

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Task and data collaborative scheduling method for wide-area high-performance computing environment
  • Task and data collaborative scheduling method for wide-area high-performance computing environment
  • Task and data collaborative scheduling method for wide-area high-performance computing environment

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0037] Combined with the accompanying drawings below ( Figure 1-Figure 5 ) The present invention is described in further detail.

[0038] figure 1 It is a flow chart of the task and data collaborative scheduling method for the wide-area high-performance computing environment of the present invention. Such as figure 1 As shown in Fig. 1, the task completion time model is first constructed according to the global resource status, computing task requirements, data layout and other conditions. Secondly, according to the optimal scheme selection mechanism, the optimal center-level collaborative scheduling scheme is selected based on the task completion time estimate and global resource usage status. Then, data redundancy layout is implemented based on data access frequency in the scheduling process. Finally, a queue-level scheduling scheme is generated based on the task-stealing mechanism to correct the deviation of the estimated value of task completion time. This method can...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a task and data collaborative scheduling method for a wide-area high-performance computing environment. The method comprises the following steps: firstly, constructing a task completion time model according to conditions such as a global resource state, a computing task demand and a data layout condition; secondly, according to an optimal scheme selection mechanism, selecting an optimal center-level cooperative scheduling scheme based on a task completion time estimated value and a global resource use state; and then, realizing data redundancy layout based on the data access frequency in the scheduling process. And finally, generating a queue-level scheduling scheme based on a task stealing mechanism so as to correct estimated value deviation of task completion time.According to the method, the computing tasks and the corresponding data can be cooperatively scheduled among a plurality of cross-domain centers and task queues, task allocation and data layout are efficiently and reasonably carried out, and the system time performance and the global resource utilization rate are both considered, so that efficient computing is realized.

Description

technical field [0001] The invention discloses a task and data collaborative scheduling method for a wide-area high-performance computing environment, relates to challenges faced by wide-area high-performance computing, and belongs to the field of computer technology. Background technique [0002] In the high-performance computing environment, for many parallel applications such as weather forecasting and geological observation, the calculation data comes from remote sensing satellites or observation points distributed all over the world, resulting in data storage in geographically distributed computing centers (including supercomputing centers, data processing centers, etc.) center, etc.) and wide area sharing. At the same time, with the increasing complexity of scientific and engineering problems, parallel applications have gradually increased the demand for storage and computing resources. To address the above challenges, the current trend is to execute applications in w...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/21G06F16/27G06F9/48G06F9/50
CPCG06F16/214G06F16/27G06F9/4881G06F9/5088
Inventor 肖利民宋尧秦广军霍志胜张晨浩周汉杰
Owner BEIHANG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products