Intelligent task scheduling strategy training method based on strategy gradient reinforcement learning
A technology of task scheduling and reinforcement learning, which is applied in the direction of program startup/switching, resource allocation, and multi-programming devices. It can solve problems such as large computing resources, suboptimal scheduling strategies, and high complexity of task scheduling strategy training, so as to improve training Efficiency, reduced computing resource consumption, and reduced overall running time
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment
[0045] Example: such as figure 1 As shown, an intelligent task scheduling policy training method based on policy gradient reinforcement learning mainly includes the following steps:
[0046] The first step: task scheduling sequence data generation based on reinforcement learning; such as figure 2 As shown, it mainly includes the following five sub-steps:
[0047] (1) Online calculation of task state matrix S at time t t . As shown in the following formula, S t is an m*2q dimensional matrix, where m represents the number of ready tasks in the system, q represents the number of processors in the system, EST(n i ,p j ) means task n i in processor p jThe earliest start time on the w i,j Indicates task n i in processor p j Computational overhead on .
[0048]
[0049] (2) Based on S t Scheduling action matrix PA using policy network output t . As shown in the following formula, the probability value of the corresponding scheduling action is output through the poli...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


