Intelligent task scheduling strategy training method based on strategy gradient reinforcement learning

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A technology of task scheduling and reinforcement learning, which is applied in the direction of program startup/switching, resource allocation, and multi-programming devices. It can solve problems such as large computing resources, suboptimal scheduling strategies, and high complexity of task scheduling strategy training, so as to improve training Efficiency, reduced computing resource consumption, and reduced overall running time

Pending Publication Date: 2019-06-07

浙江方正印务有限公司

View PDF1 Cites 7 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0004] In order to overcome the above-mentioned deficiencies, the present invention aims to provide a strategy training method for intelligent task scheduling based on strategy gradient reinforcement learning. This method abstracts the task scheduling problem into a process of reinforcement learning, and uses strategy gradient descent to train the network model. A scheduling policy training method combining multi-dimensional task state matrix, scheduling policy matrix and reward function is proposed, which improves the training efficiency of task scheduling policy, reduces the overall running time of tasks, reduces the consumption of computing resources, and solves the problems in the prior art. Existing task scheduling policy training complexity is high, the scheduling policy is sub-optimal, and consumes a lot of computing resources.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment

[0045] Example: such as figure 1 As shown, an intelligent task scheduling policy training method based on policy gradient reinforcement learning mainly includes the following steps:

[0046] The first step: task scheduling sequence data generation based on reinforcement learning; such as figure 2 As shown, it mainly includes the following five sub-steps:

[0047] (1) Online calculation of task state matrix S at time t t . As shown in the following formula, S t is an m*2q dimensional matrix, where m represents the number of ready tasks in the system, q represents the number of processors in the system, EST(n i ,p j ) means task n i in processor p jThe earliest start time on the w i，j Indicates task n i in processor p j Computational overhead on .

[0048]

[0049] (2) Based on S t Scheduling action matrix PA using policy network output t . As shown in the following formula, the probability value of the corresponding scheduling action is output through the poli...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention relates to an intelligent task scheduling strategy training method based on strategy gradient reinforcement learning. The method mainly comprises task scheduling sequence data generationbased on reinforcement learning and a task scheduling strategy training method based on strategy gradient reinforcement learning. According to the method, a task scheduling problem is abstracted intoa reinforcement learning process, a strategy gradient descent training network model is used, a scheduling strategy training method combining a multi-dimensional task state matrix, a scheduling strategy matrix and a reward function is provided, the training efficiency of a task scheduling strategy is improved, and the total running time of the task is shortened.

Description

technical field [0001] The invention relates to the technical field of computer system task scheduling and artificial intelligence algorithms, in particular to an intelligent task scheduling policy training method based on policy gradient reinforcement learning. Background technique [0002] Facing the big data analysis and computing tasks in the data center, how to efficiently schedule computing tasks to run on large-scale servers has become a key issue in computer systems. Big data analysis and computing tasks have the characteristics of high parallelism and complex data dependencies, and the task scheduling problem is facing huge challenges. At the same time, there are large performance differences among server computing devices in data centers, thus forming a complex distributed heterogeneous computing system. Scheduling parallel computing tasks in a distributed heterogeneous computing system is a recognized NP-hard problem in the industry, and it is impossible to find ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): G06F9/48G06F9/50

Inventor 程雨夏庄跃辉

Owner 浙江方正印务有限公司

Intelligent task scheduling strategy training method based on strategy gradient reinforcement learning

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology