High-performance cluster resource fair allocation method based on multi-agent reinforcement learning
A reinforcement learning and multi-agent technology, which is applied in the field of resource scheduling of high-performance clusters, can solve the problems that cluster resources cannot be executed immediately, and achieve the effect of flexible and fast adjustment process, reducing time cost and improving generalization ability
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0024] like figure 1 As shown, a method for fair allocation of high-performance cluster resources based on multi-agent reinforcement learning provided by an embodiment of the present invention includes the following steps:
[0025] Step S1: establishing a Markov game model for high-performance cluster resource scheduling, including: defining job characteristic state, cluster resource usage state, single user state, and environment state of a single agent;
[0026] Step S2: collect real cluster data, use the simulation environment to perform job playback, and build a high-performance cluster simulation environment;
[0027] Step S3: train the strategy and state value evaluation network in a high-performance cluster simulation environment; wherein, the strategy and state value evaluation network includes: an action strategy neural network NN actor Sum value evaluation neural network NN critic , and respectively construct the corresponding loss function for parameter update.
...
Embodiment 2
[0095] like Image 6 As shown, the embodiment of the present invention provides a high-performance cluster resource fair allocation system based on multi-agent reinforcement learning, including the following modules:
[0096] Establishing a Markov game model module 41 for: establishing a Markov game model for high-performance cluster resource scheduling, including: defining job characteristic state, cluster resource usage state, single user state, and environment state of a single agent;
[0097] Building a high-performance cluster simulation environment module 42 for collecting real cluster data, using the simulation environment for job playback, and building a high-performance cluster simulation environment;
[0098] The training strategy and state value evaluation network module 43 is used to train the strategy and state value evaluation network in a high-performance cluster simulation environment; wherein, the strategy and state value evaluation network includes: an action...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com