Reinforcement learning-based data center high-density computing resource adaptive scheduling system

By constructing a volume entropy yield field and a hysteresis temperature increment field, and combining a dynamic discount factor and an immediate reward expectation, the problem of scheduling strategy divergence caused by heat conduction hysteresis in traditional reinforcement learning in ultra-high-density computing systems is solved, and the stable and efficient operation of high-density computing systems is achieved.

CN122240283APending Publication Date: 2026-06-19NANJING SHIYUN INFORMATION TECH CO LTD

Patent Information

Authority / Receiving Office
CN · China
Patent Type
Applications(China)
Current Assignee / Owner
NANJING SHIYUN INFORMATION TECH CO LTD
Filing Date
2026-05-25
Publication Date
2026-06-19

AI Technical Summary

Technical Problem

Traditional reinforcement learning in ultra-high-density computing systems suffers from the lag caused by cross-temporal heat conduction and associative memory effects, which leads to divergent optimization directions of scheduling strategies and makes it difficult to meet the requirements of ultra-stability and safe operation of high-density computing systems.

Method used

By constructing a volume entropy yield field, a hysteresis temperature increment field, and a dynamic discount factor, combined with the immediate reward expectation value and the cumulative advantage function, the update step size of the action network is limited, thereby achieving smooth scheduling of computing power distribution and reducing the risk of thermal runaway.

Benefits of technology

It significantly reduces the risk of chip thermal runaway, improves system stability and computing power efficiency, and enables smooth scheduling of high-density computing arrays under critical thermal limit conditions.

✦ Generated by Eureka AI based on patent content.

Smart Images

  • Figure CN122240283A_ABST
    Figure CN122240283A_ABST
Patent Text Reader

Abstract

This invention relates to the field of data processing technology and discloses an adaptive scheduling system for high-density computing resources in data centers based on reinforcement learning. The system includes: acquiring an absolute temperature field and a dynamic command heat generation power density distribution to construct a volume entropy productivity field; generating a computing power density distribution based on a dimensionless preference continuous field and substituting it into a partial differential analytical model of heat conduction to solve for the hysteresis temperature increment; combining the absolute cold source temperature to derive the immediate reward expectation value, and generating a dynamic discount factor through the total convective heat transfer admittance and the total heat capacity of the cluster to update the strategy; finally, calculating the microchannel flow velocity distribution and issuing the execution. This invention improves the stability and energy efficiency of ultra-high-density computing array operation.
Need to check novelty before this filing date? Find Prior Art