Multi-intersection vehicle-road cooperative control method and device based on hierarchical reinforcement learning, medium
By calculating the global reward offset using a managerial intelligent agent and dynamically adjusting traffic lights and CAV trajectory planning, the problem of insufficient linkage mechanism in hierarchical reinforcement learning traffic control is solved, and the optimization and stability of global traffic flow are achieved.
Patent Information
- Authority / Receiving Office
- CN · China
- Patent Type
- Applications(China)
- Current Assignee / Owner
- ZHEJIANG UNIV
- Filing Date
- 2026-05-25
- Publication Date
- 2026-06-19
AI Technical Summary
In existing hierarchical reinforcement learning traffic control methods, there is a lack of flexible and efficient linkage mechanisms between managers and lower-level agents, which leads to oscillations in traffic light timing and CAV trajectory planning strategies, making it difficult to achieve global optimization.
A managerial intelligent agent collects global traffic information, calculates the global reward offset, and transmits it to the reward function of the traffic light intelligent agent and the connected autonomous vehicle intelligent agent to dynamically adjust the traffic light phase and CAV trajectory planning, forming a multi-level hierarchical decision-making architecture.
It achieves the co-evolution of traffic light timing and CAV trajectory planning under the same global objective, avoids system oscillations, and improves the global optimality and flexibility of regional traffic flow.
Smart Images

Figure CN122245129A_ABST