Multi-intersection vehicle-road cooperative control method and device based on hierarchical reinforcement learning, medium

A manager agent computes a global reward offset that is used to dynamically adjust traffic light control and connected autonomous vehicle (CAV) trajectory planning. This addresses the insufficient linkage mechanism in hierarchical reinforcement learning traffic control and is intended to optimize and stabilize global traffic flow.

CN122245129A · Pending · Publication Date: 2026-06-19 · ZHEJIANG UNIV


Patent Information

Authority / Receiving Office: CN · China
Patent Type: Applications (China)
Current Assignee / Owner: ZHEJIANG UNIV
Filing Date: 2026-05-25
Publication Date: 2026-06-19

Application Information

Patent Timeline

25 May 2026: Application
19 Jun 2026: Publication (CN122245129A)

IPC: G08G1/082; G08G1/01; G08G1/0968; G08G1/0967; G06N3/092

AI Tagging

Application Domain

Detection of traffic movement; Biological models


Similar Technology Patents

Intersection traffic digital twin method and device based on video and coordinate fusion and storage medium
CN122201023A · Arrangements for variable traffic instructions; Detection of traffic movement
Traffic recording device and traffic monitoring system
JP7873577B2 · Detection of traffic movement; Closed circuit television systems
Control method of vehicle and vehicle
CN122201020A · Controlling traffic signals; Detection of traffic movement
Data model driven road infrastructure digital twin simulation prediction method
CN122197345A · Simulator control; Detection of traffic movement
Intelligent transportation system and method for improving road safety
CN122224006A · Detection of traffic movement; Particular environment based services


AI Technical Summary

Technical Problem

Existing hierarchical reinforcement learning traffic control methods lack a flexible, efficient linkage mechanism between the manager and the lower-level agents. As a result, traffic light timing and CAV trajectory planning strategies oscillate, which makes global optimization difficult to achieve.

Method used

A manager agent collects global traffic information, calculates a global reward offset, and feeds that offset into the reward functions of the traffic light agents and the connected autonomous vehicle (CAV) agents. The agents then dynamically adjust traffic light phases and CAV trajectory planning, forming a multi-level hierarchical decision-making architecture.
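The reward-offset idea can be illustrated with a minimal Python sketch. The source does not give the offset formula, so everything here is an assumption made for illustration: the `LocalObservation` fields, the `weight` parameter, and the choice of offset (a penalty proportional to the network-wide mean queue length) are hypothetical and not taken from the patent.

```python
from dataclasses import dataclass
from statistics import mean


@dataclass
class LocalObservation:
    """What one lower-level agent reports (hypothetical fields)."""
    agent_id: str
    queue_length: float  # vehicles waiting near this agent
    local_reward: float  # reward from the agent's own reward function


def global_reward_offset(observations, weight=0.5):
    """Assumed offset: penalize the mean queue length over the whole network.

    The patent only says the manager derives an offset from global traffic
    information; this particular formula is an illustrative stand-in.
    """
    return -weight * mean(o.queue_length for o in observations)


def shaped_rewards(observations, weight=0.5):
    """Add the same global offset to every agent's local reward."""
    offset = global_reward_offset(observations, weight)
    return {o.agent_id: o.local_reward + offset for o in observations}


if __name__ == "__main__":
    obs = [
        LocalObservation("light_A", queue_length=4.0, local_reward=1.0),
        LocalObservation("cav_17", queue_length=8.0, local_reward=-2.0),
    ]
    print(shaped_rewards(obs))
```

Because every agent receives the same offset, a traffic light agent and a CAV agent are both pushed toward the same global objective, which is the coupling the summary describes. A real system would presumably learn or tune the offset rather than hard-code it.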

Benefits of technology

Traffic light timing and CAV trajectory planning co-evolve under the same global objective, which avoids system oscillations and improves the global optimality and flexibility of regional traffic flow.

✦ Generated by Eureka AI based on patent content.


Patent Text Reader

Abstract

This invention discloses a multi-intersection vehicle-road cooperative control method, device, and medium based on hierarchical reinforcement learning, comprising: acquiring intersection nodes and road connection relationships of the traffic network to be analyzed, thereby constructing a road network model; deploying a manager agent, several traffic light agents, and several connected autonomous vehicle agents at each intersection; each traffic light agent collecting the current traffic state of the intersection, and each connected autonomous vehicle agent collecting the current traffic flow state of the road; the manager agent at each intersection calculating a global reward offset based on the shared global traffic state and traffic flow state, and passing the global reward offset to the reward functions of the corresponding traffic light agent and connected autonomous vehicle agent; each traffic light agent and each connected autonomous vehicle agent adjusting the traffic light phase and the trajectory planning of the connected autonomous vehicle respectively based on the global reward offset.
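The data flow in the abstract (build a road network model, share state, let each intersection's manager produce an offset) can be sketched roughly as follows. This is a sketch under stated assumptions, not the patented implementation: `build_road_network`, `control_round`, the `queues` and `cav_speeds` dictionaries, and the pluggable `offset_fn` are hypothetical names, and the network is assumed to be a simple directed adjacency map.

```python
def build_road_network(intersections, roads):
    """Build a directed adjacency map from intersection nodes and road links.

    `roads` is a list of (from_node, to_node) pairs. Unknown nodes are
    rejected so the model only contains declared intersections.
    """
    graph = {node: [] for node in intersections}
    for src, dst in roads:
        if src not in graph or dst not in graph:
            raise ValueError(f"road ({src}, {dst}) references an unknown intersection")
        graph[src].append(dst)
    return graph


def control_round(graph, queues, cav_speeds, offset_fn):
    """One coordination round: each intersection's manager computes an offset.

    `queues` stands in for the state collected by traffic light agents and
    `cav_speeds` for the state collected by CAV agents. `offset_fn` is a
    placeholder for the manager's (unspecified) offset computation.
    """
    shared = {"queues": dict(queues), "cav_speeds": dict(cav_speeds)}
    return {node: offset_fn(node, graph, shared) for node in graph}


def total_queue_penalty(node, graph, shared):
    """Example offset_fn (assumption): negative total queue across the network."""
    return -sum(shared["queues"].values())


if __name__ == "__main__":
    net = build_road_network(["A", "B", "C"], [("A", "B"), ("B", "C")])
    offsets = control_round(
        net,
        queues={"A": 3, "B": 5, "C": 1},
        cav_speeds={"cav_1": 12.5},
        offset_fn=total_queue_penalty,
    )
    print(net)
    print(offsets)
```

The returned offsets would then be added to the reward functions of the traffic light and CAV agents at the corresponding intersection. Keeping `offset_fn` as a parameter leaves the actual offset computation open, since the abstract does not specify it.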
