Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Expressway road cooperative control system and method based on deep reinforcement learning

A technology of expressway and reinforcement learning, which is applied in the traffic control system of road vehicles, traffic control system, neural learning method, etc., and can solve problems such as random disturbance, surrounding road congestion, and less consideration of vehicle queuing

Active Publication Date: 2021-01-29
NANJING UNIV OF INFORMATION SCI & TECH
View PDF10 Cites 14 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, deep reinforcement learning has the following problems when dealing with cooperative control: (1) Synchronous control problem when multi-agents cooperate
For example, the period of the ramp signal light and the period of the variable speed limit control are inconsistent, how to unify the two; (2) the existing reward function is easily affected by random disturbances in the traffic environment; On-ramp queuing issues, which can lead to congestion on surrounding roads
(4) The traditional deep reinforcement learning technology has inherent defects, and it is easy to cause problems such as behavior space state explosion when dealing with multi-agent cooperative control

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Expressway road cooperative control system and method based on deep reinforcement learning
  • Expressway road cooperative control system and method based on deep reinforcement learning
  • Expressway road cooperative control system and method based on deep reinforcement learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0078] The present invention will be further described in detail below in conjunction with the examples.

[0079] The expressway variable speed limit and on-ramp cooperative control system based on vehicle-road coordination technology in this embodiment includes a traffic information interaction module, a traffic control module, a deep learning neural network training module, and several traffic control units.

[0080] Among them: the traffic information interaction module collects road observation information based on vehicle-road coordination technology o t , and o t Transformed into traffic state information available for deep reinforcement learning s t , sent to the traffic control module; at the same time, the instructions from the traffic control module are passed to the vehicles within the jurisdiction.

[0081] The traffic control module based on deep reinforcement learning, according to the traffic state information st Choose the optimal behavior strategy a t . Am...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an expressway road cooperative control system and method based on deep reinforcement learning, wherein the system comprises a traffic information interaction module, a trafficcontrol module, a deep learning network training module, and a plurality of variable speed limit and ramp control units. The traffic status of ta road is obtained through the information interaction module, and then the traffic status is transmitted to the traffic control module. And the control strategy is continuously optimized through the training module, and the stability of the training process is ensured by adopting a deep reinforcement learning algorithm with an actor-critic architecture. All traffic control units in the system can be controlled at the same time, the problems of trafficstatus space explosion and the like cannot be caused, it can be guaranteed that vehicles pass through bottleneck road sections at a high speed, and passing of surrounding road vehicles cannot be affected by queuing and the like.

Description

technical field [0001] The invention relates to the technical field of traffic control and intelligent transportation, in particular to a system and method for cooperative control of expressway main roads and entrance ramps based on deep reinforcement learning. Background technique [0002] Expressways present frequent, periodic, and long-distance traffic congestion during peak hours, among which, the entrance ramps and adjacent main roads of expressways have become typical bottleneck areas of expressways. Since the early road network planning may be unreasonable, and road reconstruction is difficult, the coordinated management and control of expressway ramps and adjacent main roads is an important way to improve road traffic efficiency and improve driving safety. [0003] The existing cooperative control methods are mainly model predictive control or feedback control methods. In the model predictive control method, the characteristic variables are generally extracted from ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G08G1/08G08G1/01G06N3/08G06N3/04
CPCG08G1/08G08G1/0116G08G1/0133G06N3/08G06N3/045
Inventor 王翀
Owner NANJING UNIV OF INFORMATION SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products