Highway road collaborative control system and method based on deep reinforcement learning

A highway and reinforcement learning technology, applied in the traffic control system of road vehicles, traffic control system, neural learning method, etc., can solve the problems of random disturbance, behavior space state explosion, surrounding road congestion, etc., to improve traffic efficiency, The effect of improving efficiency and simplifying control methods

Active Publication Date: 2021-09-07
NANJING UNIV OF INFORMATION SCI & TECH
View PDF10 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, deep reinforcement learning has the following problems when dealing with cooperative control: (1) Synchronous control problem when multi-agents cooperate
For example, the period of the ramp signal light and the period of the variable speed limit control are inconsistent, how to unify the two; (2) the existing reward function is easily affected by random disturbances in the traffic environment; On-ramp queuing issues, which can lead to congestion on surrounding roads
(4) The traditional deep reinforcement learning technology has inherent defects, and it is easy to cause problems such as behavior space state explosion when dealing with multi-agent cooperative control

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Highway road collaborative control system and method based on deep reinforcement learning
  • Highway road collaborative control system and method based on deep reinforcement learning
  • Highway road collaborative control system and method based on deep reinforcement learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0078] The present invention will be further described in detail below in conjunction with the examples.

[0079] The expressway variable speed limit and on-ramp cooperative control system based on vehicle-road coordination technology in this embodiment includes a traffic information interaction module, a traffic control module, a deep learning neural network training module, and several traffic control units.

[0080] Among them: the traffic information interaction module collects road observation information based on vehicle-road coordination technology o t , and o t Transformed into traffic state information available for deep reinforcement learning s t , sent to the traffic control module; at the same time, the instructions from the traffic control module are passed to the vehicles within the jurisdiction.

[0081] The traffic control module based on deep reinforcement learning, according to the traffic state information st Choose the optimal behavior strategy a t . Am...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a highway road collaborative control system and method based on deep reinforcement learning. The system includes a traffic information interaction module, a traffic control module, a deep learning network training module, and several variable speed limit and ramp control units. The module obtains the traffic status of the road, and then passes it to the traffic control module. The latter continuously optimizes the control strategy through the training module, and uses a deep reinforcement learning algorithm with actor-critic architecture to ensure the stability of the training process. The invention can simultaneously control all the traffic control units in the system without causing problems such as traffic state space explosion, can ensure that vehicles pass through the bottleneck section at a relatively high speed, and will not affect the passage of vehicles on surrounding roads due to problems such as queuing.

Description

technical field [0001] The invention relates to the technical field of traffic control and intelligent transportation, in particular to a system and method for cooperative control of expressway main roads and entrance ramps based on deep reinforcement learning. Background technique [0002] Expressways present frequent, periodic, and long-distance traffic congestion during peak hours, among which, the entrance ramps and adjacent main roads of expressways have become typical bottleneck areas of expressways. Since the early road network planning may be unreasonable, and road reconstruction is difficult, the coordinated management and control of expressway ramps and adjacent main roads is an important way to improve road traffic efficiency and improve driving safety. [0003] The existing cooperative control methods are mainly model predictive control or feedback control methods. In the model predictive control method, the characteristic variables are generally extracted from ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G08G1/08G08G1/01G06N3/08G06N3/04
CPCG08G1/08G08G1/0116G08G1/0133G06N3/08G06N3/045
Inventor 王翀
Owner NANJING UNIV OF INFORMATION SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products