Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Traffic signal timing optimization method based on deep reinforcement learning

A technology of reinforcement learning and signal timing, applied in neural learning methods, road vehicle traffic control systems, traffic control systems, etc., can solve problems such as driver's intense driving behavior

Active Publication Date: 2021-04-23
BEIJING UNIV OF TECH
View PDF3 Cites 31 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

If the red light time of a single lane is too long, the remaining lanes will suffer unbearable waiting time, causing drivers to drive aggressively

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Traffic signal timing optimization method based on deep reinforcement learning
  • Traffic signal timing optimization method based on deep reinforcement learning
  • Traffic signal timing optimization method based on deep reinforcement learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0052] The signal timing optimization method based on deep reinforcement learning of the present invention will be further described in detail below in conjunction with the above illustrations.

[0053] The invention is based on the Linux system and uses the traffic simulation software SUMO as a test platform. Compared with other simulation software such as Aimsun and Vissim, the execution speed of SUMO is faster. It not only enables large-scale traffic flow management, but also interacts with other applications such as Pycharm. Most importantly, SUMO's built-in API interface Traci (traffic control interface) can extract simulation environment data online, and can simulate the output actions of the signal light decision network in real time to realize the interactive process of reinforcement learning.

[0054] Step 1: Design of the traffic network

[0055] This time, the intersection of Hongyan East Road and Xidawang South Road in Chaoyang District, Beijing is used as the tra...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a traffic signal timing optimization method based on deep reinforcement learning. A signal lamp outputs a proper phase according to the traffic flow states in all directions of an intersection, and dynamically adjustes the phase length; Specifically, firstly, a PPO algorithm is adopted to improve the convergence rate of the model; then, the vehicle state is defined using a DTSE method, and the design of the state, action, and award is elaborated; and finally, the actual traffic data is experimented through a traffic simulation platform SUMO. Results show that compared with traditional timing control, the scheme can effectively reduce waiting time and queuing length of vehicles in various traffic flow modes.

Description

technical field [0001] The invention relates to the fields of traffic signal control, deep learning, and reinforcement learning, and the specific invention is a traffic signal timing optimization method based on deep reinforcement learning. The method first obtains the state information of the vehicle and the signal light through the traffic camera and the signal light controller respectively as the input of the neural network, then outputs a suitable signal phase through the network, and finally adjusts the neural network parameters according to the value of the reward equation by reinforcement learning. In the case of ensuring traffic safety, learn control rules, adjust the output phase of signal lights, and improve the traffic efficiency of the road network by minimizing the queuing length and waiting time of vehicles in all directions at the intersection. Background technique [0002] The management of urban road intersections is mainly realized by controlling signal lig...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G08G1/081G08G1/083G08G1/085G08G1/017G06N3/08
CPCG08G1/081G08G1/083G08G1/085G08G1/0175G06N3/08Y02T10/40
Inventor 张利国崔铜巢马子博江丰尧邓文星
Owner BEIJING UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products