Traffic signal timing optimization method based on deep reinforcement learning

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A technology of reinforcement learning and signal timing, applied in neural learning methods, road vehicle traffic control systems, traffic control systems, etc., can solve problems such as driver's intense driving behavior

Active Publication Date: 2021-04-23

BEIJING UNIV OF TECH

View PDF3 Cites 31 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

If the red light time of a single lane is too long, the remaining lanes will suffer unbearable waiting time, causing drivers to drive aggressively

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0052] The signal timing optimization method based on deep reinforcement learning of the present invention will be further described in detail below in conjunction with the above illustrations.

[0053] The invention is based on the Linux system and uses the traffic simulation software SUMO as a test platform. Compared with other simulation software such as Aimsun and Vissim, the execution speed of SUMO is faster. It not only enables large-scale traffic flow management, but also interacts with other applications such as Pycharm. Most importantly, SUMO's built-in API interface Traci (traffic control interface) can extract simulation environment data online, and can simulate the output actions of the signal light decision network in real time to realize the interactive process of reinforcement learning.

[0054] Step 1: Design of the traffic network

[0055] This time, the intersection of Hongyan East Road and Xidawang South Road in Chaoyang District, Beijing is used as the tra...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a traffic signal timing optimization method based on deep reinforcement learning. A signal lamp outputs a proper phase according to the traffic flow states in all directions of an intersection, and dynamically adjustes the phase length; Specifically, firstly, a PPO algorithm is adopted to improve the convergence rate of the model; then, the vehicle state is defined using a DTSE method, and the design of the state, action, and award is elaborated; and finally, the actual traffic data is experimented through a traffic simulation platform SUMO. Results show that compared with traditional timing control, the scheme can effectively reduce waiting time and queuing length of vehicles in various traffic flow modes.

Description

technical field [0001] The invention relates to the fields of traffic signal control, deep learning, and reinforcement learning, and the specific invention is a traffic signal timing optimization method based on deep reinforcement learning. The method first obtains the state information of the vehicle and the signal light through the traffic camera and the signal light controller respectively as the input of the neural network, then outputs a suitable signal phase through the network, and finally adjusts the neural network parameters according to the value of the reward equation by reinforcement learning. In the case of ensuring traffic safety, learn control rules, adjust the output phase of signal lights, and improve the traffic efficiency of the road network by minimizing the queuing length and waiting time of vehicles in all directions at the intersection. Background technique [0002] The management of urban road intersections is mainly realized by controlling signal lig...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): G08G1/081G08G1/083G08G1/085G08G1/017G06N3/08

CPCG08G1/081G08G1/083G08G1/085G08G1/0175G06N3/08Y02T10/40

Inventor 张利国崔铜巢马子博江丰尧邓文星

Owner BEIJING UNIV OF TECH

Traffic signal timing optimization method based on deep reinforcement learning

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

Agents

Company

Traffic signal timing optimization method based on deep reinforcement learning

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

Agents

Company

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology