Traffic signal lamp control method based on time distribution and reinforcement learning

A technology of traffic lights and reinforcement learning, which is applied in the field of traffic lights control based on time allocation and reinforcement learning, can solve problems such as the inability to directly realize the countdown function, and achieve the effect of reducing waiting time and delay, and reducing the length of the fleet

Active Publication Date: 2019-08-23
ZHEJIANG UNIV
View PDF5 Cites 13 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] Most of the phase-switching models check the traffic flow distribution on the lane connecting the intersection every short period of time (usually about 5 seconds), and then the model generates an operation signal whether to switch to the next phase and Immediate execution, so most of these methods cannot directly implement the very important countdown function on the real road

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Traffic signal lamp control method based on time distribution and reinforcement learning
  • Traffic signal lamp control method based on time distribution and reinforcement learning
  • Traffic signal lamp control method based on time distribution and reinforcement learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0043] The present invention will be further described in detail below with reference to the accompanying drawings and embodiments. It should be noted that the following embodiments are intended to facilitate the understanding of the present invention, but do not limit it in any way.

[0044] First of all, it needs to be explained that the present invention is compatible with almost all reinforcement learning frameworks. The following only uses the classic DQN framework as an example to illustrate how the present invention can be used in combination with reinforcement learning frameworks.

[0045] Different from the existing switching phase action, which only needs to consider the two action options of "maintain" and "switching", the time allocation type action designed by the present invention needs to consider how to allocate duration to all phases in a signal cycle. However, if all the timing methods are directly considered as the action options to be considered, there will ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a traffic signal lamp control method based on time distribution and reinforcement learning. The traffic signal lamp control method comprises the following steps: (1) configuring a simulated intersection environment and traffic flow data to a traffic simulator, and building an intelligent agent network; (2) an intelligent agent network generating the action of the next signal period according to the road condition state and transmitting the action to a traffic simulator to simulate one signal period; (3) storing the experience of the last signal period into a replay buffer; (4) sampling an experience from the replay buffer to train an intelligent agent network, judging whether the number of simulated steps reaches a preset value, and if not, returning to the step (2), otherwise, executing the next step; and (5) resetting the traffic simulator, testing the intelligent agent network, and performing traffic signal lamp control application after the test is finished.By adopting the traffic signal lamp control method of the invention, the traffic efficiency can be obviously improved, and the method can be more easily applied to actual roads.

Description

technical field [0001] The invention belongs to the field of traffic signal lamp control, in particular to a traffic signal lamp control method based on time allocation and reinforcement learning. Background technique [0002] For a long time, traffic congestion has not only plagued the daily travel of people all over the world, but also caused serious economic losses. Studies have pointed to inefficient traffic light control signals as one of the most prominent causes of frequent congestion. Therefore, how to optimize the traffic light control mechanism and improve the overall traffic efficiency has attracted extensive attention from academia and governments. [0003] Traditional traffic signal control strategies mostly rely on static schedules designed by traffic engineers, or dynamically adjust traffic signals based on real-time traffic information and artificially specified rules. However, due to the complexity and variability of real traffic scenarios, these methods s...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G08G1/08G08G1/01
CPCG08G1/0145G08G1/08
Inventor 项超蔡登何晓飞金仲明黄建强华先胜
Owner ZHEJIANG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products