Optimization method for signal timing in single-point crossroad based on deep reinforcement learning

A technology of reinforcement learning and single-point intersection, applied in the field of intelligent transportation, can solve problems such as poor robustness, inapplicability, and complex and cumbersome calculation processes, and achieve the effects of reducing traffic congestion, improving stability, and reducing complexity

Inactive Publication Date: 2019-01-15
BEIHANG UNIV
View PDF4 Cites 28 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, there are still many deficiencies in its control of complex traffic systems, such as poor robustness, not stable enough, etc.
[0004] Reinforcement learning algorithms have been applied in the traffic field relatively early, and can control and optimize simple traffic signal timing. However, for traditional reinforcement learning algorithms, although they can solve simple control problems, due to the limitations of their own algorithms, It cannot solve the problem of high input dimension, and the calculation process is very complicated and cumbersome, so it is not suitable for today's traffic signal control

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Optimization method for signal timing in single-point crossroad based on deep reinforcement learning
  • Optimization method for signal timing in single-point crossroad based on deep reinforcement learning
  • Optimization method for signal timing in single-point crossroad based on deep reinforcement learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0024] The technical solution of the present invention will be further described in detail below in conjunction with the accompanying drawings, but the implementation and protection of the present invention are not limited to the content described below.

[0025] The invention is based on deep reinforcement learning, uses traffic simulation software SUMO to perform simulation iterations, and finally realizes self-adaptive timing optimization of single-point intersections. The specific implementation steps are as follows:

[0026] Step 1: Establish a single-point intersection through the SUMO simulation software. Here, a one-way three-lane intersection is established according to the actual road. like figure 2 As shown, the innermost lane is the left-turn lane, the middle lane is the through lane, and the right lane is the through and right-turn lane. After the intersection is established, enter the traffic volume of each lane for the four roads entering the intersection. Th...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an optimization method for signal timing in a single-point crossroad based on deep reinforcement learning, aiming at optimizing simulation timing in the single-point crossroadbased on deep reinforcement learning by actual road data, and finally applying a simulation scheme to an actual road. The method overcomes the shortcomings of the existing adaptive signal timing method and provides a novel algorithm that can perform high-dimensional input and fast convergence. The method comprises the following steps: firstly, performing matrix processing on the original data, randomly extracting input data through an empirical playback strategy, selecting an action of reinforcement learning by using an epsilon-greedy strategy, and finally obtaining an optimal solution for signal timing through continuous iterative training.

Description

technical field [0001] The invention discloses a deep reinforcement learning method for controlling and optimizing a single-point signalized intersection, which belongs to the technical field of intelligent transportation. Background technique [0002] With the development of the world economy and the rapid advancement of urbanization, the size and population of cities are also expanding and increasing. Although the expansion of cities has promoted economic development to a certain extent, the emergence of more and more private cars has caused urban traffic congestion. In order to strengthen the management of urban roads and reduce the occurrence of traffic accidents and congestion, it is necessary to set up traffic lights at intersections, thereby increasing road utilization and improving traffic conditions. The current traffic signal control can be roughly divided into three categories: (1) fixed signal timing. This timing strategy sets a fixed green signal ratio and sig...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G08G1/08G06F17/50
CPCG06F30/20G08G1/08
Inventor 陈鹏朱泽茂鲁光泉王云鹏余贵珍
Owner BEIHANG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products