Traffic signal self-adaptive control method based on deep reinforcement learning

An adaptive control and traffic signal technology, applied in traffic signal control, neural learning methods, biological neural network models, etc., can solve the problems of incomplete traffic state perception, inaccurate signal control strategy formulation, etc., to achieve traffic state perception Inaccurate, to achieve the effect of precise perception

Inactive Publication Date: 2017-06-30
DALIAN UNIV OF TECH
View PDF8 Cites 89 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

On this basis, an effective traffic signal control strategy is formulated through reinforcement learning, which solves the shortcomings of the traditional traffic signal control sy

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Traffic signal self-adaptive control method based on deep reinforcement learning
  • Traffic signal self-adaptive control method based on deep reinforcement learning
  • Traffic signal self-adaptive control method based on deep reinforcement learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0034] The present invention will be further described below in conjunction with the drawings.

[0035] Such as figure 1 As shown, a traffic signal adaptive control method based on deep reinforcement learning includes the following steps:

[0036] Step 1. Define traffic signal control agent, state space S, action space A and reward function r, which specifically includes the following sub-steps:

[0037] Step 1.1. The traffic signal control agent uses the deep reinforcement learning method to build a deep neural network Q V As a value network, the initial experience playback memory pool D is empty. The neural network of the present invention uses a convolutional neural network, which is an input layer, 3 convolutional layers, 1 fully connected layer and 4 output layers. The input layer is the current Traffic state s, the output layer is the estimated value of all actions of the current traffic state Q V (s,a); The experience playback memory pool D is used to record transfer samples ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to the technical field of traffic control and artificial intelligence and provides a traffic signal self-adaptive control method based on deep reinforcement learning. The method includes the following steps that 1, a traffic signal control agent, a state space S, a motion space A and a return function r are defined; 2, a deep neutral network is pre-trained; 3, the neutral network is trained through a deep reinforcement learning method; 4, traffic signal control is carried out according to the trained deep neutral network. By preprocessing traffic data acquired by magnetic induction, video, an RFID, vehicle internet and the like, low-layer expression of the traffic state containing vehicle position information is obtained; then the traffic state is perceived through a multilayer perceptron of deep learning, and high-layer abstract features of the current traffic state are obtained; on the basis, a proper timing plan is selected according to the high-layer abstract features of the current traffic state through the decision making capacity of reinforcement learning, self-adaptive control of traffic signals is achieved, the vehicle travel time is shortened accordingly, and safe, smooth, orderly and efficient operation of traffic is guaranteed.

Description

Technical field [0001] The invention relates to a traffic signal adaptive control method based on deep reinforcement learning, and belongs to the technical field of traffic control and artificial intelligence. Background technique [0002] With the increase in car ownership, traffic congestion has become a problem that plagues my country’s economic development. In 2016, the per capita economic loss caused by traffic congestion reached 8,000 yuan. At the same time, it also brings energy issues, traffic safety issues and environmental issues. There are two options for solving traffic congestion, increasing the construction of traffic infrastructure and developing advanced traffic control systems. However, urban land resources are limited and the problem cannot be solved by simply increasing the transportation infrastructure. Therefore, more attention should be paid to the development of advanced traffic control systems. Currently, mainstream traffic control systems include TRANSY...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G08G1/08G06N3/04G06N3/08
CPCG06N3/08G08G1/08G06N3/045
Inventor 谭国真王莹多
Owner DALIAN UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products