A traffic signal adaptive control method based on deep reinforcement learning

An adaptive control, traffic signal technology, applied in the direction of traffic signal control, neural learning method, biological neural network model, etc., can solve the problems of inaccurate signal control strategy formulation, incomplete traffic state perception, etc., to solve the problem of traffic state perception Inaccurate, to achieve the effect of accurate perception

Inactive Publication Date: 2019-10-11
DALIAN UNIV OF TECH
View PDF8 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

On this basis, an effective traffic signal control strategy is formulated through reinforcement learning, which solves the shortcomings of the traditional traffic signal control system, such as incomplete perception of traffic conditions and inaccurate formulation of signal control strategies, and realizes adaptive control of traffic signals at intersections.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A traffic signal adaptive control method based on deep reinforcement learning
  • A traffic signal adaptive control method based on deep reinforcement learning
  • A traffic signal adaptive control method based on deep reinforcement learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0034] The present invention will be further described below in conjunction with accompanying drawing.

[0035] Such as figure 1 As shown, a traffic signal adaptive control method based on deep reinforcement learning, including the following steps:

[0036] Step 1. Define traffic signal control Agent, state space S, action space A and reward function r, specifically including the following sub-steps:

[0037] Step 1.1, the traffic signal control agent uses the deep reinforcement learning method to build a deep neural network Q V For the value network, the initialization experience playback memory pool D is empty, and the neural network of the present invention adopts a convolutional neural network, followed by an input layer, 3 convolutional layers, 1 fully connected layer and 4 output layers, and the input layer is the current The traffic state s, the output layer estimates Q for the value of all actions in the current traffic state V (s, a); experience playback memory poo...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The present invention relates to the technical field of traffic control and artificial intelligence. A traffic signal adaptive control method based on deep reinforcement learning comprises the following steps: (1) defining a traffic signal control agent, a state space S, an action space A and a reward function r, (2) Pre-training of the deep neural network, (3) training of the neural network using a deep reinforcement learning method, and (4) traffic signal control based on the trained deep neural network. By preprocessing the traffic data collected by magnetic sensing, video, RFID, and Internet of Vehicles, etc., the low-level representation of the traffic state including vehicle location information is obtained; secondly, the traffic state is perceived by the multi-layer perceptron of deep learning, and the obtained The high-level abstract features of the current traffic state; on this basis, use the decision-making ability of reinforcement learning to select the appropriate timing scheme according to the high-level abstract features of the current traffic state, and realize the adaptive control of traffic signals to reduce vehicle travel time and ensure traffic safety. Smooth, orderly and efficient operation.

Description

technical field [0001] The invention relates to a traffic signal adaptive control method based on deep reinforcement learning, which belongs to the technical fields of traffic control and artificial intelligence. Background technique [0002] With the increase of car ownership, traffic congestion has become a problem that plagues my country's economic development. In 2016, the per capita economic loss caused by traffic congestion reached 8,000 yuan. At the same time, it also brings energy problems, traffic safety problems and environmental problems. There are two options for solving traffic congestion, increasing traffic infrastructure construction and developing advanced traffic control systems. However, urban land resources are tight and limited, and simply relying on increasing traffic infrastructure cannot solve the problem. Therefore, more attention should be paid to the development of advanced traffic control systems. Currently, mainstream traffic control systems inc...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G08G1/08G06N3/04G06N3/08
CPCG06N3/08G08G1/08G06N3/045
Inventor 谭国真王莹多
Owner DALIAN UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products