Traffic signal self-adaptive control method based on deep reinforcement learning

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
An adaptive control and traffic signal technology, applied in traffic signal control, neural learning methods, biological neural network models, etc., can solve the problems of incomplete traffic state perception, inaccurate signal control strategy formulation, etc., to achieve traffic state perception Inaccurate, to achieve the effect of precise perception

Inactive Publication Date: 2017-06-30

DALIAN UNIV OF TECH

View PDF8 Cites 89 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

On this basis, an effective traffic signal control strategy is formulated through reinforcement learning, which solves the shortcomings of the traditional traffic signal control system, such as incomplete perception of traffic conditions and inaccurate formulation of signal control strategies, and realizes adaptive control of traffic signals at intersections.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0034] The present invention will be further described below in conjunction with the drawings.

[0035] Such as figure 1 As shown, a traffic signal adaptive control method based on deep reinforcement learning includes the following steps:

[0036] Step 1. Define traffic signal control agent, state space S, action space A and reward function r, which specifically includes the following sub-steps:

[0037] Step 1.1. The traffic signal control agent uses the deep reinforcement learning method to build a deep neural network Q V As a value network, the initial experience playback memory pool D is empty. The neural network of the present invention uses a convolutional neural network, which is an input layer, 3 convolutional layers, 1 fully connected layer and 4 output layers. The input layer is the current Traffic state s, the output layer is the estimated value of all actions of the current traffic state Q V (s,a); The experience playback memory pool D is used to record transfer samples ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention relates to the technical field of traffic control and artificial intelligence and provides a traffic signal self-adaptive control method based on deep reinforcement learning. The method includes the following steps that 1, a traffic signal control agent, a state space S, a motion space A and a return function r are defined; 2, a deep neutral network is pre-trained; 3, the neutral network is trained through a deep reinforcement learning method; 4, traffic signal control is carried out according to the trained deep neutral network. By preprocessing traffic data acquired by magnetic induction, video, an RFID, vehicle internet and the like, low-layer expression of the traffic state containing vehicle position information is obtained; then the traffic state is perceived through a multilayer perceptron of deep learning, and high-layer abstract features of the current traffic state are obtained; on the basis, a proper timing plan is selected according to the high-layer abstract features of the current traffic state through the decision making capacity of reinforcement learning, self-adaptive control of traffic signals is achieved, the vehicle travel time is shortened accordingly, and safe, smooth, orderly and efficient operation of traffic is guaranteed.

Description

Technical field [0001] The invention relates to a traffic signal adaptive control method based on deep reinforcement learning, and belongs to the technical field of traffic control and artificial intelligence. Background technique [0002] With the increase in car ownership, traffic congestion has become a problem that plagues my country’s economic development. In 2016, the per capita economic loss caused by traffic congestion reached 8,000 yuan. At the same time, it also brings energy issues, traffic safety issues and environmental issues. There are two options for solving traffic congestion, increasing the construction of traffic infrastructure and developing advanced traffic control systems. However, urban land resources are limited and the problem cannot be solved by simply increasing the transportation infrastructure. Therefore, more attention should be paid to the development of advanced traffic control systems. Currently, mainstream traffic control systems include TRANSY...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G08G1/08G06N3/04G06N3/08

CPCG06N3/08G08G1/08G06N3/045

Inventor 谭国真王莹多

Owner DALIAN UNIV OF TECH

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Traffic signal self-adaptive control method based on deep reinforcement learning

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology