A traffic signal adaptive control method based on deep reinforcement learning

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
An adaptive control, traffic signal technology, applied in the direction of traffic signal control, neural learning method, biological neural network model, etc., can solve the problems of inaccurate signal control strategy formulation, incomplete traffic state perception, etc., to solve the problem of traffic state perception Inaccurate, to achieve the effect of accurate perception

Inactive Publication Date: 2019-10-11

DALIAN UNIV OF TECH

View PDF8 Cites 1 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

On this basis, an effective traffic signal control strategy is formulated through reinforcement learning, which solves the shortcomings of the traditional traffic signal control system, such as incomplete perception of traffic conditions and inaccurate formulation of signal control strategies, and realizes adaptive control of traffic signals at intersections.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0034] The present invention will be further described below in conjunction with accompanying drawing.

[0035] Such as figure 1 As shown, a traffic signal adaptive control method based on deep reinforcement learning, including the following steps:

[0036] Step 1. Define traffic signal control Agent, state space S, action space A and reward function r, specifically including the following sub-steps:

[0037] Step 1.1, the traffic signal control agent uses the deep reinforcement learning method to build a deep neural network Q V For the value network, the initialization experience playback memory pool D is empty, and the neural network of the present invention adopts a convolutional neural network, followed by an input layer, 3 convolutional layers, 1 fully connected layer and 4 output layers, and the input layer is the current The traffic state s, the output layer estimates Q for the value of all actions in the current traffic state V (s, a); experience playback memory poo...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The present invention relates to the technical field of traffic control and artificial intelligence. A traffic signal adaptive control method based on deep reinforcement learning comprises the following steps: (1) defining a traffic signal control agent, a state space S, an action space A and a reward function r, (2) Pre-training of the deep neural network, (3) training of the neural network using a deep reinforcement learning method, and (4) traffic signal control based on the trained deep neural network. By preprocessing the traffic data collected by magnetic sensing, video, RFID, and Internet of Vehicles, etc., the low-level representation of the traffic state including vehicle location information is obtained; secondly, the traffic state is perceived by the multi-layer perceptron of deep learning, and the obtained The high-level abstract features of the current traffic state; on this basis, use the decision-making ability of reinforcement learning to select the appropriate timing scheme according to the high-level abstract features of the current traffic state, and realize the adaptive control of traffic signals to reduce vehicle travel time and ensure traffic safety. Smooth, orderly and efficient operation.

Description

technical field [0001] The invention relates to a traffic signal adaptive control method based on deep reinforcement learning, which belongs to the technical fields of traffic control and artificial intelligence. Background technique [0002] With the increase of car ownership, traffic congestion has become a problem that plagues my country's economic development. In 2016, the per capita economic loss caused by traffic congestion reached 8,000 yuan. At the same time, it also brings energy problems, traffic safety problems and environmental problems. There are two options for solving traffic congestion, increasing traffic infrastructure construction and developing advanced traffic control systems. However, urban land resources are tight and limited, and simply relying on increasing traffic infrastructure cannot solve the problem. Therefore, more attention should be paid to the development of advanced traffic control systems. Currently, mainstream traffic control systems inc...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Patents(China)

IPC IPC(8): G08G1/08G06N3/04G06N3/08

CPCG06N3/08G08G1/08G06N3/045

Inventor 谭国真王莹多

Owner DALIAN UNIV OF TECH

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

A traffic signal adaptive control method based on deep reinforcement learning

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology