Multi-intersection signal lamp cooperative control method based on deep reinforcement learning

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A multi-intersection, collaborative control technology, applied in the traffic control system of road vehicles, traffic control system, traffic flow detection, etc., can solve the problems of low efficiency of reinforcement learning training, deep neural network consumption, large computing resources, etc., to achieve The effect of improving model accuracy and convergence speed, simple structure, and improving operating efficiency

Active Publication Date: 2019-07-26

TSINGHUA UNIV

View PDF9 Cites 42 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

This method is characterized by complete information and will not cause loss of information. However, due to the low training efficiency of reinforcement learning, the deep neural network itself will consume more computing resources, so the efficiency is low.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment 1

[0118]The model of the present invention combines two training methods, and the specific details are shown in Algorithm 1. When it needs to be emphasized, the new policy network in the reinforcement learning of the present invention is the same network as the policy network in the imitation learning. In training, first use imitation learning to imitate several times until the accuracy Acc reaches a threshold ξ, and then use reinforcement learning for further training. In the model corresponding to the single intersection experimental environment of the present invention, ξ=0.9 is taken. In the multi-intersection environment, since the expert strategy adopted in the present invention does not consider the coordination of multi-intersections, the present invention sets ξ=0.7 to encourage exploration.

[0119] Algorithm 1 A signal light control model that combines imitation learning and reinforcement learning

[0120] Initialize a new policy network π′ and old policy network π ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention relates to a multi-intersection signal lamp cooperative control method based on deep reinforcement learning, which comprises the following steps of: 1) establishing a multi-channel tensor capable of reflecting the original traffic state of the traffic network according to the actual condition of the multi-intersection traffic network; 2) establishing a multi-intersection cooperativecontrol neural network model according to the obtained multi-channel tensor of the multi-intersection traffic network; 3) training the established multi-intersection cooperative control neural networkmodel by adopting a method of combining simulation learning and reinforcement learning to obtain a trained multi-intersection cooperative control neural network model; and 4) inputting the phase information of the current multi-intersection into the trained multi-intersection cooperative control neural network model to obtain a cooperative control output result of the current multi-intersection signal lamp. The method can be widely applied to the field of multi-intersection signal lamp cooperative control.

Description

technical field [0001] The present invention relates to the technical field of collaborative control of multi-intersection signal lights in traffic road networks, in particular to a method for collaborative control of multi-intersection signal lights based on deep reinforcement learning. Modeling is carried out to form a new multi-intersection signal light collaborative control scheme. Background technique [0002] As the hub and key node of the urban traffic network, the intersection has a decisive influence on the operation efficiency of the traffic network. Therefore, optimizing the phase duration of intersection signal lights can greatly improve the operational efficiency of the existing urban traffic network. With the continuous development of intelligent transportation-related technologies such as cloud computing and 5G, real-time control of the transportation network through the central control system has become more and more feasible. Vehicle-to-vehicle (V2V) and v...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G08G1/01G08G1/081G08G1/095G08G1/096G06N3/04

CPCG08G1/081G08G1/0125G08G1/012G08G1/095G08G1/096G06N3/045

Inventor 胡坚明霍雨森裴欣张佐姚丹亚

Owner TSINGHUA UNIV

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Multi-intersection signal lamp cooperative control method based on deep reinforcement learning

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment 1

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology