Distributed traffic signal control method based on generative adversarial network and reinforcement learning

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A reinforcement learning and traffic signal technology, which is applied in the traffic control system of road vehicles, traffic signal control, traffic control system, etc., can solve the problems of independent operation, low data accumulation efficiency, and poor regional joint control effect, etc., to achieve Improve learning ability and solve the effect of low generation efficiency

Active Publication Date: 2021-09-24

SOUTHEAST UNIV +1

View PDF5 Cites 7 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

However, there are still several problems: (1) In the distributed signal control, the communication between agents is not enough, which may easily lead to separate operations between intersections, and the effect of regional joint control is not good; (2) A2C as an online strategy Reinforcement learning algorithms need to accumulate data through real-time interaction with the environment, and then use it for model training, so there are disadvantages of low data utilization efficiency and low model training efficiency; (3) when you want the trained A2C model to continue learning in practical applications When the data accumulation efficiency is low, the model parameters cannot be updated in time according to the traffic status.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0055] The present invention will be further described below in conjunction with the accompanying drawings and specific embodiments.

[0056] A distributed traffic signal control method based on generative confrontation network and reinforcement learning disclosed in the embodiment of the present invention is suitable for distributed signal control of regional road traffic. It mainly includes modeling the road traffic environment to define the three elements of agent reinforcement learning (state, action and reward); the agent interacts with the simulation environment to accumulate experience database A policy-based generative adversarial model (P-WGAN-GP) is then constructed and trained to generate a pseudo-database Finally, the reinforcement learning A2C model is constructed, and the interaction mode between generative confrontation model and reinforcement learning is proposed, and the experience database and pseudo-database are used for model parameter training. Specific...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a method for accelerating a reinforcement learning (RL) algorithm by using an improved generative adversarial network (WGAN-GP) and applying the algorithm to regional traffic signal control, and the advantages of the generative adversarial network in the aspect of data generation and the advantages of the reinforcement learning algorithm in the aspect of learning a control strategy are applied to the regional traffic signal control. And the learning speed and effect of the signal control strategy can be effectively improved. The method mainly comprises: giving a control framework of multi-agent reinforcement learning in regional traffic signal control, and meanwhile, defining all elements of reinforcement learning, namely, states, actions, rewards and objective functions; defining a generative adversarial network structure; and proposing a data interaction framework of the generative adversarial network and reinforcement learning.

Description

technical field [0001] The invention relates to the field of traffic management and control, in particular to a distributed traffic signal control method based on generating confrontation networks and reinforcement learning. Background technique [0002] It is generally believed that the adaptive traffic signal control method is one of the effective methods to cope with the increasing traffic demand and relieve road traffic congestion. Compared with the early adaptive signal control methods, such as SCOOT, SCATS, and OPAC, the traffic signal control method based on reinforcement learning can learn the signal control scheme through the interaction with the traffic system without complex calculation formulas. [0003] There have been research attempts to apply actor-critic algorithm (A2C) to distributed traffic signal control. However, there are still several problems: (1) In the distributed signal control, the communication between agents is not enough, which may easily lead...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): G08G1/07G08G1/01G06F30/27

CPCG08G1/07G08G1/0125G06F30/27

Inventor 王昊卢云雪董长印杨朝友

Owner SOUTHEAST UNIV

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Distributed traffic signal control method based on generative adversarial network and reinforcement learning

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology