Distributed traffic signal control method based on generative adversarial network and reinforcement learning

A reinforcement learning and traffic signal technology, which is applied in the traffic control system of road vehicles, traffic signal control, traffic control system, etc., can solve the problems of independent operation, low data accumulation efficiency, and poor regional joint control effect, etc., to achieve Improve learning ability and solve the effect of low generation efficiency

Active Publication Date: 2021-09-24
SOUTHEAST UNIV +1
View PDF5 Cites 7 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, there are still several problems: (1) In the distributed signal control, the communication between agents is not enough, which may easily lead to separate operations between intersections, and the effect of regional joint control is not good; (2) A2C as an online strategy Reinforcement learning algorithms need to accumulate data through real-time interaction with the environment, and then use it for model training, so there are disadvantages of low data utilization efficiency and low model training efficiency; (3) when you want the trained A2C model to continue learning in practical applications When the data accumulation efficiency is low, the model parameters cannot be updated in time according to the traffic status.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Distributed traffic signal control method based on generative adversarial network and reinforcement learning
  • Distributed traffic signal control method based on generative adversarial network and reinforcement learning
  • Distributed traffic signal control method based on generative adversarial network and reinforcement learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0055] The present invention will be further described below in conjunction with the accompanying drawings and specific embodiments.

[0056] A distributed traffic signal control method based on generative confrontation network and reinforcement learning disclosed in the embodiment of the present invention is suitable for distributed signal control of regional road traffic. It mainly includes modeling the road traffic environment to define the three elements of agent reinforcement learning (state, action and reward); the agent interacts with the simulation environment to accumulate experience database A policy-based generative adversarial model (P-WGAN-GP) is then constructed and trained to generate a pseudo-database Finally, the reinforcement learning A2C model is constructed, and the interaction mode between generative confrontation model and reinforcement learning is proposed, and the experience database and pseudo-database are used for model parameter training. Specific...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a method for accelerating a reinforcement learning (RL) algorithm by using an improved generative adversarial network (WGAN-GP) and applying the algorithm to regional traffic signal control, and the advantages of the generative adversarial network in the aspect of data generation and the advantages of the reinforcement learning algorithm in the aspect of learning a control strategy are applied to the regional traffic signal control. And the learning speed and effect of the signal control strategy can be effectively improved. The method mainly comprises: giving a control framework of multi-agent reinforcement learning in regional traffic signal control, and meanwhile, defining all elements of reinforcement learning, namely, states, actions, rewards and objective functions; defining a generative adversarial network structure; and proposing a data interaction framework of the generative adversarial network and reinforcement learning.

Description

technical field [0001] The invention relates to the field of traffic management and control, in particular to a distributed traffic signal control method based on generating confrontation networks and reinforcement learning. Background technique [0002] It is generally believed that the adaptive traffic signal control method is one of the effective methods to cope with the increasing traffic demand and relieve road traffic congestion. Compared with the early adaptive signal control methods, such as SCOOT, SCATS, and OPAC, the traffic signal control method based on reinforcement learning can learn the signal control scheme through the interaction with the traffic system without complex calculation formulas. [0003] There have been research attempts to apply actor-critic algorithm (A2C) to distributed traffic signal control. However, there are still several problems: (1) In the distributed signal control, the communication between agents is not enough, which may easily lead...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G08G1/07G08G1/01G06F30/27
CPCG08G1/07G08G1/0125G06F30/27
Inventor 王昊卢云雪董长印杨朝友
Owner SOUTHEAST UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products