Actor-critic algorithm-based distributed traffic signal lamp joint control method

A joint control technology for traffic signal lights, applied to road-vehicle traffic control systems, traffic flow detection, and related fields. It addresses problems such as oversized Q-value tables, unstable convergence values, and poor computing capability, and achieves smoother road traffic, improved communication volume, and a small amount of calculation.

Active Publication Date: 2020-10-16
NANJING UNIV OF SCI & TECH

AI Technical Summary

Problems solved by technology

[0003] Previous studies mainly focused on the optimal control of a single traffic intersection, ignoring the fact that traffic flows at different intersections in an urban traffic network often affect each other.
On the other


Examples

Embodiment

[0074] In this embodiment, the distributed traffic signal light joint control method based on the actor-critic algorithm includes the following stages:

[0075] The first stage:

[0076] Using graph-theoretic definitions, the network composed of multiple agents is defined as G(ν, ε), where ν is the set of agents (the nodes) and ε is the set of edges between different nodes. For agent i, the set of its associated nodes is defined as N_i, and the shortest path between agent i and agent j (j ∈ N_i) has length d_{i,j}.
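The graph definition above can be illustrated with a minimal Python sketch. It assumes that edges are undirected and unweighted, so that d_{i,j} is a hop count, and that the associated nodes N_i are the directly connected intersections; the source states neither assumption. The function name `shortest_path_lengths` and the 2x2 grid are hypothetical.

```python
from collections import deque

def shortest_path_lengths(edges, source):
    """Breadth-first search: hop count from `source` to every reachable node.

    Assumes an undirected, unweighted graph given as (u, v) pairs.
    """
    adjacency = {}
    for u, v in edges:
        adjacency.setdefault(u, set()).add(v)
        adjacency.setdefault(v, set()).add(u)

    dist = {source: 0}
    queue = deque([source])
    while queue:
        node = queue.popleft()
        for nxt in adjacency.get(node, ()):
            if nxt not in dist:  # first visit gives the shortest hop count
                dist[nxt] = dist[node] + 1
                queue.append(nxt)
    return dist

# Hypothetical 2x2 grid of intersections (agents 0..3).
edges = [(0, 1), (0, 2), (1, 3), (2, 3)]
d_0 = shortest_path_lengths(edges, 0)

# Assumption: N_0 is the set of agents one hop away from agent 0.
neighbors_0 = {j for j, d in d_0.items() if d == 1}
```

If the edges carried travel times or road lengths instead, Dijkstra's algorithm would replace the breadth-first search.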

[0077] The second stage:

[0078] In this paper, the Markov decision process of a single traffic intersection in the traffic signal control system is mathematically modeled. Define its state set, action set, and reward value here as follows:

[0079] (1) State set. Define the local state of each traffic intersection as

[0080]

[0081] where len_t[l] is the queue length on the lane, L_i is the set of all entrance lanes at traffic intersection i, l repr...
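The formula in [0080] is not reproduced on this page, so the following is only a sketch of one plausible reading: the local state collects the queue length len_t[l] for every entrance lane l in L_i. That reading, the function name `local_state`, and the lane identifiers are all assumptions for illustration.

```python
from typing import Dict, List

def local_state(queue_lengths: Dict[str, int], entrance_lanes: List[str]) -> List[int]:
    """Collect the queue length of each entrance lane in a fixed lane order.

    Assumption: the state is the vector of len_t[l] for l in L_i.
    Lanes with no reported queue are treated as empty (length 0).
    """
    return [queue_lengths.get(lane, 0) for lane in entrance_lanes]

# Hypothetical entrance lanes L_i of one intersection.
lanes_i = ["north_in", "south_in", "east_in", "west_in"]

# Hypothetical detector readings at time t; exit lanes are ignored.
observed = {"north_in": 4, "south_in": 0, "east_in": 7, "west_in": 2, "north_out": 1}

state_i = local_state(observed, lanes_i)
```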

Abstract

The invention discloses a distributed traffic signal light joint control method based on the actor-critic algorithm. The method includes the steps of: mathematically modeling a network composed of multiple agents; modeling the Markov decision process of a single traffic intersection in a distributed traffic signal control system, and defining a state set, an action set and a single-step reward value; constructing a multi-agent joint control mode, and establishing communication connections between the agents so that they exchange their respective information; establishing a soft advantage actor-critic algorithm by adding the policy entropy of the next state to the single-step reward value, constructing a value function, and adding an advantage function; and, based on the soft advantage actor-critic algorithm and with the goal of minimizing the average waiting time of vehicles, having the agent at each traffic intersection adopt a joint soft advantage actor-critic algorithm to learn and control the signal lights. Through cooperative control of signal lights at different traffic intersections, the invention improves the overall traffic smoothness of the road network.
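The entropy-augmented single-step reward and the advantage term mentioned in the abstract can be sketched as follows. This is a generic illustration, not the patent's formulation: the temperature `alpha`, the use of the negative waiting time as reward, and all function names are assumptions.

```python
import math
from typing import Sequence

def policy_entropy(action_probs: Sequence[float]) -> float:
    """Shannon entropy H(pi(.|s)) of a discrete action distribution."""
    return -sum(p * math.log(p) for p in action_probs if p > 0.0)

def entropy_augmented_reward(reward: float,
                             next_action_probs: Sequence[float],
                             alpha: float = 0.1) -> float:
    """Add the policy entropy of the next state to the single-step reward.

    `alpha` is an assumed temperature that weights the entropy bonus.
    """
    return reward + alpha * policy_entropy(next_action_probs)

def advantage(q_value: float, state_value: float) -> float:
    """Advantage A(s, a) = Q(s, a) - V(s)."""
    return q_value - state_value

# Hypothetical step: the reward is the negative average waiting time (seconds),
# and the policy at the next state is uniform over two signal phases.
r_soft = entropy_augmented_reward(-12.0, [0.5, 0.5], alpha=0.1)
```

A uniform policy yields the largest entropy bonus, so the augmented reward favours policies that keep exploring while the waiting-time term still dominates.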

Description

Technical field

[0001] The invention relates to the technical field of adaptive traffic signal control (ATSC), and in particular to a method for joint control of distributed traffic signal lights based on the actor-critic algorithm.

Background

[0002] With the deepening of urbanization, most cities face serious traffic congestion. A congested road traffic environment not only damages the environment but also has a large negative impact on the social economy. Because little space for road expansion is reserved in urban planning, because building transportation infrastructure in the city has a large impact, and because the number of vehicles per capita keeps increasing, the problem will become more difficult. In this case, optimizing the control technology of signal lights is an easy and economical way to alleviate the problem. Compared with the traditional timing scheme for adjusting dif...

Application Information

IPC(8): G08G1/081, G08G1/01
CPC: G08G1/081, G08G1/0145, Y02T10/40
Inventors: 王天誉, 梁腾, 张杰, 李骏
Owner: NANJING UNIV OF SCI & TECH