Traffic light control method based on heuristic deep Q network

A heuristic, traffic light technology, applied in the control of traffic signals, neural learning methods, biological neural network models, etc., can solve the problems of slow convergence speed of deep reinforcement learning algorithm, difficult to learn traffic light control strategies, etc., to avoid Traffic congestion situation, stress relief, effect of improving learning efficiency

Active Publication Date: 2020-09-22
XIDIAN UNIV
View PDF11 Cites 28 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, in this model, each agent learns independently and does not cooperate with each other, which makes the convergence speed of the deep reinforcement learning algorithm very slow, and it is difficult to learn an optimal traffic light control strategy.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Traffic light control method based on heuristic deep Q network
  • Traffic light control method based on heuristic deep Q network
  • Traffic light control method based on heuristic deep Q network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0037] The examples and effects of the present invention will be described in further detail below in conjunction with the accompanying drawings.

[0038] refer to figure 1 , the implementation steps of this example are as follows:

[0039] Step 1. Read the urban traffic network information, establish a collection of vehicle traffic states at each intersection, and convert the read urban traffic network information into an adjacency matrix for storage.

[0040] 1.1) Generate urban traffic road network information through the open source traffic simulation software GLD, such as image 3 As shown in , the vehicle traffic state set T of each intersection is established i :

[0041] 1.1.1) Construct the set I of intersection traffic signal controllers according to traffic road network information:

[0042] I={agent 0 ,···agent i ,···agent n},

[0043] where agent i Indicates the traffic signal controller of the i-th intersection, i∈[0,n], n is the number of intersections...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a multi-intersection traffic signal control method based on heuristic deep Q learning, and mainly solves problems of correlation of training data, incapability of rapid convergence of a traffic light control strategy and low control efficiency in an existing method. The method comprises steps that the urban traffic network information is read, a vehicle traffic state set ofeach intersection is established, and the read urban traffic network information is converted into an adjacent matrix to be stored; a state set, an action set and an action reward value of each intersection are acquired from a vehicle traffic state set of each intersection; according to the state set, the action set, the action reward value and the adjacency matrix, a heuristic deep Q network method is used for continuously executing actions according to the state of each intersection to obtain rewards and then to the next state, and urban road network traffic lights are controlled. The method can improve control efficiency of intersection traffic signal lamps, improves performance of the multi-intersection traffic signal controller, can be used for urban traffic management, and reduces the urban traffic congestion.

Description

technical field [0001] The invention belongs to the field of intelligent traffic control, in particular to a traffic light control method, which can be used for urban traffic management and reduce urban traffic congestion. Background technique [0002] In the field of urban intelligent traffic control, deep learning and reinforcement learning are currently very popular research directions and have achieved good results. Reinforcement learning is to learn the optimal traffic control strategy by continuously interacting with the urban road environment to obtain the environmental state, and form an adaptive control system for urban traffic. However, due to the increase in the complexity of the urban road environment, the dimension of the state-action space will increase dramatically in the process of acquiring prior knowledge. In order to solve such problems, the deep reinforcement learning DRL formed by the combination of reinforcement learning and deep learning uses both the...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G08G1/07G06K9/62G06N3/04G06N3/08
CPCG08G1/07G06N3/08G06N3/045G06F18/214
Inventor 方敏徐维刘超葛领驰陈博
Owner XIDIAN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products