An adaptive traffic signal control system and method based on deep reinforcement learning

A reinforcement learning and traffic signal technology, applied in the field of intelligent transportation, can solve the problems of inability to fully consider various potential information, low control strategy efficiency, and low learning efficiency, so as to improve feature extraction ability and generalization ability, and improve traffic efficiency , Improve the effect of road network efficiency

Active Publication Date: 2021-06-08
BEIJING JIAOTONG UNIV
View PDF7 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the adaptive signal control scheme represented by Q-learning usually uses artificial feature variables as the traffic state, which simplifies the complexity of expressing the traffic state and cannot fully consider various potential information of the traffic state; secondly, the core of Q-learning is the state -The mapping relationship of the behavior value table leads to Q-learning leading to a large state space, low learning efficiency, and low efficiency of control strategy

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • An adaptive traffic signal control system and method based on deep reinforcement learning
  • An adaptive traffic signal control system and method based on deep reinforcement learning
  • An adaptive traffic signal control system and method based on deep reinforcement learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0070] In order to make the purpose, technical solution and advantages of the present invention more clear, the present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0071] On the contrary, the invention covers any alternatives, modifications, equivalent methods and schemes within the spirit and scope of the invention as defined by the claims. Further, in order to make the public have a better understanding of the present invention, some specific details are described in detail in the following detailed description of the present invention. The present invention can be fully understood by those skilled in the art without the description of these detailed parts.

[0072] The present invention provides an adaptive intersection traffic signal control system b...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention belongs to the field of intelligent transportation, and proposes an adaptive traffic signal control system and method based on deep reinforcement learning. The present invention utilizes the interaction module to realize the real-time interaction between the intersection environment and the controller, that is, the state perception module collects the traffic state of the intersection in real time and provides an optimal decision-making scheme under the current traffic state through the control decision-making module; at the same time, the present invention can be updated by The module adopts the framework of reinforcement learning to continuously update the control core (Q value network) inside the controller to further improve the optimization effect of future control schemes. The present invention can comprehensively collect various influencing factors in two dimensions of time and space; use the recurrent neural network to improve the feature extraction ability and generalization ability for high-dimensional input matrix; can realize the complexity, Real-time, dynamic, random, adaptable and other requirements to improve the efficiency of traffic control at intersections and reduce travel delays.

Description

technical field [0001] The invention relates to the field of intelligent transportation, in particular to an adaptive traffic signal control system and method based on deep reinforcement learning. Background technique [0002] With the deepening of China's urbanization process, the urban population and vehicles continue to increase, so urban traffic management needs to propose an adaptive urban traffic signal control method that can meet the dynamic needs. The salient features of the urban traffic system are: the dynamic fluctuation of traffic demand, the instability of time and space, the diversity of influencing factors, and the complexity of control strategies. [0003] The adaptive control methods in the prior art mostly adopt control system methods such as fuzzy control, neural network, and genetic algorithm. The adaptive control scheme in the prior art has the following characteristics: due to calculation conditions and modeling reasons, most adaptive control The sche...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G08G1/01G08G1/07
CPCG08G1/0145G08G1/07
Inventor 卫翀李殊荣闫学东马路邵春福
Owner BEIJING JIAOTONG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products