Traffic organization scheme optimization method based on multi-signal lamp reinforcement learning
A technology of reinforcement learning and optimization methods, applied in traffic control systems of road vehicles, machine learning, traffic signal control, etc., can solve problems such as model convergence and speed instability, and achieve the effect of improving the smooth flow rate
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0062] This embodiment is a multi-intersection traffic organization plan optimization method based on multi-signal reinforced learning, using multi-agents, Actor-Critic network, Subnet network, and trajectory reconstruction to improve the traffic flow rate of the road network. A multi-agent environment is one in which there are multiple intelligent entities in each step, such as figure 1 Shown is the difference between multi-agent and single-agent environments.
[0063] First construct an Actor network. The traffic road network contains multiple intersections, and the signal lights at each intersection correspond to an agent. Multiple agents need to construct multiple corresponding Actor networks. The Actor network includes a state space set and a behavior space set.
[0064] Through the program in the traffic lights to change the state of the road, to achieve a certain sense of short-term road closure for traffic control. In this embodiment, proceeding from the actual situat...
Embodiment 2
[0072] This embodiment is a method for optimizing a traffic organization scheme based on reinforcement learning of multi-signal lights for a single intersection. The simulation platform used in this embodiment is SUMO. SUMO is an open source road simulator, which can meet the collection of relevant data required in the simulation experiment, as well as the simulation of traffic behavior and the required road network construction. The most important thing is to The timing data of traffic lights can be collected. The development IDE tool for writing code is Pycharm, and Tensorflow-gpu-1.4.0 version and Numpy are used to complete the relevant reinforcement learning and neural network construction. The above extensions need to be improved, and the second most important thing is to implement SUMO Traci Traffic control interface, Traci can help to expand the dynamic control of traffic lights, can call SUMO simulation tools, obtain individual vehicle information, and obtain detailed ...
Embodiment 9
[0075] In the experimental model of 9-grid multi-intersection in this embodiment, each rectangle represents a signalized intersection, and every two adjacent intersections are connected by two lanes.
[0076] In the setting of this embodiment, the following parameter settings need to be completed in the SUMO simulation software. In the 9-grid environment, a total of 7,000 vehicles enter the simulation system. The model sets the initial vehicles to 50 vehicles, and the shortest vehicle There are 2 driving paths, the longest vehicle driving path is 7, and the random seed parameter is set to 10.
[0077] After the experimental model is built, the action mode of each agent is constructed according to its own behavior mode. Under the original conditions, the total waiting time of cars in this environment is 24732 seconds. There are 21 pairs of OD pairs in this experimental traffic environment. In the original environment, the traffic volume in the lower right area of the 9th grid...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com