Idle traffic light intelligent control method based on reinforcement learning

A technology of traffic lights and reinforcement learning, which is applied in the field of idle time traffic lights control based on reinforcement learning, can solve problems such as frequent accidents, achieve the effects of low calculation requirements, convenient real-time target detection, and fewer training parameters

Inactive Publication Date: 2020-03-27
TIANJIN UNIV
View PDF9 Cites 7 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, in actual driving, accidents caused by "yellow flash" occur frequently

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Idle traffic light intelligent control method based on reinforcement learning
  • Idle traffic light intelligent control method based on reinforcement learning
  • Idle traffic light intelligent control method based on reinforcement learning

Examples

Experimental program
Comparison scheme
Effect test

specific Embodiment approach

[0023] The specific implementation steps are as follows:

[0024] a) Assume that the intersection is divided into east-west and north-south directions, which are denoted as E-W and S-N respectively. The traffic lights have two display states: E-W is green light, S-N is red light and E-W is red light, S-N is green light, which are recorded as B_E and B_S respectively.

[0025] b) Use the SlimYOLOv3 model to collect real-time traffic flow on the road. Specifically, with the intersection as the center, the roads in each direction are divided into x 1 、x 2 and x 3 three intervals, such as Figure 4 shown. Detect the number of vehicles in each interval based on the front of the vehicle, denoted as n 1 , n 2 and n 3 . The observed state value s at time t t is a six-dimensional vector, s t =[n B1 ,n B2 ,n B3 ,n R1 ,n R2 ,n R3 ]. Among them, n Bi Represents the number of vehicles in the section i in the direction of travel, n Ri Represents the number of vehicles wa...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to an idle traffic light control method based on reinforcement learning, and the method comprises the following steps of employing a SlimYOLOv3 model to sense an environment, analyzing a scene, recognizing all vehicle types of targets in the scene, and positioning the positions of the targets through defining a bounding box around each target; adopting a DQN-based reinforcement learning method to train a traffic light control intelligent agent, a) defining an action space, enabling the traffic lights to randomly select actions according to probabilities, and adopting a greedy algorithm to randomly select the actions according to the probabilities; b) defining a state space, wherein the road surface state observed at any moment is the number of vehicles in different intervals in each direction, and an observation state value is a six-dimensional vector; c) defining a reward function, wherein the penalty weights of the three interval road sections are respectively defined as the specification, and the reward value is the sum of the penalty weights of the road sections; and d) learning a strategy enabling the reward value to be the highest by adopting the DQN-based reinforcement learning method to obtain the traffic light control intelligent agent with high performance.

Description

technical field [0001] The invention belongs to the technical field of intelligent traffic lights, and in particular relates to a method for controlling idle traffic lights based on reinforcement learning. Background technique [0002] With the acceleration of urbanization in China, the scale of cities has gradually expanded. In the field of traffic management, the government and relevant departments are committed to strengthening the construction of urban public transport, improving road layout, and opening up the urban microcirculation. At present, most of the traffic lights at street crossroads in our country adopt a timing switching control method, that is, the switching interval is fixed. However, in idle roads with frequent signal lights, this control method cannot well satisfy the driver's driving experience. For example, when driving at night, there is less traffic on the auxiliary road, and there is often an embarrassing situation where there is no traffic on the ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G08G1/07
CPCG08G1/07
Inventor 金志刚韩玥
Owner TIANJIN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products