Idle traffic light intelligent control method based on reinforcement learning

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A technology of traffic lights and reinforcement learning, which is applied in the field of idle time traffic lights control based on reinforcement learning, can solve problems such as frequent accidents, achieve the effects of low calculation requirements, convenient real-time target detection, and fewer training parameters

Inactive Publication Date: 2020-03-27

TIANJIN UNIV

View PDF9 Cites 7 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

However, in actual driving, accidents caused by "yellow flash" occur frequently

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

specific Embodiment approach

[0023] The specific implementation steps are as follows:

[0024] a) Assume that the intersection is divided into east-west and north-south directions, which are denoted as E-W and S-N respectively. The traffic lights have two display states: E-W is green light, S-N is red light and E-W is red light, S-N is green light, which are recorded as B_E and B_S respectively.

[0025] b) Use the SlimYOLOv3 model to collect real-time traffic flow on the road. Specifically, with the intersection as the center, the roads in each direction are divided into x 1 、x 2 and x 3 three intervals, such as Figure 4 shown. Detect the number of vehicles in each interval based on the front of the vehicle, denoted as n 1 , n 2 and n 3 . The observed state value s at time t t is a six-dimensional vector, s t =[n B1 ,n B2 ,n B3 ,n R1 ,n R2 ,n R3 ]. Among them, n Bi Represents the number of vehicles in the section i in the direction of travel, n Ri Represents the number of vehicles wa...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention relates to an idle traffic light control method based on reinforcement learning, and the method comprises the following steps of employing a SlimYOLOv3 model to sense an environment, analyzing a scene, recognizing all vehicle types of targets in the scene, and positioning the positions of the targets through defining a bounding box around each target; adopting a DQN-based reinforcement learning method to train a traffic light control intelligent agent, a) defining an action space, enabling the traffic lights to randomly select actions according to probabilities, and adopting a greedy algorithm to randomly select the actions according to the probabilities; b) defining a state space, wherein the road surface state observed at any moment is the number of vehicles in different intervals in each direction, and an observation state value is a six-dimensional vector; c) defining a reward function, wherein the penalty weights of the three interval road sections are respectively defined as the specification, and the reward value is the sum of the penalty weights of the road sections; and d) learning a strategy enabling the reward value to be the highest by adopting the DQN-based reinforcement learning method to obtain the traffic light control intelligent agent with high performance.

Description

technical field [0001] The invention belongs to the technical field of intelligent traffic lights, and in particular relates to a method for controlling idle traffic lights based on reinforcement learning. Background technique [0002] With the acceleration of urbanization in China, the scale of cities has gradually expanded. In the field of traffic management, the government and relevant departments are committed to strengthening the construction of urban public transport, improving road layout, and opening up the urban microcirculation. At present, most of the traffic lights at street crossroads in our country adopt a timing switching control method, that is, the switching interval is fixed. However, in idle roads with frequent signal lights, this control method cannot well satisfy the driver's driving experience. For example, when driving at night, there is less traffic on the auxiliary road, and there is often an embarrassing situation where there is no traffic on the ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): G08G1/07

CPCG08G1/07

Inventor 金志刚韩玥

Owner TIANJIN UNIV

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Idle traffic light intelligent control method based on reinforcement learning

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

specific Embodiment approach

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology