Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Unprotected left-turn driving control method based on deep reinforcement learning

A technology of reinforcement learning and control methods, applied in neural learning methods, biological neural network models, neural architectures, etc., can solve problems such as limiting model training efficiency and error correction ability, and achieve the effect of improving interpretability

Active Publication Date: 2021-09-07
CHONGQING UNIV
View PDF22 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, due to the inexplicability of the deep neural network model, the training efficiency and error correction ability of the model are greatly limited.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Unprotected left-turn driving control method based on deep reinforcement learning
  • Unprotected left-turn driving control method based on deep reinforcement learning
  • Unprotected left-turn driving control method based on deep reinforcement learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0042] Embodiment 1: as Figure 1 to Figure 5 As shown, the present invention provides an unprotected left-turn driving control method based on deep reinforcement learning. In order to create enough unprotected LTAP / OD events in one simulation, two identical closed road loop simulation scenarios were constructed ,Such as Figure 2a shown. It can be seen that after the two vehicles pass through the target intersection (frame selection position) of concern, they will come back through the loop to form a loop. Any number of unprotected LTAP / OD events can be obtained by setting an appropriate simulation run time. For the training process of deep reinforcement learning, each unprotected LTAP / OD event becomes an episode (Episode). Training and testing also need to use the scene of multiple straight vehicles (see Figure 2b ) and three candidate paths (see Figure 2c ).

[0043]Agents are expected to master human-like negotiation skills to handle complex unprotected LTAP / OD eve...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an unprotected left-turn driving control method based on deep reinforcement learning, and the method comprises the following steps: 1, building a simulation and training environment: 1) building two same closed road environment simulation scenes; 2) setting proper simulation running time, and generating any number of unprotected LTAP / OD events; 3) setting a plurality of straight vehicles and three left-turn vehicle candidate paths; 2, designing a reward function, and processing a complex unprotected LTAP / OD event by adopting the driving skill of a human driver; 3, designing a strategy structure, updating parameters of the deep convolution fuzzy system by using a learning algorithm, and searching an optimal value function; and 4, designing a learning algorithm, and improving the training efficiency by adopting the data of the human driver and a deep convolution fuzzy system algorithm. The combination of the driving skill of the human driver and the deep convolution fuzzy system algorithm effectively improves the interpretability of the deep reinforcement learning algorithm, the error correction capability of the training efficiency and the passing efficiency of the vehicle.

Description

technical field [0001] The invention belongs to the field of motion control of middle and high-level automatic driving vehicles, and in particular relates to a method for training an unprotected left-turn control model used to generate an automatic driving strategy. Background technique [0002] Scenarios where a straight-going vehicle (SDV) and a left-turning vehicle (TV) are traveling in opposite directions at an intersection without traffic signals or other stop signs (LTAP / OD, figure 1 ), efficiently and safely complete an unprotected left turn, which is highly challenging not only for autonomous vehicles, but also for human drivers. Existing self-driving cars pay more attention to the robustness of the algorithm when completing an unprotected left turn, and mostly rely on manual customization of rules, often adopting an overly conservative strategy. Although the safety is guaranteed to a certain extent, the traffic efficiency is low low. In contrast, experienced human...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): B60W60/00G06N3/04G06N3/08
CPCB60W60/001G06N3/08G06N3/043G06N3/045Y02T10/40
Inventor 赵敏孙棣华陈进
Owner CHONGQING UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products