Micro-decision-making method for self-driving vehicles based on reinforcement learning

A reinforcement learning and automatic driving technology, applied in neural learning methods, biological neural network models, control devices, etc., can solve the problem of difficult urban roads to show good decision-making performance, not well adapted to environmental dynamic changes, state space and Large behavior space, etc., to achieve the effect of strong universality and portability, easy deployment, and strong feasibility

Active Publication Date: 2021-10-26
BEIJING UNIV OF POSTS & TELECOMM
View PDF9 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0009] Both the finite state machine model and the decision tree model ignore the uncertainty of the environment and cannot adapt well to the dynamic changes of the environment. When more behavior patterns are defined, the state space and behavior space are large, and the judgment logic is complex. The feasibility is not high, and it is difficult to show good decision-making performance in urban roads with rich structural features
[0010] Both the finite state machine model and the decision tree model in the prior art ignore the uncertainty of the environment, and cannot adapt to the dynamic changes of the environment well, and when more behavior patterns are defined, the state space and behavior space are relatively large , the judgment logic is complex, the feasibility is not high, it is difficult to show better decision-making performance in urban roads with rich structural characteristics, and no effective solution has been proposed so far

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Micro-decision-making method for self-driving vehicles based on reinforcement learning
  • Micro-decision-making method for self-driving vehicles based on reinforcement learning
  • Micro-decision-making method for self-driving vehicles based on reinforcement learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0064] The present invention will be further described in detail below in conjunction with the accompanying drawings and specific embodiments.

[0065] Such as Figure 1 to Figure 4 As shown, the self-driving vehicle micro-decision-making method includes the following steps:

[0066] Step 1, reinforcement learning modeling, modeling and representation of automatic driving decision-making scheme:

[0067] In step 1.1, the driving process of the vehicle is defined as a Markov decision process. The autonomous vehicle is regarded as an agent, and the driving environment of the vehicle is regarded as a reinforcement learning environment. The agent vehicle makes driving decisions and Driving behavior, adjust the driving decision based on the driving results, divide the driving time into multiple time slots, each agent vehicle makes a driving decision at the beginning of the time slot, and determine the driving behavior of each agent vehicle in the time slot;

[0068] Step 1.2, use...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a microcosmic decision-making method for an automatic driving vehicle based on reinforcement learning. The method adopts the A3C algorithm of reinforcement learning, the driving behavior is output by the Actor network, the flexibility is strong, and the complexity of the judgment logic is not affected by the size of the state space and the behavior space. The method employs a two-stage training solution process. In the first stage of training, a microcosmic decision-making model for automatic driving is obtained for all road sections to ensure driving safety. In the second stage, the overall model of the first stage is deployed to each road segment, and each road segment trains a single-segment model on this basis, which is portable. Meanwhile, the continuous training in the second stage enables the method to adapt to the influence of various real-time factors. Finally, the distributed communication architecture based on the real Internet of Vehicles system structure is described, which can complete the distributed calculation in the solution process. Therefore, the method can adapt to different road characteristics and dynamic driving environments, and has wide applicability and robustness. sex.

Description

technical field [0001] The invention relates to the technical field of automatic driving, in particular to a microscopic decision-making method for automatic driving vehicles based on reinforcement learning. Background technique [0002] Automated driving technology is one of the core technologies in intelligent transportation. Automated driving decisions are usually divided into two categories. One is the macroscopic path planning problem, that is, after the departure and destination of the vehicle are determined, the driving distance and congestion situation are comprehensively considered. Factors, how to choose the optimal driving route, this kind of problem has a relatively mature solution, another kind of problem is how to drive the vehicle on a certain micro road after the macro driving route is determined. [0003] In the prior art, the micro-decision-making models of self-driving vehicles are divided into the following categories: [0004] Finite state machine model...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): B60W50/00B60W60/00G06N3/04G06N3/08
CPCB60W50/00B60W60/001G06N3/08B60W2050/0028G06N3/047G06N3/045
Inventor 郑侃刘杰赵龙
Owner BEIJING UNIV OF POSTS & TELECOMM
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products