Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Automatic driving vehicle microscopic decision-making method based on reinforcement learning

A technology of automatic driving and reinforcement learning, applied in the direction of neural learning methods, biological neural network models, control devices, etc., can solve the problem of difficult urban roads showing better decision-making performance, not well adapted to dynamic changes in the environment, state space and Large behavioral space and other issues, to achieve the effects of strong universality and portability, easy deployment, and strong feasibility

Active Publication Date: 2020-10-30
BEIJING UNIV OF POSTS & TELECOMM
View PDF9 Cites 16 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0009] Both the finite state machine model and the decision tree model ignore the uncertainty of the environment and cannot adapt well to the dynamic changes of the environment. When more behavior patterns are defined, the state space and behavior space are large, and the judgment logic is complex. The feasibility is not high, and it is difficult to show good decision-making performance in urban roads with rich structural features
[0010] Both the finite state machine model and the decision tree model in the prior art ignore the uncertainty of the environment, and cannot adapt to the dynamic changes of the environment well, and when more behavior patterns are defined, the state space and behavior space are relatively large , the judgment logic is complex, the feasibility is not high, it is difficult to show better decision-making performance in urban roads with rich structural characteristics, and no effective solution has been proposed so far

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Automatic driving vehicle microscopic decision-making method based on reinforcement learning
  • Automatic driving vehicle microscopic decision-making method based on reinforcement learning
  • Automatic driving vehicle microscopic decision-making method based on reinforcement learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0064] The present invention will be further described in detail below in conjunction with the accompanying drawings and specific embodiments.

[0065] Such as Figure 1 to Figure 4 As shown, the self-driving vehicle micro-decision-making method includes the following steps:

[0066] Step 1, reinforcement learning modeling, modeling and representation of automatic driving decision-making scheme:

[0067] In step 1.1, the driving process of the vehicle is defined as a Markov decision process. The autonomous vehicle is regarded as an agent, and the driving environment of the vehicle is regarded as a reinforcement learning environment. The agent vehicle makes driving decisions and Driving behavior, adjust the driving decision based on the driving results, divide the driving time into multiple time slots, each agent vehicle makes a driving decision at the beginning of the time slot, and determine the driving behavior of each agent vehicle in the time slot;

[0068] Step 1.2, use...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an automatic driving vehicle microcosmic decision-making method based on reinforcement learning. According to the method, a reinforcement learning A3C algorithm is adopted, driving behaviors are output by an Actor network, the flexibility is high, and the complexity of logic judgment is not affected by state space and behavior space. According to the method, a two-stage training solving process is adopted. In the first stage, an automatic driving microcosmic decision model suitable for all road sections is obtained through training so as to guarantee driving safety. Inthe second stage, the overall model in the first stage is deployed to each road section, and each road section trains a single-road-section model on the basis of the overall model and has transportability. Meanwhile, the continuous training of the second stage enables the method to adapt to the influence of various real-time factors. Finally, distributed communication architecture based on a realInternet of Vehicles system structure is elaborated, and distributed calculation in the solving process can be completed, so that the method can adapt to different road features and dynamic driving environments, and has wide applicability and robustness.

Description

technical field [0001] The invention relates to the technical field of automatic driving, in particular to a microscopic decision-making method for automatic driving vehicles based on reinforcement learning. Background technique [0002] Automated driving technology is one of the core technologies in intelligent transportation. Automated driving decisions are usually divided into two categories. One is the macroscopic path planning problem, that is, after the departure and destination of the vehicle are determined, the driving distance and congestion situation are comprehensively considered. Factors, how to choose the optimal driving route, this kind of problem has a relatively mature solution, another kind of problem is how to drive the vehicle on a certain micro road after the macro driving route is determined. [0003] In the prior art, the micro-decision-making models of self-driving vehicles are divided into the following categories: [0004] Finite state machine model...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): B60W50/00B60W60/00G06N3/04G06N3/08
CPCB60W50/00B60W60/001G06N3/08B60W2050/0028G06N3/047G06N3/045
Inventor 郑侃刘杰赵龙
Owner BEIJING UNIV OF POSTS & TELECOMM
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products