Automatic driving vehicle microscopic decision-making method based on reinforcement learning

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A technology of automatic driving and reinforcement learning, applied in the direction of neural learning methods, biological neural network models, control devices, etc., can solve the problem of difficult urban roads showing better decision-making performance, not well adapted to dynamic changes in the environment, state space and Large behavioral space and other issues, to achieve the effects of strong universality and portability, easy deployment, and strong feasibility

Active Publication Date: 2020-10-30

BEIJING UNIV OF POSTS & TELECOMM

View PDF9 Cites 16 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0009] Both the finite state machine model and the decision tree model ignore the uncertainty of the environment and cannot adapt well to the dynamic changes of the environment. When more behavior patterns are defined, the state space and behavior space are large, and the judgment logic is complex. The feasibility is not high, and it is difficult to show good decision-making performance in urban roads with rich structural features

[0010] Both the finite state machine model and the decision tree model in the prior art ignore the uncertainty of the environment, and cannot adapt to the dynamic changes of the environment well, and when more behavior patterns are defined, the state space and behavior space are relatively large , the judgment logic is complex, the feasibility is not high, it is difficult to show better decision-making performance in urban roads with rich structural characteristics, and no effective solution has been proposed so far

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0064] The present invention will be further described in detail below in conjunction with the accompanying drawings and specific embodiments.

[0065] Such as Figure 1 to Figure 4 As shown, the self-driving vehicle micro-decision-making method includes the following steps:

[0066] Step 1, reinforcement learning modeling, modeling and representation of automatic driving decision-making scheme:

[0067] In step 1.1, the driving process of the vehicle is defined as a Markov decision process. The autonomous vehicle is regarded as an agent, and the driving environment of the vehicle is regarded as a reinforcement learning environment. The agent vehicle makes driving decisions and Driving behavior, adjust the driving decision based on the driving results, divide the driving time into multiple time slots, each agent vehicle makes a driving decision at the beginning of the time slot, and determine the driving behavior of each agent vehicle in the time slot;

[0068] Step 1.2, use...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses an automatic driving vehicle microcosmic decision-making method based on reinforcement learning. According to the method, a reinforcement learning A3C algorithm is adopted, driving behaviors are output by an Actor network, the flexibility is high, and the complexity of logic judgment is not affected by state space and behavior space. According to the method, a two-stage training solving process is adopted. In the first stage, an automatic driving microcosmic decision model suitable for all road sections is obtained through training so as to guarantee driving safety. Inthe second stage, the overall model in the first stage is deployed to each road section, and each road section trains a single-road-section model on the basis of the overall model and has transportability. Meanwhile, the continuous training of the second stage enables the method to adapt to the influence of various real-time factors. Finally, distributed communication architecture based on a realInternet of Vehicles system structure is elaborated, and distributed calculation in the solving process can be completed, so that the method can adapt to different road features and dynamic driving environments, and has wide applicability and robustness.

Description

technical field [0001] The invention relates to the technical field of automatic driving, in particular to a microscopic decision-making method for automatic driving vehicles based on reinforcement learning. Background technique [0002] Automated driving technology is one of the core technologies in intelligent transportation. Automated driving decisions are usually divided into two categories. One is the macroscopic path planning problem, that is, after the departure and destination of the vehicle are determined, the driving distance and congestion situation are comprehensively considered. Factors, how to choose the optimal driving route, this kind of problem has a relatively mature solution, another kind of problem is how to drive the vehicle on a certain micro road after the macro driving route is determined. [0003] In the prior art, the micro-decision-making models of self-driving vehicles are divided into the following categories: [0004] Finite state machine model...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): B60W50/00B60W60/00G06N3/04G06N3/08

CPCB60W50/00B60W60/001G06N3/08B60W2050/0028G06N3/047G06N3/045

Inventor 郑侃刘杰赵龙

Owner BEIJING UNIV OF POSTS & TELECOMM

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Automatic driving vehicle microscopic decision-making method based on reinforcement learning

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology