Intelligent vehicle speed decision-making method based on deep reinforcement learning and simulation method thereof

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A technology of reinforcement learning and decision-making methods, applied in neural learning methods, design optimization/simulation, biological neural network models, etc. and other problems to achieve the effect of easy verification

Active Publication Date: 2020-11-06

JILIN UNIV

View PDF10 Cites 26 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

Existing intelligent vehicle decision-making methods have more or less defects. For example, the rule-based method is mostly suitable for simple scenarios, and it is difficult to be competent for the urban road environment with rich structural features; the decision tree method based on the polling mechanism requires Decision network is defined offline for each driving scenario; information / ontology-based decision-making inference methods rely on vehicle-to-vehicle (V2V) communication and require full knowledge of other vehicles’ information (including driving intentions)

[0004] At the same time, the above methods also have the common problem of ignoring the dynamics and uncertainty of the environment. However, the real traffic environment often has strong uncertainties. The relationship between vehicles and vehicles, vehicles and roads is intricate and changing in real time. It is difficult to achieve safe and efficient decision-making, affecting the intelligent driving of vehicles, and even causing traffic accidents

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment 1

[0067] Such as figure 1 As shown, a smart car speed decision method based on deep reinforcement learning includes the following steps:

[0068] Step 1. Construct state space S, action space A, and immediate reward space R

[0069] Markov decision model, which can be represented by a quaternion , which are state space, action space, state transition function, and immediate reward. In this method, no state transition function is involved, so only the state space S, the action space A, and the immediate reward space R need to be constructed.

[0070] The composition of the state space includes the state of the smart car (this car) and the state of other cars. The state space is constructed as follows:

[0071] S=[s ego ,s 1 ,s 2 ,...,s n ]

[0072] where s ego Indicates the state of the vehicle, s 1 -s n Indicates the status of other vehicles in the current traffic scene, and n indicates the number of other vehicles.

[0073] Regardless of whether it is the car or anot...

Embodiment 2

[0113] A simulation method of a smart car speed decision-making method based on deep reinforcement learning, which is based on a DQN (Deep-Q-Learning) simulation system built by matlab automatic driving toolbox for simulation experiments, including the following steps:

[0114] First build the road environment for automatic driving decision-making, which is realized by the DrivingScenario Designer in the matlab automatic driving toolbox. The speed decision-making of the smart car proposed in the present invention is mainly aimed at traffic intersections without signal lights, so the first step is to add two vertical lines in the scene. Intersecting roads with a length of 100 meters are used as road objects in the driving environment, where each road is a two-way double-lane, and the width of each lane is 4.4 meters. The second step is to add vehicle objects, including the own car (smart car) and other cars. The starting position and target position of the own car are fixed, mai...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses an intelligent vehicle speed decision-making method based on a deep reinforcement learning method. The method comprises the steps of constructing a state space S, an action space A and an instant rewarding space R of a Markov decision-making model of an intelligent vehicle passing through an intersection; initializing a neural network, and constructing an experience pool; performing action selection by adopting an epsilon-greed algorithm, and filling the experience into the experience pool constructed in the step 2; randomly selecting a part of experience from the experience pool, and training a neural network by adopting a stochastic gradient descent method; completing the speed decision of the intelligent vehicle at the current moment according to the latest neural network, adding the experience to an experience pool, randomly selecting a part of experience, and carrying out the training of a new round of neural network. The invention further discloses a simulation method of the intelligent vehicle speed decision-making method based on deep reinforcement learning. The method is advantaged in that simulation experiments are carried out based on a deep reinforcement learning simulation system established by a matlab automatic driving toolbox.

Description

technical field [0001] The invention relates to the technical field of deep reinforcement learning and the technical field of automatic driving, in particular to a speed decision method for a smart car based on deep reinforcement learning and Matlab. Background technique [0002] With the development of society and the continuous improvement of the level of science and technology, people put forward new requirements for cars. Many automakers and scientific research institutions are shifting their research focus from "traditional cars" to "smart cars that integrate people, vehicles, and roads." Various countries have also introduced regulations and policies to promote the development of smart cars. [0003] Autonomous driving technology has become the core technology and research hotspot for the future development of smart cars, and decision-making control, as one of the four branches of autonomous driving technology, occupies an important position. Existing intelligent vehi...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): G06F30/15G06F30/20G06N3/04G06N3/08

CPCG06F30/15G06F30/20G06N3/08G06N3/045

Inventor 赵海艳陈伟轩刘晓斌赵津杨冯宇驰

Owner JILIN UNIV

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Intelligent vehicle speed decision-making method based on deep reinforcement learning and simulation method thereof

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment 1

Embodiment 2

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology