Eureka AIR delivers breakthrough ideas for toughest innovation challenges, trusted by R&D personnel around the world.

Intelligent vehicle speed decision-making method based on deep reinforcement learning and simulation method thereof

A technology of reinforcement learning and decision-making methods, applied in neural learning methods, design optimization/simulation, biological neural network models, etc. and other problems to achieve the effect of easy verification

Active Publication Date: 2020-11-06
JILIN UNIV
View PDF10 Cites 26 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Existing intelligent vehicle decision-making methods have more or less defects. For example, the rule-based method is mostly suitable for simple scenarios, and it is difficult to be competent for the urban road environment with rich structural features; the decision tree method based on the polling mechanism requires Decision network is defined offline for each driving scenario; information / ontology-based decision-making inference methods rely on vehicle-to-vehicle (V2V) communication and require full knowledge of other vehicles’ information (including driving intentions)
[0004] At the same time, the above methods also have the common problem of ignoring the dynamics and uncertainty of the environment. However, the real traffic environment often has strong uncertainties. The relationship between vehicles and vehicles, vehicles and roads is intricate and changing in real time. It is difficult to achieve safe and efficient decision-making, affecting the intelligent driving of vehicles, and even causing traffic accidents

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Intelligent vehicle speed decision-making method based on deep reinforcement learning and simulation method thereof
  • Intelligent vehicle speed decision-making method based on deep reinforcement learning and simulation method thereof
  • Intelligent vehicle speed decision-making method based on deep reinforcement learning and simulation method thereof

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0067] Such as figure 1 As shown, a smart car speed decision method based on deep reinforcement learning includes the following steps:

[0068] Step 1. Construct state space S, action space A, and immediate reward space R

[0069] Markov decision model, which can be represented by a quaternion , which are state space, action space, state transition function, and immediate reward. In this method, no state transition function is involved, so only the state space S, the action space A, and the immediate reward space R need to be constructed.

[0070] The composition of the state space includes the state of the smart car (this car) and the state of other cars. The state space is constructed as follows:

[0071] S=[s ego ,s 1 ,s 2 ,...,s n ]

[0072] where s ego Indicates the state of the vehicle, s 1 -s n Indicates the status of other vehicles in the current traffic scene, and n indicates the number of other vehicles.

[0073] Regardless of whether it is the car or anot...

Embodiment 2

[0113] A simulation method of a smart car speed decision-making method based on deep reinforcement learning, which is based on a DQN (Deep-Q-Learning) simulation system built by matlab automatic driving toolbox for simulation experiments, including the following steps:

[0114] First build the road environment for automatic driving decision-making, which is realized by the DrivingScenario Designer in the matlab automatic driving toolbox. The speed decision-making of the smart car proposed in the present invention is mainly aimed at traffic intersections without signal lights, so the first step is to add two vertical lines in the scene. Intersecting roads with a length of 100 meters are used as road objects in the driving environment, where each road is a two-way double-lane, and the width of each lane is 4.4 meters. The second step is to add vehicle objects, including the own car (smart car) and other cars. The starting position and target position of the own car are fixed, mai...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an intelligent vehicle speed decision-making method based on a deep reinforcement learning method. The method comprises the steps of constructing a state space S, an action space A and an instant rewarding space R of a Markov decision-making model of an intelligent vehicle passing through an intersection; initializing a neural network, and constructing an experience pool; performing action selection by adopting an epsilon-greed algorithm, and filling the experience into the experience pool constructed in the step 2; randomly selecting a part of experience from the experience pool, and training a neural network by adopting a stochastic gradient descent method; completing the speed decision of the intelligent vehicle at the current moment according to the latest neural network, adding the experience to an experience pool, randomly selecting a part of experience, and carrying out the training of a new round of neural network. The invention further discloses a simulation method of the intelligent vehicle speed decision-making method based on deep reinforcement learning. The method is advantaged in that simulation experiments are carried out based on a deep reinforcement learning simulation system established by a matlab automatic driving toolbox.

Description

technical field [0001] The invention relates to the technical field of deep reinforcement learning and the technical field of automatic driving, in particular to a speed decision method for a smart car based on deep reinforcement learning and Matlab. Background technique [0002] With the development of society and the continuous improvement of the level of science and technology, people put forward new requirements for cars. Many automakers and scientific research institutions are shifting their research focus from "traditional cars" to "smart cars that integrate people, vehicles, and roads." Various countries have also introduced regulations and policies to promote the development of smart cars. [0003] Autonomous driving technology has become the core technology and research hotspot for the future development of smart cars, and decision-making control, as one of the four branches of autonomous driving technology, occupies an important position. Existing intelligent vehi...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F30/15G06F30/20G06N3/04G06N3/08
CPCG06F30/15G06F30/20G06N3/08G06N3/045
Inventor 赵海艳陈伟轩刘晓斌赵津杨冯宇驰
Owner JILIN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Eureka Blog
Learn More
PatSnap group products