Deep reinforcement learning intelligent vehicle behavior decision-making method based on path planning

A technology of reinforcement learning and path planning, applied in vehicle position/route/height control, motor vehicles, two-dimensional position/course control, etc., can solve the problem that the agent cannot learn, and improve the actual generalization ability, The effect of narrowing the gap and simplifying the system architecture

Pending Publication Date: 2022-03-08
JILIN UNIV
View PDF7 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

This optimal strategy is guided by the reward function, and the sparse reward in the learning process will cause the agent to fail to learn a good strategy. Therefore, the design of the reward function is an important part of reinforcement learning, and how to design the reward function reasonably is also important. One of the main research directions of reinforcement learning

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Deep reinforcement learning intelligent vehicle behavior decision-making method based on path planning
  • Deep reinforcement learning intelligent vehicle behavior decision-making method based on path planning
  • Deep reinforcement learning intelligent vehicle behavior decision-making method based on path planning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0039] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0040] The specific implementation of the present invention will be described in detail below in conjunction with specific embodiments.

[0041] In one embodiment of the invention, see figure 1 , the described a kind of deep reinforcement learning intelligent car behavior decision-making method based on path planning comprises the following steps:

[0042] S1. Model the task as a Markov decision process;

[0043] S2. Build a deep reinforcement learning algorithm;

[0044] S3. Intelligent body input design;

[0045] S4. Intelligent body output design;

[0046] S5. Building a training network st...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a deep reinforcement learning intelligent vehicle behavior decision-making method based on path planning, and belongs to the technical field of intelligent vehicle automatic driving, the deep reinforcement learning intelligent vehicle behavior decision-making method based on path planning comprises the steps of modeling a task as a Markov decision-making process, building a deep reinforcement learning algorithm, and making an intelligent vehicle behavior decision-making process. Designing agent input, designing agent output, building a training network structure, performing path planning on a task environment, improving a reward function, and training and testing an agent model; the method has the advantages that complex decisions are processed, the process from a simulation scene to practical application is simplified, the problems of low training speed and difficulty in convergence are solved, and the practical generalization ability of the intelligent agent model is improved.

Description

technical field [0001] The invention relates to the technical field of intelligent vehicle automatic driving, in particular to a path planning-based deep reinforcement learning intelligent vehicle behavior decision-making method. Background technique [0002] In the face of increasingly severe traffic congestion, driving safety and environmental pollution, autonomous driving smart cars have become an inevitable trend in the development of the automotive industry. At the same time, autonomous driving plays a role in promoting industrial prosperity, economic development, technological innovation, and social progress. Major countries have raised it to the level of national strategy. The system architecture of a smart car mainly includes a perception module, a decision-making module, and a control module. The decision-making module, as the "brain" of an unmanned vehicle, is a direct reflection of the intelligence of the autonomous driving system and plays a decisive role in the ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G05D1/02
CPCG05D1/0221G05D2201/02
Inventor 赵海艳靳英豪卢星昊刘万陈虹
Owner JILIN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products