Vision-based deep imitation reinforcement learning driving strategy training method

A technology of reinforcement learning and driving strategy, applied in the direction of neural learning methods, biological neural network models, neural architectures, etc., can solve problems such as inability to handle unknown scenarios, time-consuming, time-consuming training, etc., to improve the processing ability of unknown environments, The effect of ensuring comfort and safety and reducing learning costs

Active Publication Date: 2021-01-15
DALIAN UNIV
View PDF6 Cites 13 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] The performance of the end-to-end model based on the end-to-end visual lane keeping method disclosed in Chinese patent document CN109446919A depends on the quantity and quality of the collected driving data. To obtain an excellent driving strategy, it is necessary to collect data of various driving scenarios, which consumes a lot of time. time
Secondly, because it is unrealistic to collect driving data for all scenarios, the model cannot handle unknown scenarios, and it is difficult to improve its performance in one step
[0007] Chinese patent document CN108897313A discloses a hierarchical end-to-end vehicle automatic driving system construction method. The first two layers of neural network models need to rely on a large amount of label data for training, and the second two layers of reinforcement learning model algorithms are still traditional reinforcement learning training methods. , requires a lot of exploration, and the training is very time-consuming
D...

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Vision-based deep imitation reinforcement learning driving strategy training method
  • Vision-based deep imitation reinforcement learning driving strategy training method
  • Vision-based deep imitation reinforcement learning driving strategy training method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0045] The present invention will be further described in detail below in conjunction with the accompanying drawings and specific embodiments: taking this as an example to further describe and illustrate the present application. Apparently, the described embodiments are only some of the embodiments of the present invention, not all of them.

[0046] This embodiment proposes a vision-based deep imitation reinforcement learning driving strategy training method, combining the advantages of imitation learning and deep reinforcement learning, obtaining initial driving strategy learning through imitation learning, and then solving the online driving strategy learning problem by deep reinforcement learning. The output of imitation learning is used as the input of deep reinforcement learning, which reduces the exploration space and improves the learning efficiency; at the same time, deep reinforcement learning solves the driving strategy learning of unknown environment, thereby improvi...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a vision-based deep imitation reinforcement learning driving strategy training method. The method comprises the steps of constructing an imitation learning network; training the imitation learning network; performing network splitting on the trained imitation learning network to obtain a sensing module; constructing a DDPG network to obtain a control module; completing theconstruction of a deep imitation reinforcement learning model through the sensing module and the control module; and training the deep imitation reinforcement learning model. An imitation learning network comprises five convolution layers and four full connection layers, the convolution layers are used for extracting features, and the full connection layers are used for predicting a steering angle, an accelerator and a brake opening degree; in addition, a reward function is set in the training process of the deep imitation reinforcement learning model, and comfort and safety of curve driving are guaranteed.

Description

technical field [0001] The invention relates to the technical field of automatic driving, in particular to a vision-based deep imitation reinforcement learning driving strategy training method. Background technique [0002] The rise of autonomous driving technology provides new solutions to existing traffic problems. Autonomous driving technology can effectively improve the driving efficiency of road motor vehicles, thereby alleviating traffic pressure. And by using the efficient and precise execution of the machine, traffic accidents are reduced and the driving safety index is improved. At the same time, the development of science and technology has promoted the rise of traffic intelligence. From computing power, traffic big data to popular deep learning, they have jointly promoted the rapid development of autonomous driving technology. [0003] In various tasks of autonomous driving, sensors such as radar, lidar, ultrasonic sensors and infrared cameras have been widely u...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06N3/04G06N3/08B60W60/00
CPCG06N3/08B60W60/001G06N3/048G06N3/045Y02T10/40
Inventor 邹启杰熊康高兵汪祖民王东
Owner DALIAN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products