Automatic driving method based on conditional imitation learning and reinforcement learning

A technology of reinforcement learning and automatic driving, applied in neural learning methods, computer components, structured data retrieval, etc., can solve problems such as low efficiency of reinforcement learning exploration, rough rough PP angle, poor interpretability, etc. Achieve the effects of reducing the dependence on training data sets, enhancing robustness, and improving performance

Inactive Publication Date: 2022-03-11
ANHUI COWAROBOT CO LTD +1
View PDF3 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0008] However, the methods proposed in the above papers and patents have the following deficiencies: simply using RGB images as the main source of perception has poor interpretability and robustness. Very unsafe; the rough PP angle calculated by image point selection is too rough, and a better control amount of real vehicle data can be selected in actual landing; the imitation learning framework has a huge convolutional layer and insufficient real-time performance
However, this invention does not reduce the model's dependence on the training data set by combining CIL and RL, nor does it solve the problem of low efficiency of reinforcement learning exploration due to random initialization.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Automatic driving method based on conditional imitation learning and reinforcement learning
  • Automatic driving method based on conditional imitation learning and reinforcement learning
  • Automatic driving method based on conditional imitation learning and reinforcement learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0053] According to an automatic driving method based on conditional imitation learning and reinforcement learning provided by the present invention, such as Figure 1-Figure 7 shown, including:

[0054] Step S1: Collect driving data, and mark the corresponding decision-making actions as expert decisions;

[0055] Step S2: According to the driving data, calculate the heading angle, and obtain the feature vector and feature map;

[0056] Step S3: Construct a training data set based on the feature vector, feature map, heading angle and expert decision, and use the training data set to train the model;

[0057] Step S4: Apply the trained model to the vehicle to realize the automatic driving decision of the vehicle.

[0058] Specifically, in the step S1:

[0059] The driving data includes vehicle status information and perception information obtained from on-board sensors;

[0060] The vehicle status information includes vehicle position information, speed information, steerin...

Embodiment 2

[0080] Embodiment 2 is a preferred example of Embodiment 1 to describe the present invention more specifically.

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides an automatic driving method based on conditional imitation learning and reinforcement learning, and the method comprises the steps: S1, collecting driving data, and marking a corresponding decision action as an expert decision; s2, according to the driving data, calculating to obtain a course angle, and obtaining a feature vector and a feature map; s3, constructing a training data set based on the feature vector, the feature map, the course angle and the expert decision, and training a model by using the training data set; and S4, applying the trained model to the vehicle to realize the automatic driving decision of the vehicle. According to the method, by combining conditional imitation learning and reinforcement learning, the dependence of the model on a training data set is reduced, and meanwhile, the problem of low reinforcement learning exploration efficiency caused by random initialization is solved; real vehicle data are collected and applied to model training, so that the trained model is closer to a real driving scene.

Description

technical field [0001] The invention relates to the field of automatic driving, in particular to an automatic driving method based on conditional imitation learning and reinforcement learning. Background technique [0002] The application of autonomous driving can significantly reduce traffic accidents, alleviate traffic congestion, improve traffic efficiency, and save energy consumption, so it has been highly valued. Governments, enterprises, and research institutions have invested huge amounts of money and manpower in the research of autonomous driving, aiming to realize its commercial application as soon as possible. The traditional rule-based (Rule-Based) automatic driving solution subdivides the entire system into modules such as perception, positioning, planning, and control. Through the combined use of these modules, the automatic driving of the vehicle is realized. However, this scheme has a complex structure, high cost, and there are multiple intermediate links, an...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06V10/80G06V10/774G06V20/58G06K9/62G06F16/29G06N3/04G06N3/08
CPCG06F16/29G06N3/08G06N3/045G06F18/214G06F18/253
Inventor 何弢张润玺王辉廖文龙
Owner ANHUI COWAROBOT CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products