Automatic driving method based on conditional imitation learning and reinforcement learning

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A technology of reinforcement learning and automatic driving, applied in neural learning methods, computer components, structured data retrieval, etc., can solve problems such as low efficiency of reinforcement learning exploration, rough rough PP angle, poor interpretability, etc. Achieve the effects of reducing the dependence on training data sets, enhancing robustness, and improving performance

Inactive Publication Date: 2022-03-11

ANHUI COWAROBOT CO LTD +1

View PDF3 Cites 1 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0008] However, the methods proposed in the above papers and patents have the following deficiencies: simply using RGB images as the main source of perception has poor interpretability and robustness. Very unsafe; the rough PP angle calculated by image point selection is too rough, and a better control amount of real vehicle data can be selected in actual landing; the imitation learning framework has a huge convolutional layer and insufficient real-time performance

However, this invention does not reduce the model's dependence on the training data set by combining CIL and RL, nor does it solve the problem of low efficiency of reinforcement learning exploration due to random initialization.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment 1

[0053] According to an automatic driving method based on conditional imitation learning and reinforcement learning provided by the present invention, such as Figure 1-Figure 7 shown, including:

[0054] Step S1: Collect driving data, and mark the corresponding decision-making actions as expert decisions;

[0055] Step S2: According to the driving data, calculate the heading angle, and obtain the feature vector and feature map;

[0056] Step S3: Construct a training data set based on the feature vector, feature map, heading angle and expert decision, and use the training data set to train the model;

[0057] Step S4: Apply the trained model to the vehicle to realize the automatic driving decision of the vehicle.

[0058] Specifically, in the step S1:

[0059] The driving data includes vehicle status information and perception information obtained from on-board sensors;

[0060] The vehicle status information includes vehicle position information, speed information, steerin...

Embodiment 2

[0080] Embodiment 2 is a preferred example of Embodiment 1 to describe the present invention more specifically.

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention provides an automatic driving method based on conditional imitation learning and reinforcement learning, and the method comprises the steps: S1, collecting driving data, and marking a corresponding decision action as an expert decision; s2, according to the driving data, calculating to obtain a course angle, and obtaining a feature vector and a feature map; s3, constructing a training data set based on the feature vector, the feature map, the course angle and the expert decision, and training a model by using the training data set; and S4, applying the trained model to the vehicle to realize the automatic driving decision of the vehicle. According to the method, by combining conditional imitation learning and reinforcement learning, the dependence of the model on a training data set is reduced, and meanwhile, the problem of low reinforcement learning exploration efficiency caused by random initialization is solved; real vehicle data are collected and applied to model training, so that the trained model is closer to a real driving scene.

Description

technical field [0001] The invention relates to the field of automatic driving, in particular to an automatic driving method based on conditional imitation learning and reinforcement learning. Background technique [0002] The application of autonomous driving can significantly reduce traffic accidents, alleviate traffic congestion, improve traffic efficiency, and save energy consumption, so it has been highly valued. Governments, enterprises, and research institutions have invested huge amounts of money and manpower in the research of autonomous driving, aiming to realize its commercial application as soon as possible. The traditional rule-based (Rule-Based) automatic driving solution subdivides the entire system into modules such as perception, positioning, planning, and control. Through the combined use of these modules, the automatic driving of the vehicle is realized. However, this scheme has a complex structure, high cost, and there are multiple intermediate links, an...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): G06V10/80G06V10/774G06V20/58G06K9/62G06F16/29G06N3/04G06N3/08

CPCG06F16/29G06N3/08G06N3/045G06F18/214G06F18/253

Inventor 何弢张润玺王辉廖文龙

Owner ANHUI COWAROBOT CO LTD

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Automatic driving method based on conditional imitation learning and reinforcement learning

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment 1

Embodiment 2

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology