Unmanned aerial vehicle flight control method based on imitation learning and reinforcement learning algorithms

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A flight control and reinforcement learning technology, applied in attitude control, non-electric variable control, control/regulation systems, etc., can solve the problems of sparse rewards, large fluctuations, and excessive space for action exploration, achieving high similarity and scalability Strong and suitable for migration

Active Publication Date: 2021-01-01

NANJING UNIV

View PDF13 Cites 11 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0005] Purpose of the invention: In view of the problem of autonomous flight of drones in the prior art, rule control cannot handle complex and changing environments, and the reinforcement learning algorithm has too large action exploration space, sparse rewards, and the learned flight strategy is not stable enough and fluctuates greatly. The invention provides a UAV flight control method based on imitation learning and reinforcement learning algorithms

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0030] Below in conjunction with specific embodiment, further illustrate the present invention, should be understood that these embodiments are only used to illustrate the present invention and are not intended to limit the scope of the present invention, after having read the present invention, those skilled in the art will understand various equivalent forms of the present invention All modifications fall within the scope defined by the appended claims of the present application.

[0031] A UAV flight control method based on imitation learning and reinforcement learning algorithms, which defines different complete flight actions according to actual flight data, and learns the collected flight trajectories through imitation learning to simplify the action space of the original problem Effect. Then use the pDQN algorithm, an improved algorithm of the DQN algorithm, to learn the strategy in the simplified space. Include the following steps:

[0032] step one:

[0033]Firstly...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses an unmanned aerial vehicle flight control method based on imitation learning and reinforcement learning algorithms. The method comprises the following steps: creating an unmanned aerial vehicle flight simulation environment simulator; defining a basic action set of flight; classifying the trajectory data according to the flight basic actions; for each flight action, learning mapping network parameters from a flight basic action to an original action by utilizing imitation learning; counting the minimum continuous action number of each basic action; constructing an upper-layer reinforcement learning network, and adding the minimum continuous action number as punishment p of aircraft action inconsistency; in the simulator, obtaining current observation information andawards, and selecting corresponding flight basic actions by using a pDQN algorithm; inputting the state information of the aircraft into an imitation learning neural network corresponding to the flight basic action, and outputting the original action of the simulator; inputting the obtained original action into a simulator to obtain observation and awards of the next moment; and performing training by using a pDQN algorithm until the strategy network on the upper layer converges.

Description

technical field [0001] The invention relates to a UAV flight control method based on imitation learning and reinforcement learning algorithms in a complex and rapidly changing environment, and belongs to the technical field of UAV control. Background technique [0002] The problem of autonomous flight of UAVs in unknown environments has always been one of the main difficulties in autonomous flight of UAVs. The traditional UAV flight mainly adopts a rule-based control method. First, list the possible impacts of the environment, and then Let experts in relevant fields formulate flight rules. However, this method generally only achieves ideal results in relatively simple environments. In a complex and changing environment, due to many influencing factors and frequent changes in the surrounding environmental scenes, there will be a large number of sample data that have not appeared in the training data. Flight rules tend to be less effective or even non-existent in such situat...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): G05D1/08G05D1/10

CPCG05D1/0808G05D1/101

Inventor 俞扬詹德川周志华付聪张云天袁雷庞竟成罗凡明

Owner NANJING UNIV

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Unmanned aerial vehicle flight control method based on imitation learning and reinforcement learning algorithms

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology