Unmanned aerial vehicle flight control method based on imitation learning and reinforcement learning algorithms

A flight control and reinforcement learning technology, applied in attitude control, non-electric variable control, control/regulation systems, etc., can solve the problems of sparse rewards, large fluctuations, and excessive space for action exploration, achieving high similarity and scalability Strong and suitable for migration

Active Publication Date: 2021-01-01
NANJING UNIV
View PDF13 Cites 11 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] Purpose of the invention: In view of the problem of autonomous flight of drones in the prior art, rule control cannot handle complex and changing environments, and the reinforcement learning algorithm has too large action exploration space, sparse rewards, and the learned flight strategy is not stable enough and fluctuates greatly. The invention provides a UAV flight control method based on imitation learning and reinforcement learning algorithms

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Unmanned aerial vehicle flight control method based on imitation learning and reinforcement learning algorithms
  • Unmanned aerial vehicle flight control method based on imitation learning and reinforcement learning algorithms
  • Unmanned aerial vehicle flight control method based on imitation learning and reinforcement learning algorithms

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0030] Below in conjunction with specific embodiment, further illustrate the present invention, should be understood that these embodiments are only used to illustrate the present invention and are not intended to limit the scope of the present invention, after having read the present invention, those skilled in the art will understand various equivalent forms of the present invention All modifications fall within the scope defined by the appended claims of the present application.

[0031] A UAV flight control method based on imitation learning and reinforcement learning algorithms, which defines different complete flight actions according to actual flight data, and learns the collected flight trajectories through imitation learning to simplify the action space of the original problem Effect. Then use the pDQN algorithm, an improved algorithm of the DQN algorithm, to learn the strategy in the simplified space. Include the following steps:

[0032] step one:

[0033]Firstly...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an unmanned aerial vehicle flight control method based on imitation learning and reinforcement learning algorithms. The method comprises the following steps: creating an unmanned aerial vehicle flight simulation environment simulator; defining a basic action set of flight; classifying the trajectory data according to the flight basic actions; for each flight action, learning mapping network parameters from a flight basic action to an original action by utilizing imitation learning; counting the minimum continuous action number of each basic action; constructing an upper-layer reinforcement learning network, and adding the minimum continuous action number as punishment p of aircraft action inconsistency; in the simulator, obtaining current observation information andawards, and selecting corresponding flight basic actions by using a pDQN algorithm; inputting the state information of the aircraft into an imitation learning neural network corresponding to the flight basic action, and outputting the original action of the simulator; inputting the obtained original action into a simulator to obtain observation and awards of the next moment; and performing training by using a pDQN algorithm until the strategy network on the upper layer converges.

Description

technical field [0001] The invention relates to a UAV flight control method based on imitation learning and reinforcement learning algorithms in a complex and rapidly changing environment, and belongs to the technical field of UAV control. Background technique [0002] The problem of autonomous flight of UAVs in unknown environments has always been one of the main difficulties in autonomous flight of UAVs. The traditional UAV flight mainly adopts a rule-based control method. First, list the possible impacts of the environment, and then Let experts in relevant fields formulate flight rules. However, this method generally only achieves ideal results in relatively simple environments. In a complex and changing environment, due to many influencing factors and frequent changes in the surrounding environmental scenes, there will be a large number of sample data that have not appeared in the training data. Flight rules tend to be less effective or even non-existent in such situat...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G05D1/08G05D1/10
CPCG05D1/0808G05D1/101
Inventor 俞扬詹德川周志华付聪张云天袁雷庞竟成罗凡明
Owner NANJING UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products