A cooperative strategy training method for autonomous control of fixed-wing unmanned aerial vehicles

A technology of autonomous control and training methods, applied in control/regulation systems, non-electric variable control, three-dimensional position/channel control, etc., can solve problems such as dependence, large space for exploration and learning, and difficult training, so as to achieve accelerated learning and narrowed exploration The effect of space, efficient trial and error costs

Active Publication Date: 2021-07-30
NANJING UNIV
View PDF11 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, simple reinforcement learning also has its limitations. The space for exploration and learning is too large, and the effect depends heavily on parameter tuning tricks, making training difficult.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A cooperative strategy training method for autonomous control of fixed-wing unmanned aerial vehicles
  • A cooperative strategy training method for autonomous control of fixed-wing unmanned aerial vehicles
  • A cooperative strategy training method for autonomous control of fixed-wing unmanned aerial vehicles

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0023] Below in conjunction with specific embodiment, further illustrate the present invention, should be understood that these embodiments are only used to illustrate the present invention and are not intended to limit the scope of the present invention, after having read the present invention, those skilled in the art will understand various equivalent forms of the present invention All modifications fall within the scope defined by the appended claims of the present application.

[0024] A fixed-wing unmanned aerial vehicle autonomous control cooperative strategy training method, comprising the following steps:

[0025] Step 1: Build a simulator Em_s for fixed-wing UAV control based on dynamics, and the visualization part of simulator Em_s is implemented based on the unity3D engine. The UAV simulates the environment E here s The training process in is defined as the tuple form of the Markov decision process (MDP), where S is the state information of the UAV, A is the acti...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a fixed-wing unmanned aerial vehicle autonomous control cooperation strategy training method, comprising the following steps: (1) constructing a fixed-wing unmanned aerial vehicle control simulation environment E based on dynamics s , collect the real trajectory data of the pilot controlling the UAV, and learn the flight control strategy of the UAV through supervised learning; (2) Construct a simplified abstract environment E that strips the flight control a , create two groups of unmanned aerial vehicle swarms for group confrontation, and use the APEX_QMIX algorithm to learn the cooperative strategy; (3) Combine the flight control strategy and the cooperative strategy in the way of hierarchical reinforcement learning, and in the simulation environment E s Integrate the strategies learned in middle school; (3) Migrate to the real environment. The method of the invention is of great significance in real scenes, and has the characteristics of good generalization, low cost, strong robustness and the like.

Description

technical field [0001] The invention relates to a fixed-wing unmanned aerial vehicle autonomous control cooperation strategy training method based on layered reinforcement learning and multi-agent reinforcement learning, and the technical field of unmanned aerial vehicle autonomous control cooperation strategy. Background technique [0002] For the traditional autonomous control and cooperation strategy of fixed-wing UAV, it mainly adopts the method of automatic control, artificial modeling, and formulation of strategy. Rely on the development of flight rules by experts in the relevant field. The cost is high and due to the frequent scene changes in the complex and changing environment, there are a large number of situations that are not considered in the flight rules. Therefore, flight rules are usually unable to deal with complex and changing environments, and their capabilities are low. [0003] Recently, with the vigorous development of machine learning technology, rei...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G05D1/10
CPCG05D1/104
Inventor 俞扬詹德川周志华王超袁雷陈立坤黄宇洋庞竟成
Owner NANJING UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products