A cooperative strategy training method for autonomous control of fixed-wing unmanned aerial vehicles

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A technology of autonomous control and training methods, applied in control/regulation systems, non-electric variable control, three-dimensional position/channel control, etc., can solve problems such as dependence, large space for exploration and learning, and difficult training, so as to achieve accelerated learning and narrowed exploration The effect of space, efficient trial and error costs

Active Publication Date: 2021-07-30

NANJING UNIV

View PDF11 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

However, simple reinforcement learning also has its limitations. The space for exploration and learning is too large, and the effect depends heavily on parameter tuning tricks, making training difficult.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0023] Below in conjunction with specific embodiment, further illustrate the present invention, should be understood that these embodiments are only used to illustrate the present invention and are not intended to limit the scope of the present invention, after having read the present invention, those skilled in the art will understand various equivalent forms of the present invention All modifications fall within the scope defined by the appended claims of the present application.

[0024] A fixed-wing unmanned aerial vehicle autonomous control cooperative strategy training method, comprising the following steps:

[0025] Step 1: Build a simulator Em_s for fixed-wing UAV control based on dynamics, and the visualization part of simulator Em_s is implemented based on the unity3D engine. The UAV simulates the environment E here s The training process in is defined as the tuple form of the Markov decision process (MDP), where S is the state information of the UAV, A is the acti...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a fixed-wing unmanned aerial vehicle autonomous control cooperation strategy training method, comprising the following steps: (1) constructing a fixed-wing unmanned aerial vehicle control simulation environment E based on dynamics s , collect the real trajectory data of the pilot controlling the UAV, and learn the flight control strategy of the UAV through supervised learning; (2) Construct a simplified abstract environment E that strips the flight control a , create two groups of unmanned aerial vehicle swarms for group confrontation, and use the APEX_QMIX algorithm to learn the cooperative strategy; (3) Combine the flight control strategy and the cooperative strategy in the way of hierarchical reinforcement learning, and in the simulation environment E s Integrate the strategies learned in middle school; (3) Migrate to the real environment. The method of the invention is of great significance in real scenes, and has the characteristics of good generalization, low cost, strong robustness and the like.

Description

technical field [0001] The invention relates to a fixed-wing unmanned aerial vehicle autonomous control cooperation strategy training method based on layered reinforcement learning and multi-agent reinforcement learning, and the technical field of unmanned aerial vehicle autonomous control cooperation strategy. Background technique [0002] For the traditional autonomous control and cooperation strategy of fixed-wing UAV, it mainly adopts the method of automatic control, artificial modeling, and formulation of strategy. Rely on the development of flight rules by experts in the relevant field. The cost is high and due to the frequent scene changes in the complex and changing environment, there are a large number of situations that are not considered in the flight rules. Therefore, flight rules are usually unable to deal with complex and changing environments, and their capabilities are low. [0003] Recently, with the vigorous development of machine learning technology, rei...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Patents(China)

IPC IPC(8): G05D1/10

CPCG05D1/104

Inventor 俞扬詹德川周志华王超袁雷陈立坤黄宇洋庞竟成

Owner NANJING UNIV

A cooperative strategy training method for autonomous control of fixed-wing unmanned aerial vehicles

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology