Improved PMADDPG multi-unmanned aerial vehicle task decision-making method based on transfer learning

A multi-UAV, transfer learning technology, applied in the field of flight control, can solve the problems of not considering dynamic changes and constraints, real-time performance needs to be improved, and insufficient performance

Active Publication Date: 2020-10-30
NORTHWESTERN POLYTECHNICAL UNIV
View PDF2 Cites 9 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Most of these traditional algorithms are proposed based on the optimization problem of a single objective function, and most of them are based on theoretical research, applicable to static combat environments, without considering various dynamic changes and constraints in actual air combat
On the other hand, although some UAV research results have introduced the method of deep reinforcement learning,...

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Improved PMADDPG multi-unmanned aerial vehicle task decision-making method based on transfer learning
  • Improved PMADDPG multi-unmanned aerial vehicle task decision-making method based on transfer learning
  • Improved PMADDPG multi-unmanned aerial vehicle task decision-making method based on transfer learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0156] In this embodiment, the PMADDPG algorithm is mainly designed, and a deterministic action strategy is adopted. For the PMADDPG algorithm, every time a training is performed, a new environment is input for a transfer learning, and the size of the experience pool B is 2,000,000, and the size of the experience pool M is 1,000,000. The Actor network structure is a fully connected neural network of [56; 56; 2], and the structure of the Critic network is a fully connected neural network of [118; 78; 36; 1], such as Figure 7 As shown, the specific network parameter design is shown in Table 1:

[0157] Table 1 Specific network parameters

[0158]

[0159] The results of multi-UAV mission decision-making are as follows: Figure 8 As shown, the square shaded area in the figure is the threat area, and the circular area is the target area. It can be seen that the flight trajectories of the three drones all entered the target area and avoided all the threat areas. The results ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an improved PMADDPG multi-unmanned aerial vehicle task decision-making method based on transfer learning. The method comprises the steps of firstly, performing reasonable mathematical modeling for threats such as air defense missiles and radars in an environment under the background of a multi-unmanned aerial vehicle actual combat environment, then creating a plurality of different two-dimensional combat environment models, designing constraint conditions, and learning and training the plurality of combat environments in sequence, thereby obtaining a final multi-unmanned aerial vehicle task allocation model. According to the method, the defect that in the prior art, task decision making can only be carried out in a known or static combat environment is overcome, decision making can also be carried out efficiently in an unknown combat environment, tasks of unmanned aerial vehicles are achieved, and the viability of an unmanned aerial vehicle group in the unknowncombat environment is greatly guaranteed.

Description

technical field [0001] The invention belongs to the field of flight control, and in particular relates to a method for multi-unmanned aerial vehicle task decision-making. Background technique [0002] For the military of various countries, drones will become one of the indispensable weapons in the future battlefield. UAVs are likely to become the target of attacks and counterattacks by multiple combat platforms, and become the most common and deadly air combat "sword". The cooperative combat method of multiple UAVs will become the mainstream development trend in the future. At present, military academies and scholars at home and abroad pay great attention to the research on multi-UAV mission decision-making problems, and there are many achievements. However, there are still many problems in the research of multi-UAV mission decision-making. For example, in multi-UAV cooperative search, tracking, task assignment, track planning, formation control and other issues, on the on...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F30/15G06F30/27G05D1/00G06F111/04
CPCG06F30/15G06F30/27G05D1/0088G06F2111/04
Inventor 李波甘志刚梁诗阳高晓光万开方高佩忻
Owner NORTHWESTERN POLYTECHNICAL UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products