Multi-unmanned aerial vehicle task decision-making method based on MADDPG

A multi-UAV and decision-making technology, applied in the field of flight control, can solve problems such as increased environmental complexity, increased variance, and unstable environment

Active Publication Date: 2020-11-03
NORTHWESTERN POLYTECHNICAL UNIV
View PDF10 Cites 35 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, in a multi-UAV environment, it is difficult for traditional reinforcement learning methods to play a role, because in a multi-UAV environment, each UAV is constantly changing, and the environment is no longer stable, and for traditional reinforcement learning algorithms For the strategy gradient method in , as the number of drones increases, the complexity of the environment also increases, which leads to the optimization method of estimating the gradient by sampling, the variance increases sharply, and it is difficult to calculate the final result

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Multi-unmanned aerial vehicle task decision-making method based on MADDPG
  • Multi-unmanned aerial vehicle task decision-making method based on MADDPG
  • Multi-unmanned aerial vehicle task decision-making method based on MADDPG

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0149] In this example, the final network structure is designed as: Actor network structure is [56; 56; 2] fully connected neural network, critic network structure is [118; 78; 36; 1] fully connected neural network, two neural networks The hidden layer uses the RELU function as the activation function, such as Figure 6 shown. The mini-batch size during training is 1024, the maximum learning step size (maxepisode) is 30000, the update rate of the auxiliary network is τ=0.01, the learning rate of the Critic network is 0.01, and the learning rate of the Actor network is 0.001. Both networks are The AdamOptimizer optimizer is used for learning, and the size of the experience pool is 1,000,000. Once the data in the experience pool exceeds the maximum value, the original experience data will be lost, and the performance of the multi-UAV task decision-making network constructed is optimal.

[0150] The present invention initializes the positions of three unmanned aerial vehicles in...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a multi-unmanned aerial vehicle task decision method based on MADDPG, which introduces an MADDPG algorithm into multi-unmanned aerial vehicle task allocation, and comprises thesteps of firstly establishing a two-dimensional combat environment model required by deep reinforcement learning according to the actual combat environment of multiple unmanned aerial vehicles, secondly establishing mathematical description of various threats such as air defense missiles in the combat environment of the multiple unmanned aerial vehicles, and finally, taking the tracks and distances of the multiple unmanned aerial vehicles and the defense threats of the battlefield as constraint conditions, and performing learning training to obtain a multi-unmanned aerial vehicle task decision model. According to the method, an experience pool and a double-network structure are adopted, so that the operation and convergence speed of the whole network is greatly improved, a result can be obtained more quickly in the high-speed flight process of the unmanned aerial vehicles, the purpose of autonomous decision making of the multiple unmanned aerial vehicles can be achieved, and the highefficiency of task decision making of the unmanned aerial vehicles can also be guaranteed in an unknown combat environment.

Description

technical field [0001] The invention belongs to the field of flight control, and in particular relates to a method for multi-unmanned aerial vehicle task decision-making. Background technique [0002] For the military of various countries, drones will become one of the indispensable weapons in the future battlefield. UAVs are likely to become the target of attacks and counterattacks by multiple combat platforms, and become the most common and deadly air combat "sharp sword". Although the existing multi-UAV system can complete some complex combat tasks, as a huge fleet coordination system, the traditional multi-UAV mission decision-making is generally carried out under the conditions of the known battlefield environment. If the set combat plan cannot meet the actual combat environment, it is difficult to make a timely response plan. Therefore, it is particularly important for future UAV operations to find a method that can quickly and efficiently assign tasks to multiple UA...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G05D1/10
CPCG05D1/104
Inventor 李波甘志刚越凯强高晓光万开方高佩忻
Owner NORTHWESTERN POLYTECHNICAL UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products