Multi-unmanned aerial vehicle task decision-making method based on MADDPG

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A multi-UAV and decision-making technology, applied in the field of flight control, can solve problems such as increased environmental complexity, increased variance, and unstable environment

Active Publication Date: 2020-11-03

NORTHWESTERN POLYTECHNICAL UNIV

View PDF10 Cites 35 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

However, in a multi-UAV environment, it is difficult for traditional reinforcement learning methods to play a role, because in a multi-UAV environment, each UAV is constantly changing, and the environment is no longer stable, and for traditional reinforcement learning algorithms For the strategy gradient method in , as the number of drones increases, the complexity of the environment also increases, which leads to the optimization method of estimating the gradient by sampling, the variance increases sharply, and it is difficult to calculate the final result

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment

[0149] In this example, the final network structure is designed as: Actor network structure is [56; 56; 2] fully connected neural network, critic network structure is [118; 78; 36; 1] fully connected neural network, two neural networks The hidden layer uses the RELU function as the activation function, such as Figure 6 shown. The mini-batch size during training is 1024, the maximum learning step size (maxepisode) is 30000, the update rate of the auxiliary network is τ=0.01, the learning rate of the Critic network is 0.01, and the learning rate of the Actor network is 0.001. Both networks are The AdamOptimizer optimizer is used for learning, and the size of the experience pool is 1,000,000. Once the data in the experience pool exceeds the maximum value, the original experience data will be lost, and the performance of the multi-UAV task decision-making network constructed is optimal.

[0150] The present invention initializes the positions of three unmanned aerial vehicles in...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a multi-unmanned aerial vehicle task decision method based on MADDPG, which introduces an MADDPG algorithm into multi-unmanned aerial vehicle task allocation, and comprises thesteps of firstly establishing a two-dimensional combat environment model required by deep reinforcement learning according to the actual combat environment of multiple unmanned aerial vehicles, secondly establishing mathematical description of various threats such as air defense missiles in the combat environment of the multiple unmanned aerial vehicles, and finally, taking the tracks and distances of the multiple unmanned aerial vehicles and the defense threats of the battlefield as constraint conditions, and performing learning training to obtain a multi-unmanned aerial vehicle task decision model. According to the method, an experience pool and a double-network structure are adopted, so that the operation and convergence speed of the whole network is greatly improved, a result can be obtained more quickly in the high-speed flight process of the unmanned aerial vehicles, the purpose of autonomous decision making of the multiple unmanned aerial vehicles can be achieved, and the highefficiency of task decision making of the unmanned aerial vehicles can also be guaranteed in an unknown combat environment.

Description

technical field [0001] The invention belongs to the field of flight control, and in particular relates to a method for multi-unmanned aerial vehicle task decision-making. Background technique [0002] For the military of various countries, drones will become one of the indispensable weapons in the future battlefield. UAVs are likely to become the target of attacks and counterattacks by multiple combat platforms, and become the most common and deadly air combat "sharp sword". Although the existing multi-UAV system can complete some complex combat tasks, as a huge fleet coordination system, the traditional multi-UAV mission decision-making is generally carried out under the conditions of the known battlefield environment. If the set combat plan cannot meet the actual combat environment, it is difficult to make a timely response plan. Therefore, it is particularly important for future UAV operations to find a method that can quickly and efficiently assign tasks to multiple UA...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): G05D1/10

CPCG05D1/104

Inventor 李波甘志刚越凯强高晓光万开方高佩忻

Owner NORTHWESTERN POLYTECHNICAL UNIV

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Multi-unmanned aerial vehicle task decision-making method based on MADDPG

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology