Multi-unmanned aerial vehicle cooperative attack and defense confrontation method based on strategy set MADDPG

A multi-UAV and UAV technology, applied in the direction of non-electric variable control, instrument, control/adjustment system, etc., can solve dynamic instability and other problems, and achieve the effect of high attack and defense efficiency and good coordinated attack and defense confrontation ability

Pending Publication Date: 2021-12-03
NANJING UNIV OF AERONAUTICS & ASTRONAUTICS
View PDF9 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Among them, the most widely used is Deep Deterministic Policy Gradient (DDPG) and its improved algorithm. This algorithm uses the policy network to directly output actions and can cope with the output of continuous actions, but there is an important problem, because each The strategy of the agent is updated and iterated, resulting in the environment being dynamically unstable for a specific agent

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Multi-unmanned aerial vehicle cooperative attack and defense confrontation method based on strategy set MADDPG
  • Multi-unmanned aerial vehicle cooperative attack and defense confrontation method based on strategy set MADDPG
  • Multi-unmanned aerial vehicle cooperative attack and defense confrontation method based on strategy set MADDPG

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0022] The technical solution of the present invention is described in detail in combination with the accompanying drawings.

[0023] Such as figure 1 As shown, the present invention is based on the strategy set MADDPG multi-UAV cooperative attack and defense confrontation method, which specifically includes the following steps:

[0024] Step 1. Establish a schematic diagram of multi-UAV cooperative offensive and defensive confrontation operations, including mission targets, hidden areas, and obstacle locations, as well as offensive and defensive UAVs, and then construct multi-UAV offensive and defensive confrontation combat missions, including target attack and defense and There are two aspects of UAV chasing and fleeing. The specific process is as follows:

[0025] (1.1) Establish a schematic diagram of multi-UAV cooperative attack and defense confrontation, such as figure 2 As shown, in the area (0km~2km, 7km~10km), 3 offensive UAVs are randomly generated, and the initia...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a multi-unmanned aerial vehicle cooperative attack and defense confrontation method based on a strategy set MADDPG. The method comprises the following steps: firstly, constructing a multi-unmanned aerial vehicle cooperative attack and defense confrontation combat mission environment; secondly, establishing a combined state space and a combined action space of the multi-unmanned aerial vehicle system; then, designing a reward function based on a group target for the multi-unmanned aerial vehicle attack and defense confrontation problem, wherein the reward function comprises two parts, namely, an attack-party unmanned aerial vehicle and a defense-party unmanned aerial vehicle; thirdly, training the network based on the strategy set MADDPG; and finally, realizing a multi-unmanned aerial vehicle cooperative attack and defense confrontation decision by using the trained network model. According to the method, the MADDPG algorithm is improved, the reward function based on a group target is designed, learning of a cooperative attack and defense strategy is guided, the problem that a single agent in a multi-unmanned aerial vehicle system has a strong over-fitting strategy for competitors of the agent is effectively solved by using a strategy set, the attack and defense efficiency is higher, therefore, the unmanned aerial vehicle is endowed with a better cooperative attack and defense confrontation capability.

Description

technical field [0001] The invention belongs to the technical field of air combat decision-making, and specifically relates to a multi-unmanned aerial vehicle cooperative offensive and defensive confrontation method based on a strategy set MADDPG. [0002] technical background [0003] With the continuous advancement of computer technology, reinforcement learning algorithms have been widely used in the research of UAV air combat maneuver decision-making. In a multi-UAV system, while one UAV is learning, other UAVs are also learning, and the actions they perform affect the environment, making the current learning environment change, which will make the multi-UAV system not If the MDP model is satisfied, then reinforcement learning cannot be directly applied to multi-UAV systems. In order to solve this problem, some algorithms use the reinforcement learning method of state prediction to solve the multi-UAV intelligent decision-making problem. Each UAV first uses the past state...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G05D1/10
CPCG05D1/104Y02T10/40
Inventor 丁勇聂志诚何金
Owner NANJING UNIV OF AERONAUTICS & ASTRONAUTICS
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products