Unmanned aerial vehicle cluster collaborative learning method based on multi-agent reinforcement learning

A reinforcement learning, multi-agent technology, applied in neural learning methods, computer components, design optimization/simulation, etc., can solve problems such as inability to effectively obtain global information, inability to achieve autonomous decision-making, and limited vision of UAV information sensors

Pending Publication Date: 2020-12-25
NANJING UNIV
View PDF0 Cites 13 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, in a MADDPG-based UAV cluster, a single UAV information sensor (camera, etc.) has a limited fie...

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Unmanned aerial vehicle cluster collaborative learning method based on multi-agent reinforcement learning
  • Unmanned aerial vehicle cluster collaborative learning method based on multi-agent reinforcement learning
  • Unmanned aerial vehicle cluster collaborative learning method based on multi-agent reinforcement learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0029] Below in conjunction with specific embodiment, further illustrate the present invention, should be understood that these embodiments are only used to illustrate the present invention and are not intended to limit the scope of the present invention, after having read the present invention, those skilled in the art will understand various equivalent forms of the present invention All modifications fall within the scope defined by the appended claims of the present application.

[0030] A collaborative learning method for UAV clusters based on multi-agent reinforcement learning. In UAV cluster control, each UAV plays the role of Student and Teacher at the same time. It follows the idea of ​​MADDPG centralized training and distributed execution. UAV swarm collaborative learning. Include the following steps:

[0031] Step 1: Construct an aerodynamic environment simulator based on Unity3D and build a UAV cooperation cluster.

[0032] Step 2: The initial teammate information i...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an unmanned aerial vehicle cluster collaborative learning method based on multi-agent reinforcement learning. The method comprises the steps of constructing an environment simulator based on aerodynamics; enabling each unmanned aerial vehicle to acquire and maintain a local observation value of the unmanned aerial vehicle; enabling each unmanned aerial vehicle to acquire and maintain a suggested observation value from a teammate as a student; providing a guidance value for other unmanned aerial vehicles as a teacher; executing an action strategy based on the local observation value of the machine and the suggested observation value obtained from the teammate, obtaining a reward and transferring to the next state; performing training based on the MADDPG thought untilthe value network and the strategy network converge; carrying out the execution stage in a distributed mode, that is, each unmanned aerial vehicle executes an action strategy based on the local observation value and the suggested observation value provided by the teammate. Complementation of observation values between unmanned aerial vehicle clusters can be achieved on the premise of relatively low cost, independent autonomous decision of the unmanned aerial vehicles is achieved, and the communication problem caused by a 'master-slave' structure is solved.

Description

technical field [0001] The invention relates to a multi-agent reinforcement learning-based cooperative learning method for unmanned aerial vehicle clusters, which belongs to the technical field of unmanned aerial vehicle cluster cooperation. Background technique [0002] With the advancement of science and technology and the improvement of UAV technology, UAV swarms are becoming more and more important in daily life, and are widely used in daily transportation, disaster rescue, military games and other fields. In the existing drone cluster cooperation, the drone cluster is mainly controlled in a "master-slave" manner, that is, the "master" drone is mainly responsible for processing the data obtained by each drone and distributing instructions to the participants. The "slave" drone. This type of method has strict requirements on the communication channel. If the communication channel is interfered or maliciously attacked, the information obtained by the drone cluster of the ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F30/15G06F30/27G06F30/28G06K9/62G06N3/04G06N3/08
CPCG06F30/15G06F30/28G06F30/27G06N3/08G06N3/045G06F18/214Y02T90/00
Inventor 俞扬詹德川周志华袁雷张云天付聪庞竟成罗凡明贾俊华
Owner NANJING UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products