Unmanned aerial vehicle cluster collaborative learning method based on multi-agent reinforcement learning

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A reinforcement learning, multi-agent technology, applied in neural learning methods, computer components, design optimization/simulation, etc., can solve problems such as inability to effectively obtain global information, inability to achieve autonomous decision-making, and limited vision of UAV information sensors

Pending Publication Date: 2020-12-25

NANJING UNIV

View PDF0 Cites 13 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

However, in a MADDPG-based UAV cluster, a single UAV information sensor (camera, etc.) has a limited field of vision, cannot effectively obtain global information, and cannot achieve true autonomous decision-making.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0029] Below in conjunction with specific embodiment, further illustrate the present invention, should be understood that these embodiments are only used to illustrate the present invention and are not intended to limit the scope of the present invention, after having read the present invention, those skilled in the art will understand various equivalent forms of the present invention All modifications fall within the scope defined by the appended claims of the present application.

[0030] A collaborative learning method for UAV clusters based on multi-agent reinforcement learning. In UAV cluster control, each UAV plays the role of Student and Teacher at the same time. It follows the idea of MADDPG centralized training and distributed execution. UAV swarm collaborative learning. Include the following steps:

[0031] Step 1: Construct an aerodynamic environment simulator based on Unity3D and build a UAV cooperation cluster.

[0032] Step 2: The initial teammate information i...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses an unmanned aerial vehicle cluster collaborative learning method based on multi-agent reinforcement learning. The method comprises the steps of constructing an environment simulator based on aerodynamics; enabling each unmanned aerial vehicle to acquire and maintain a local observation value of the unmanned aerial vehicle; enabling each unmanned aerial vehicle to acquire and maintain a suggested observation value from a teammate as a student; providing a guidance value for other unmanned aerial vehicles as a teacher; executing an action strategy based on the local observation value of the machine and the suggested observation value obtained from the teammate, obtaining a reward and transferring to the next state; performing training based on the MADDPG thought untilthe value network and the strategy network converge; carrying out the execution stage in a distributed mode, that is, each unmanned aerial vehicle executes an action strategy based on the local observation value and the suggested observation value provided by the teammate. Complementation of observation values between unmanned aerial vehicle clusters can be achieved on the premise of relatively low cost, independent autonomous decision of the unmanned aerial vehicles is achieved, and the communication problem caused by a 'master-slave' structure is solved.

Description

technical field [0001] The invention relates to a multi-agent reinforcement learning-based cooperative learning method for unmanned aerial vehicle clusters, which belongs to the technical field of unmanned aerial vehicle cluster cooperation. Background technique [0002] With the advancement of science and technology and the improvement of UAV technology, UAV swarms are becoming more and more important in daily life, and are widely used in daily transportation, disaster rescue, military games and other fields. In the existing drone cluster cooperation, the drone cluster is mainly controlled in a "master-slave" manner, that is, the "master" drone is mainly responsible for processing the data obtained by each drone and distributing instructions to the participants. The "slave" drone. This type of method has strict requirements on the communication channel. If the communication channel is interfered or maliciously attacked, the information obtained by the drone cluster of the ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G06F30/15G06F30/27G06F30/28G06K9/62G06N3/04G06N3/08

CPCG06F30/15G06F30/28G06F30/27G06N3/08G06N3/045G06F18/214Y02T90/00

Inventor 俞扬詹德川周志华袁雷张云天付聪庞竟成罗凡明贾俊华

Owner NANJING UNIV

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Unmanned aerial vehicle cluster collaborative learning method based on multi-agent reinforcement learning

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology