Unmanned aerial vehicle control method and system based on multi-agent deep reinforcement learning

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A multi-agent, reinforcement learning technology, applied in neural learning methods, control/regulation systems, vehicle position/route/altitude control, etc., can solve problems such as making appropriate strategies to speed up training and reduce response delays , to avoid the effect of delay

Active Publication Date: 2021-01-22

SUN YAT SEN UNIV

View PDF4 Cites 17 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0005] The purpose of the present invention is to provide a UAV control method and system based on multi-agent deep reinforcement learning to solve the problem that UAV systems are difficult to perform in a short time delay when facing various complex tasks and environments. technical issues

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0047] Explanation of terms:

[0048] Computation offloading: Computation offloading is the transfer of resource-intensive computing tasks to separate processors (such as hardware accelerators) or external platforms (such as cloud servers, edge servers). Offloading to a coprocessor can be used to accelerate applications, including image rendering and mathematical calculations. Offloading computation to external platforms over the network can provide computing power and overcome hardware limitations of devices, such as limited computing power, storage, and energy.

[0049] Multi-agent deep reinforcement learning (Multi-agent deep reinforcement learning): In a multi-agent system, each agent learns to improve its strategy by interacting with the environment to obtain a reward value (reward), so as to obtain the best process of optimal strategy.

[0050] Attention mechanism: The attention mechanism in deep learning is essentially similar to the selective mechanism of human being...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention provides an unmanned aerial vehicle control method and system based on multi-agent deep reinforcement learning. The unmanned aerial vehicle control method comprises the steps of: establishing an information acquisition task model according to parameters of an unmanned aerial vehicle group information acquisition system, wherein an information acquisition task is divided into an acquisition subtask and a calculation subtask; constructing a deep neural network model according to the task model, and training the deep neural network model by using a multi-agent deep reinforcement learning algorithm combined with an attention mechanism; and controlling an unmanned aerial vehicle group in an actual environment to complete an information acquisition task by using the trained deep neural network model. According to the unmanned aerial vehicle control method and the system, each unmanned aerial vehicle is used as an intelligent agent, a critic network with an attention unit is used for evaluating the performance of an actor network, and the training speed of the actor network can be accelerated with a more accurate evaluation value; and when an information acquisition task isexecuted, each unmanned aerial vehicle does not need to communicate with other intelligent agents, so that the communication time delay is reduced.

Description

technical field [0001] The present invention relates to the technical field of wireless communication, in particular to a control method and system for unmanned aerial vehicles based on multi-agent deep reinforcement learning. Background technique [0002] Unmanned Aerial Vehicles (UAV) is an unmanned aircraft that is remotely controlled by an operator through a radio remote control device or automatically controlled by a computer program. Most of the applications of UAVs are information collection tasks. In the prior art, the control instructions for multi-UAV system data collection tasks are mainly solved by two methods, namely the heuristic method and the method based on machine learning. [0003] Among them, the heuristic algorithm needs to go through multiple rounds of calculations after receiving the task to get the best information collection and calculation migration plan, resulting in a large time delay, which is not conducive to some urgent tasks; the depth enhance...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): G05D1/10G06N3/04G06N3/08

CPCG05D1/104G06N3/08G06N3/045

Inventor 陈武辉杨志华郑子彬

Owner SUN YAT SEN UNIV

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Unmanned aerial vehicle control method and system based on multi-agent deep reinforcement learning

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology