Unmanned aerial vehicle control method and system based on multi-agent deep reinforcement learning

A multi-agent, reinforcement learning technology, applied in neural learning methods, control/regulation systems, vehicle position/route/altitude control, etc., can solve problems such as making appropriate strategies to speed up training and reduce response delays , to avoid the effect of delay

Active Publication Date: 2021-01-22
SUN YAT SEN UNIV
View PDF4 Cites 17 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The purpose of the present invention is to provide a UAV control method and system based on multi-agent deep reinforcement learning to solve the problem that UAV systems are difficult to perform in a short time delay when facing various complex tasks and environments. technical issues

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Unmanned aerial vehicle control method and system based on multi-agent deep reinforcement learning
  • Unmanned aerial vehicle control method and system based on multi-agent deep reinforcement learning
  • Unmanned aerial vehicle control method and system based on multi-agent deep reinforcement learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0047] Explanation of terms:

[0048] Computation offloading: Computation offloading is the transfer of resource-intensive computing tasks to separate processors (such as hardware accelerators) or external platforms (such as cloud servers, edge servers). Offloading to a coprocessor can be used to accelerate applications, including image rendering and mathematical calculations. Offloading computation to external platforms over the network can provide computing power and overcome hardware limitations of devices, such as limited computing power, storage, and energy.

[0049] Multi-agent deep reinforcement learning (Multi-agent deep reinforcement learning): In a multi-agent system, each agent learns to improve its strategy by interacting with the environment to obtain a reward value (reward), so as to obtain the best process of optimal strategy.

[0050] Attention mechanism: The attention mechanism in deep learning is essentially similar to the selective mechanism of human being...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides an unmanned aerial vehicle control method and system based on multi-agent deep reinforcement learning. The unmanned aerial vehicle control method comprises the steps of: establishing an information acquisition task model according to parameters of an unmanned aerial vehicle group information acquisition system, wherein an information acquisition task is divided into an acquisition subtask and a calculation subtask; constructing a deep neural network model according to the task model, and training the deep neural network model by using a multi-agent deep reinforcement learning algorithm combined with an attention mechanism; and controlling an unmanned aerial vehicle group in an actual environment to complete an information acquisition task by using the trained deep neural network model. According to the unmanned aerial vehicle control method and the system, each unmanned aerial vehicle is used as an intelligent agent, a critic network with an attention unit is used for evaluating the performance of an actor network, and the training speed of the actor network can be accelerated with a more accurate evaluation value; and when an information acquisition task isexecuted, each unmanned aerial vehicle does not need to communicate with other intelligent agents, so that the communication time delay is reduced.

Description

technical field [0001] The present invention relates to the technical field of wireless communication, in particular to a control method and system for unmanned aerial vehicles based on multi-agent deep reinforcement learning. Background technique [0002] Unmanned Aerial Vehicles (UAV) is an unmanned aircraft that is remotely controlled by an operator through a radio remote control device or automatically controlled by a computer program. Most of the applications of UAVs are information collection tasks. In the prior art, the control instructions for multi-UAV system data collection tasks are mainly solved by two methods, namely the heuristic method and the method based on machine learning. [0003] Among them, the heuristic algorithm needs to go through multiple rounds of calculations after receiving the task to get the best information collection and calculation migration plan, resulting in a large time delay, which is not conducive to some urgent tasks; the depth enhance...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G05D1/10G06N3/04G06N3/08
CPCG05D1/104G06N3/08G06N3/045
Inventor 陈武辉杨志华郑子彬
Owner SUN YAT SEN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products