A Deep Reinforcement Learning Approach for Continuous Coverage of Specific Areas by Multi-UAV Systems

A multi-UAV and reinforcement learning technology, applied in the field of deep reinforcement learning, can solve the problem of drone coordination without considering the continuous coverage of the area, without dealing with the control problem of heterogeneous drones, and without minimizing the cycle of continuous coverage and other issues to achieve the effect of minimizing the coverage period, increasing the convergence speed, and improving the coverage performance

Active Publication Date: 2022-06-21
NAT UNIV OF DEFENSE TECH
View PDF10 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] Among the traditional methods mentioned above, some methods only study coverage-related issues, and do not consider the continuous coverage of the area and the coordination between UAVs; although some methods study the problem of continuous coverage, they do not minimize the continuous coverage. coverage period, and these methods are poor in robustness, weak in scalability, and do not deal with the control problem of heterogeneous UAVs

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A Deep Reinforcement Learning Approach for Continuous Coverage of Specific Areas by Multi-UAV Systems
  • A Deep Reinforcement Learning Approach for Continuous Coverage of Specific Areas by Multi-UAV Systems
  • A Deep Reinforcement Learning Approach for Continuous Coverage of Specific Areas by Multi-UAV Systems

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0044] The present invention will be further described in detail below with reference to the accompanying drawings and specific embodiments.

[0045] like figure 1 As shown, the deep reinforcement learning method for continuous coverage of a specific area by a multi-UAV system of the present invention includes the following steps:

[0046] Step S1: establish a deep convolutional neural network model for feature extraction of the area to be covered;

[0047]By using the convolutional neural network to extract the features of the area to be covered, the spatial structure information of the area can be effectively used, and the coverage performance can be improved; by using the local perception characteristics of the convolutional neural network, local coordinated communication between UAVs can be realized; The weight sharing feature of convolutional neural network greatly reduces model parameters, thereby improving the convergence speed of model training.

[0048] In step S1, ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a deep reinforcement learning method for continuous coverage of a specific area by a multi-unmanned aerial vehicle system. The steps include: step S1: establishing a deep convolutional neural network model for feature extraction of the area to be covered; step S2: based on reinforcement learning actor‑critic network, establish a deep reinforcement learning model for multi-UAV systems to continuously cover a specific area, respectively establish a multi-UAV system control decision model and a bidirectional cyclic neural network model for action value functions; An individual reward function is designed for the UAV decision-making system; Step S4: Based on the reinforcement learning policy gradient method, the neural network model in steps S1 and S2 is trained. The invention has the advantages of good robust performance, strong scalability, short coverage period, good coordination and the like.

Description

technical field [0001] The invention mainly relates to the technical field of coordinated control of unmanned aerial vehicles, in particular to a deep reinforcement learning method used for a multi-unmanned aerial vehicle system to continuously cover a specific area. Background technique [0002] UAV swarm area coverage is an important research problem, which mainly solves how to cover a designated area through large-scale UAVs, so as to achieve the best performance of the swarm system, such as the shortest coverage time and the highest coverage rate. The area coverage of UAV swarms has a very wide range of applications, such as geographic mapping, search and rescue, and disaster monitoring. [0003] The early research on coverage planning is mainly aimed at the method research of single-machine coverage of designated area, such as scanning method, area segmentation, and process planning. In recent years, researchers have focused on multi-UAV cooperative area coverage, such...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06V20/17G06V10/40G06V10/82G06N3/04G06N3/08
CPCG06N3/08G06V20/13G06V10/40G06N3/045
Inventor 王楠孙兆梅牛轶峰康瀚文林弘丁宇航李雄
Owner NAT UNIV OF DEFENSE TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products