Real-time path planning method for unmanned aerial vehicle based on deep reinforcement learning

A real-time path planning and reinforcement learning technology, applied in vehicle position/route/altitude control, instruments, 3D position/channel control, etc., can solve the problem of inapplicability of algorithms, etc., to improve autonomous flight capability, strong real-time performance and adaptability sexual effect

Active Publication Date: 2019-11-22
NAT UNIV OF DEFENSE TECH
View PDF11 Cites 21 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The above methods have their own advantages under specific conditions, but they all need to establish an environment or platform model in advance. When the environment information cannot be obtained in advance or the problem model is too complex, the above algorithms are often not applicable

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Real-time path planning method for unmanned aerial vehicle based on deep reinforcement learning
  • Real-time path planning method for unmanned aerial vehicle based on deep reinforcement learning
  • Real-time path planning method for unmanned aerial vehicle based on deep reinforcement learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0046] The present invention will be further described in detail below in conjunction with the accompanying drawings and specific embodiments.

[0047] Such as figure 1 and Figure 6 As shown, a kind of unmanned aerial vehicle real-time path planning method based on deep reinforcement learning of the present invention comprises:

[0048] Step S1, offline training phase: obtain the current environmental state of the UAV from the simulation environment, calculate the threat level of the target defense unit to the UAV according to the situation assessment model, and construct a situation map of the UAV mission area; construct a convolution Main network and target network of neural network and competing neural network for action selection;

[0049] Step S2, online execution stage: According to the current environmental state of the UAV obtained from the communication link, the threat value of the target object defense unit to the UAV is calculated according to the situation asse...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a real-time path planning method for an unmanned aerial vehicle based on deep reinforcement learning. The method comprises the steps of S1, obtaining the current environment state of the unmanned aerial vehicle from a simulation environment, calculating the threat degree of a target object defense unit to the unmanned aerial vehicle according to a situation evaluation model, and constructing a situation map of a task area of the unmanned aerial vehicle; constructing a main network and a target network of the convolutional neural network and the competitive neural network to perform action selection; S2, obtaining the current environment state of the unmanned aerial vehicle according to the communication link, calculating a threat value of the target object defense unit to the unmanned aerial vehicle according to the situation evaluation model, constructing a situation map of the task areas of the unmanned aerial vehicle, constructing a competitive dual-Q network, loading the trained network model, evaluating the Q value of each action in the current state, selecting the action corresponding to the maximum Q value, determining the flight direction of the unmanned aerial vehicle, and completing the flight task. According to the invention, the autonomous decision-making ability of the unmanned aerial vehicle can be effectively improved, and the method has high robustness and application value.

Description

technical field [0001] The invention mainly relates to the technical field of unmanned aerial vehicles, in particular to a real-time path planning method for unmanned aerial vehicles based on deep reinforcement learning. Background technique [0002] With the continuous development of UAV system technology, UAV (Unmanned Aerial Vehicle, UAV) has been widely used in various military operations and civilian tasks such as industrial inspection, disaster search and rescue, geographic mapping, border patrol, military reconnaissance, etc. . Considering various influencing factors comprehensively, such as UAV range, target location, external threats, etc., planning the optimal path for UAV plays an important role in ensuring the successful completion of the flight mission. Therefore, UAV path planning is the basic and key technology necessary for UAVs to perform various military or civilian tasks. Although UAVs have made great progress in operational autonomy in recent years, it ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G05D1/10
CPCG05D1/101Y02T10/40
Inventor 相晓嘉闫超王菖牛轶峰尹栋吴立珍陈紫叶
Owner NAT UNIV OF DEFENSE TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products