Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Unmanned aerial vehicle maneuvering avoidance decision-making method based on deep reinforcement learning

A technology that reinforces learning and decision-making methods, applied in non-electric variable control, vehicle position/route/altitude control, instruments, etc., can solve problems such as policies tending to be unsatisfactory behaviors, achieve a wide range of application scenarios, and improve data utilization efficiency , The system framework is complete and reliable

Active Publication Date: 2022-05-17
NANTONG UNIVERSITY
View PDF6 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

But designing reward functions is challenging for more complex tasks that require domain-specific knowledge
In addition, reward shaping may bias the policy toward undesirable behavior and limit the agent's access to safe actions

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Unmanned aerial vehicle maneuvering avoidance decision-making method based on deep reinforcement learning
  • Unmanned aerial vehicle maneuvering avoidance decision-making method based on deep reinforcement learning
  • Unmanned aerial vehicle maneuvering avoidance decision-making method based on deep reinforcement learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0064] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments. Of course, the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0065] The present invention provides a UAV maneuver avoidance decision-making method based on deep reinforcement learning, such as figure 1 shown, including the following five steps:

[0066] Step 1. Obtain the situation information of the enemy and the enemy according to the UAV airborne sensor system;

[0067] Step 2. Construct the UAV maneuver avoidance decision-making deep reinforcement learning model structure;

[0068] Step 3, constructing a hierarchical goal-oriented learning model structure;

[0069] Step 4. Complete the learning of the UAV maneuver avoidance decision-making method according to the inter...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides an unmanned aerial vehicle maneuvering avoidance decision-making method based on deep reinforcement learning, and belongs to the technical field of unmanned aerial vehicles. According to the technical scheme, the method comprises the following steps: S1, acquiring friend or foe situation information according to an airborne sensor system of an unmanned aerial vehicle; s2, constructing an unmanned aerial vehicle maneuvering avoidance decision deep reinforcement learning model structure; s3, constructing a hierarchical target-oriented learning model structure; s4, completing unmanned aerial vehicle maneuvering avoidance decision-making method learning according to interactive training; and S5, deploying and applying the unmanned aerial vehicle maneuvering avoidance decision-making method. The method has the advantages that the unmanned aerial vehicle can be endowed with the learning ability from shallow to deep, the unmanned aerial vehicle can autonomously complete maneuvering avoidance decisions, and the survival ability of the unmanned aerial vehicle on a battlefield is improved.

Description

technical field [0001] The invention relates to the technical field of unmanned aerial vehicles, in particular to a decision-making method for maneuvering and avoiding unmanned aerial vehicles based on deep reinforcement learning. Background technique [0002] At present, under the background of airspace integrated intelligent combat, UAV technology has been widely used in military operations. It has achieved great success in many fields, such as cooperative reconnaissance, ground attack, destroying the enemy's air defense system, etc. In the application process, effectively avoiding threats and improving the survivability of UAVs in dynamic confrontation environments is the key to the success of combat missions. With the complexity and changeability of the modern battlefield environment, it is becoming more and more difficult for operators to complete complex flight tasks through manual operation. The survivability and combat capability of UAVs play a vital role in milita...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G05D1/10
CPCG05D1/101
Inventor 袁银龙焦朋朋戴傲寒许亚龙华亮程赟张雷李俊红傅怀梁
Owner NANTONG UNIVERSITY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products