Reinforcement learning based air combat maneuver decision making method of unmanned aerial vehicle (UAV)

A technology of reinforcement learning and decision-making methods, applied in non-electric variable control, instrument, attitude control and other directions, can solve problems such as difficult computerized maneuver decision-making, difficult air combat mission situational space, etc.

Inactive Publication Date: 2018-07-24
NORTHWESTERN POLYTECHNICAL UNIV
View PDF9 Cites 56 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Since the situation of air combat is more complex than other tasks, it is difficult to fully cover the situation space of air com

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Reinforcement learning based air combat maneuver decision making method of unmanned aerial vehicle (UAV)
  • Reinforcement learning based air combat maneuver decision making method of unmanned aerial vehicle (UAV)
  • Reinforcement learning based air combat maneuver decision making method of unmanned aerial vehicle (UAV)

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0043] The present invention will be further described below in conjunction with the accompanying drawings and embodiments, and the present invention includes but not limited to the following embodiments.

[0044] The present invention completes the establishment of the entire reinforcement learning maneuvering decision-making algorithm from two aspects of state space description and environment modeling, and the main work includes the following contents:

[0045] 1) The division and description of the state space, using the fuzzy method to fuzzify each state in the air combat situation, as the state input of reinforcement learning.

[0046] 2) The construction of the reinforcement learning environment in the air combat process, constructing the motion control model of the UAV, clarifying the action space and state transition function of the reinforcement learning, and constructing the air combat advantage function based on various elements of the air combat situation, as the r...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a reinforcement learning based air combat maneuver decision making method of a UAV. A motion model of an airplane platform is created; principle factors that influence the air combat situation are analyzed; on the basis of the motion model and analysis on the air combat situation factors, a dynamic fuzzy Q learning model of air combat maneuver decision making is designed, and essential factors and an algorithm flow of reinforcement learning are determined; a state space of air combat maneuver decision making is fuzzified and serves as state input of reinforcement learning; typical air combat motions are selected as basic motions of reinforcement learning, and the triggering intensities of fuzzy rules are summed in a weighted manner, and a continuous motion space is covered; and on the basis of an established air combat dominant function, a return value of reinforcement learning is set in a rewards and punishment values weighing-superposing method. Thus, the autonomous maneuver decision making capability of the UAV during air combat can be improved effectively, the robustness is higher, an autonomous searching optimization performance is higher, and decisionsmade by the UAV are improved continuously in continuous simulation and learning.

Description

technical field [0001] The invention belongs to the technical field of artificial intelligence, and in particular relates to an implementation method for air combat maneuver decision-making of unmanned aircraft. Background technique [0002] At present, drones have been able to complete tasks such as reconnaissance, surveillance and ground attack, and are playing an increasingly irreplaceable role in modern warfare. However, due to the higher real-time requirements for air combat, the current ground station remote control method for UAVs is difficult to achieve accurate and timely control of UAVs in order to gain an advantage in air combat. Therefore, improving the intelligence level of UAVs and enabling UAVs to automatically generate control commands to complete maneuvers in air combat according to the situational environment is the current main research direction. [0003] The essence of allowing UAVs to complete autonomous decision-making in air combat maneuvers is to co...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G05D1/08G05D1/10
CPCG05D1/0808G05D1/101
Inventor 杨啟明张建东吴勇史国庆朱岩徐建城莫文莉
Owner NORTHWESTERN POLYTECHNICAL UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products