Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

A Decision-Making Method for UAV Air Combat Maneuver Based on Reinforcement Learning

A technology of reinforcement learning and decision-making methods, applied in non-electric variable control, instruments, three-dimensional position/course control, etc., can solve problems such as difficult air combat mission situation space, difficult to calculate maneuvering action decision-making, etc.

Inactive Publication Date: 2020-09-22
NORTHWESTERN POLYTECHNICAL UNIV
View PDF9 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Since the situation of air combat is more complex than other tasks, it is difficult to fully cover the situation space of air combat missions by manual pre-programming methods, and it is even more difficult to calculate and produce optimal maneuver decisions

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A Decision-Making Method for UAV Air Combat Maneuver Based on Reinforcement Learning
  • A Decision-Making Method for UAV Air Combat Maneuver Based on Reinforcement Learning
  • A Decision-Making Method for UAV Air Combat Maneuver Based on Reinforcement Learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0043] The present invention will be further described below in conjunction with the accompanying drawings and embodiments, and the present invention includes but not limited to the following embodiments.

[0044] The present invention completes the establishment of the entire reinforcement learning maneuvering decision-making algorithm from two aspects of state space description and environment modeling, and the main work includes the following contents:

[0045] 1) The division and description of the state space, using the fuzzy method to fuzzify each state in the air combat situation, as the state input of reinforcement learning.

[0046] 2) The construction of the reinforcement learning environment in the air combat process, constructing the motion control model of the UAV, clarifying the action space and state transition function of the reinforcement learning, and constructing the air combat advantage function based on various elements of the air combat situation, as the r...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a reinforcement learning based air combat maneuver decision making method of a UAV. A motion model of an airplane platform is created; principle factors that influence the air combat situation are analyzed; on the basis of the motion model and analysis on the air combat situation factors, a dynamic fuzzy Q learning model of air combat maneuver decision making is designed, and essential factors and an algorithm flow of reinforcement learning are determined; a state space of air combat maneuver decision making is fuzzified and serves as state input of reinforcement learning; typical air combat motions are selected as basic motions of reinforcement learning, and the triggering intensities of fuzzy rules are summed in a weighted manner, and a continuous motion space is covered; and on the basis of an established air combat dominant function, a return value of reinforcement learning is set in a rewards and punishment values weighing-superposing method. Thus, the autonomous maneuver decision making capability of the UAV during air combat can be improved effectively, the robustness is higher, an autonomous searching optimization performance is higher, and decisionsmade by the UAV are improved continuously in continuous simulation and learning.

Description

technical field [0001] The invention belongs to the technical field of artificial intelligence, and in particular relates to an implementation method for air combat maneuver decision-making of unmanned aircraft. Background technique [0002] At present, drones have been able to complete tasks such as reconnaissance, surveillance and ground attack, and are playing an increasingly irreplaceable role in modern warfare. However, due to the higher real-time requirements for air combat, the current ground station remote control method for UAVs is difficult to achieve accurate and timely control of UAVs in order to gain an advantage in air combat. Therefore, improving the intelligence level of UAVs and enabling UAVs to automatically generate control commands to complete maneuvers in air combat according to the situational environment is the current main research direction. [0003] The essence of allowing UAVs to complete autonomous decision-making in air combat maneuvers is to co...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G05D1/08G05D1/10
CPCG05D1/0808G05D1/101
Inventor 杨啟明张建东吴勇史国庆朱岩徐建城莫文莉
Owner NORTHWESTERN POLYTECHNICAL UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products