Reinforcement learning based air combat maneuver decision making method of unmanned aerial vehicle (UAV)

A reinforcement learning and decision-making technology, applied in non-electric variable control, instruments, attitude control, and related fields, that addresses problems such as the difficulty of computer-based maneuver decision-making and the difficulty of covering the situation space of air combat missions.

Inactive Publication Date: 2018-07-24

NORTHWESTERN POLYTECHNICAL UNIV

Cites: 9. Cited by: 56.


AI Technical Summary

Problems solved by technology

Since the situation of air combat is more complex than other tasks, it is difficult to fully cover the situation space of air com


Embodiment Construction

[0043] The present invention is further described below with reference to the accompanying drawings and embodiments; it includes, but is not limited to, the following embodiments.

[0044] The present invention establishes the complete reinforcement learning maneuver decision-making algorithm from two aspects, state space description and environment modeling. The main work includes the following:

[0045] 1) Division and description of the state space: each state of the air combat situation is fuzzified with a fuzzy method and used as the state input of reinforcement learning.

[0046] 2) Construction of the reinforcement learning environment for the air combat process: the motion control model of the UAV is constructed, the action space and state transition function of reinforcement learning are defined, and an air combat advantage function is constructed from the elements of the air combat situation, as the r...
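The fuzzification in item 1) can be illustrated with a minimal sketch. The source text here does not give the membership functions or the state variables, so everything concrete below is an assumption: Gaussian membership functions, a product t-norm to combine them into rule firing strengths, and the hypothetical names `gaussian_membership` and `firing_strengths`.

```python
import math
from itertools import product


def gaussian_membership(x, center, width):
    """Degree to which x belongs to a fuzzy set (Gaussian shape is an assumption)."""
    return math.exp(-0.5 * ((x - center) / width) ** 2)


def firing_strengths(state, centers, widths):
    """Normalized firing strength of every fuzzy rule for one state.

    state      -- list of situation variables (hypothetical, e.g. angle, distance)
    centers[i] -- centers of the fuzzy sets defined on state[i]
    widths[i]  -- one shared width for the fuzzy sets on state[i]
    One rule is formed per combination of fuzzy sets (product t-norm).
    """
    per_input = [
        [gaussian_membership(x, c, w) for c in cs]
        for x, cs, w in zip(state, centers, widths)
    ]
    raw = [math.prod(combo) for combo in product(*per_input)]
    total = sum(raw)
    return [r / total for r in raw]


# Hypothetical two-variable state with 2 and 3 fuzzy sets: 6 rules in total.
phi = firing_strengths([0.5, 2.0], [[0.0, 1.0], [0.0, 2.0, 4.0]], [0.5, 1.0])
print(len(phi), sum(phi))
```

The normalized vector `phi` is what a fuzzy Q-learning agent would consume as its state input in place of the raw continuous state. For inputs very far from every center the raw strengths can underflow to zero, which a fuller implementation would need to guard against.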


Abstract

The invention provides a reinforcement learning based air combat maneuver decision-making method for a UAV. A motion model of the aircraft platform is created, and the principal factors that influence the air combat situation are analyzed. On the basis of the motion model and this analysis, a dynamic fuzzy Q-learning model of air combat maneuver decision making is designed, and the essential elements and algorithm flow of reinforcement learning are determined. The state space of air combat maneuver decision making is fuzzified and serves as the state input of reinforcement learning. Typical air combat maneuvers are selected as the basic actions of reinforcement learning, and a weighted sum over the firing strengths of the fuzzy rules covers a continuous action space. On the basis of the established air combat advantage function, the return value of reinforcement learning is set by a weighted superposition of reward and punishment values. The method can effectively improve the autonomous maneuver decision-making capability of the UAV in air combat, with higher robustness and stronger autonomous optimization performance, and the decisions made by the UAV improve continuously through repeated simulation and learning.
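The weighted sum over rule firing strengths described in the abstract can be sketched as one step of a generic fuzzy Q-learning update. This is not the patent's algorithm: the dynamic rule generation is omitted, and the function names, the epsilon-greedy selection, the learning rate `alpha`, the discount `gamma`, and the basic action vectors are all assumptions made for illustration.

```python
import random


def select_actions(q, epsilon, rng):
    """Pick one basic-action index per rule, epsilon-greedy on that rule's q-row."""
    chosen = []
    for row in q:
        if rng.random() < epsilon:
            chosen.append(rng.randrange(len(row)))
        else:
            chosen.append(max(range(len(row)), key=row.__getitem__))
    return chosen


def blend_action(phi, chosen, basic_actions):
    """Continuous action: firing-strength-weighted sum of the chosen basic actions."""
    dim = len(basic_actions[0])
    return [
        sum(p * basic_actions[k][d] for p, k in zip(phi, chosen))
        for d in range(dim)
    ]


def td_update(q, phi, chosen, reward, phi_next, alpha, gamma):
    """Update the q-value of each rule's chosen action in proportion to its strength."""
    q_now = sum(p * q[i][k] for i, (p, k) in enumerate(zip(phi, chosen)))
    v_next = sum(p * max(row) for p, row in zip(phi_next, q))
    delta = reward + gamma * v_next - q_now
    for i, (p, k) in enumerate(zip(phi, chosen)):
        q[i][k] += alpha * delta * p
    return delta


# Hypothetical setup: 2 rules, 2 basic actions given as 2-D control vectors.
rng = random.Random(0)
q = [[0.0, 0.0], [0.0, 0.0]]
basic_actions = [[1.0, 0.0], [0.0, 1.0]]
phi = [0.25, 0.75]
chosen = select_actions(q, epsilon=0.1, rng=rng)
action = blend_action(phi, chosen, basic_actions)
td_update(q, phi, chosen, reward=1.0, phi_next=[0.5, 0.5], alpha=0.5, gamma=0.9)
print(action, q)
```

Because each rule contributes its chosen basic action in proportion to its firing strength, the blended command varies smoothly as the state moves between fuzzy sets, which is how a small set of discrete maneuvers can cover a continuous action space.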

Description

Technical field

[0001] The invention belongs to the technical field of artificial intelligence, and in particular relates to a method for implementing air combat maneuver decision-making for unmanned aircraft.

Background

[0002] At present, drones are able to complete tasks such as reconnaissance, surveillance, and ground attack, and play an increasingly irreplaceable role in modern warfare. However, because air combat has stricter real-time requirements, the current method of remotely controlling UAVs from a ground station struggles to achieve the accurate and timely control needed to gain an advantage in air combat. Therefore, raising the intelligence level of UAVs, so that they automatically generate control commands and complete air combat maneuvers according to the situational environment, is the main current research direction.

[0003] The essence of allowing UAVs to complete autonomous decision-making in air combat maneuvers is to co...


Application Information


IPC(8): G05D1/08, G05D1/10

CPC: G05D1/0808, G05D1/101

Inventor: 杨啟明, 张建东, 吴勇, 史国庆, 朱岩, 徐建城, 莫文莉

Owner: NORTHWESTERN POLYTECHNICAL UNIV
