Unmanned aerial vehicle air combat maneuver decision-making method based on distance-priority experience replay

A distance-priority decision-making method, applied in non-electric variable control, instruments, three-dimensional position/course control, etc. It addresses the large deviation of TD-error-based priority early in training, with the effects of avoiding meaningless learning, reducing the time cost of training, and improving training efficiency and sample utilization rate.

Active Publication Date: 2022-04-19

中国人民解放军军事科学院战略评估咨询中心

Cites: 10; Cited by: 1

AI Technical Summary

Problems solved by technology

[0004] The purpose of the present invention is to propose a UAV air combat maneuver decision-making method based on distance-priority experience replay. Compared with existing prioritized experience replay methods, the key improvement is twofold. First, a distance priority is used to address the large deviation of the TD-error-based priority in the early stage of training. Second, the number of times a UAV air combat maneuver sample has been selected for training is used to attenuate the influence of the distance priority on the total priority in the middle and late stages of training. This keeps the UAV air combat maneuver decision-making agent model from doing a large amount of meaningless learning in the early stage of training, which improves training efficiency and sample utilization.
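The combination described in [0004] can be sketched as follows. The excerpt does not give the patent's actual formulas, so everything here is an assumption made for illustration: the distance priority is taken as 1 / (1 + d), where d is the number of steps to the terminal state, and its weight is taken to decay geometrically with the number of times the sample has been selected. The names `distance_priority`, `total_priority`, `decay`, and `eps` are hypothetical.

```python
def distance_priority(steps_to_terminal: int) -> float:
    """Assumed form: larger for samples closer to the terminal state."""
    return 1.0 / (1.0 + steps_to_terminal)


def total_priority(
    td_error: float,
    steps_to_terminal: int,
    times_sampled: int,
    decay: float = 0.5,   # hypothetical attenuation factor, not from the patent
    eps: float = 1e-6,    # keeps every priority strictly positive
) -> float:
    """TD-error priority plus a distance term that fades as the sample is reused."""
    # Assumed attenuation: the distance term's weight shrinks geometrically
    # with the number of times this sample has been selected for training.
    distance_weight = decay ** times_sampled
    return abs(td_error) + eps + distance_weight * distance_priority(steps_to_terminal)


# Early in training (never sampled), a sample at the terminal state outranks
# one ten steps away even when both TD-errors are zero.
early_near = total_priority(td_error=0.0, steps_to_terminal=0, times_sampled=0)
early_far = total_priority(td_error=0.0, steps_to_terminal=10, times_sampled=0)

# After many selections the distance term is negligible and TD-error dominates.
late_near = total_priority(td_error=0.3, steps_to_terminal=0, times_sampled=50)
late_far = total_priority(td_error=0.3, steps_to_terminal=10, times_sampled=50)
```

Under these assumptions the ordering of samples is driven by distance at first and by TD-error later, which is the qualitative behavior the paragraph describes. Other attenuation schedules would serve the same purpose.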

Specific Embodiment

[0134] In a specific embodiment, a training request for the UAV air combat maneuver decision-making method based on distance-priority-constrained experience replay can be sent from a remote terminal or by a pre-programmed script.

[0135] The deep Q-network is used to construct the UAV air combat maneuver decision-making agent model. In the training request of the UAV air combat maneuver decision-making agent model, the hardware resources are the hardware configurations selected by the user based on the scale of confrontation training.

[0136] When performing specific training with the present invention, an appropriate hardware configuration can be selected for the network setup, for example the number of machines, the amount of memory, the number of CPU servers, the number of GPU servers, and the disk capacity.
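As an illustration only, the hardware fields listed in [0136] could be carried in a training request as a small validated record. The class name `HardwareRequest`, the field names, the units, and the example values are all hypothetical; the patent excerpt does not specify a request format.

```python
from dataclasses import dataclass, fields


@dataclass(frozen=True)
class HardwareRequest:
    """Hypothetical hardware section of a training request (names and units assumed)."""
    machines: int
    memory_gb: int
    cpu_servers: int
    gpu_servers: int
    disk_gb: int

    def __post_init__(self) -> None:
        # Reject obviously invalid requests before any resources are allocated.
        for f in fields(self):
            if getattr(self, f.name) < 0:
                raise ValueError(f"{f.name} must be non-negative")


# Illustrative values; a user would pick these based on the scale of training.
request = HardwareRequest(
    machines=2, memory_gb=64, cpu_servers=1, gpu_servers=1, disk_gb=500
)
```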

[0137] According to the training request of the UAV air combat maneuver decision-making agent model, hardware resources are configu...

Abstract

The invention discloses an unmanned aerial vehicle (UAV) air combat maneuver decision-making method based on distance-priority-constrained experience replay. The method comprises a step of setting the UAV air combat maneuver simulation preparation information, a step of building the UAV air combat maneuver decision-making agent model, a step of training that agent model, and a step of repeating training and ending. Compared with existing prioritized experience replay methods, the method improves the logic for calculating sample priority: a distance priority is introduced to offset the inaccuracy of the TD-error in the initial training stage, so that samples close to a termination state are preferentially selected when the agent learns in that stage. Meaningless learning by the agent in the initial stage of training is thereby avoided, training efficiency and sample utilization rate are greatly improved, and the time cost of training is reduced.
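A minimal sketch of how samples close to a termination state could be preferentially selected, assuming proportional sampling over per-sample priorities. The distance measure (steps remaining in the episode), the priority form 1 / (1 + d), and the helper names are assumptions for illustration and are not taken from the patent.

```python
import random
from typing import List, Sequence


def steps_to_terminal(episode_length: int) -> List[int]:
    """Distance of each transition in an episode from the terminating transition."""
    return [episode_length - 1 - i for i in range(episode_length)]


def sampling_probabilities(priorities: Sequence[float]) -> List[float]:
    """Normalize priorities so each sample is drawn in proportion to its priority."""
    total = sum(priorities)
    return [p / total for p in priorities]


def sample_indices(
    priorities: Sequence[float], batch_size: int, rng: random.Random
) -> List[int]:
    """Draw a training batch (with replacement) weighted by priority."""
    return rng.choices(range(len(priorities)), weights=priorities, k=batch_size)


# One five-step episode; the last transition reaches the termination state.
distances = steps_to_terminal(5)
# Assumed distance priority: closer to termination means higher priority.
priorities = [1.0 / (1.0 + d) for d in distances]
probs = sampling_probabilities(priorities)
batch = sample_indices(priorities, batch_size=3, rng=random.Random(0))
```

With these assumed priorities the terminating transition has the highest selection probability and the first transition the lowest, which matches the early-training behavior the abstract describes.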

Description

Technical field

[0001] The present invention relates to the field of virtual simulation of UAV air combat. In particular, it relates to a UAV air combat maneuver decision-making method based on distance-priority experience replay, which uses deep reinforcement learning to speed up the training of UAV air combat maneuver decision-making agent models in air combat simulation. On the basis of the traditional prioritized experience replay method, it further improves sample utilization efficiency, avoids meaningless learning by the agent model in the early stage of training, and increases the speed at which the agent model completes air combat maneuver training.

Background technique

[0002] With the development of unmanned and intelligent technology, the use of drones has become an important topic in the fields of civil and military science. The in...

Application Information

Patent Type & Authority: Applications (China)

IPC(8): G05D1/10

CPC: G05D1/106, Y02T10/40

Inventor: 林旺群, 田成平, 王伟, 王锐华, 黄其旺, 陶蔚, 毕华军

Owner: 中国人民解放军军事科学院战略评估咨询中心
