Unlock instant, AI-driven research and patent intelligence for your innovation.

Unmanned aerial vehicle air combat maneuver decision-making method based on distance-first experience playback

A technology of distance priority and decision-making method, applied in non-electric variable control, instrument, three-dimensional position/channel control, etc., can solve problems such as large deviation, reduce time cost, avoid learning, improve training efficiency and sample utilization rate Effect

Active Publication Date: 2022-04-19
中国人民解放军军事科学院战略评估咨询中心
View PDF10 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The purpose of the present invention is to propose a UAV air combat maneuver decision-making method based on distance priority experience playback. Compared with the existing priority experience playback method, the key improvement is to use distance priority to solve the priority based on TD-error. There is a large deviation in the early stage of training, and UAV air combat maneuver samples are selected for training times to attenuate the influence of distance priority on the total priority in the middle and late training, so as to avoid UAV air combat maneuver decision-making agents The model does a lot of meaningless learning in the early stage of training, which improves training efficiency and sample utilization

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Unmanned aerial vehicle air combat maneuver decision-making method based on distance-first experience playback
  • Unmanned aerial vehicle air combat maneuver decision-making method based on distance-first experience playback
  • Unmanned aerial vehicle air combat maneuver decision-making method based on distance-first experience playback

Examples

Experimental program
Comparison scheme
Effect test

specific Embodiment

[0134] In a specific embodiment, the UAV air combat maneuver decision-making method based on the distance priority constraint experience playback method can be sent through a remote terminal, or a training request can be sent through a pre-programmed script.

[0135] The deep Q-network is used to construct the UAV air combat maneuver decision-making agent model. In the training request of the UAV air combat maneuver decision-making agent model, the hardware resources are the hardware configurations selected by the user based on the scale of confrontation training.

[0136] When performing specific training in the present invention, appropriate hardware configuration can be selected for network setting. For example, including the number of machines, the amount of memory, the number of CPU servers, the number of GPU servers, and the disk capacity.

[0137] According to the training request of the UAV air combat maneuver decision-making agent model, hardware resources are configu...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an unmanned aerial vehicle air combat maneuver decision-making method based on a distance priority constraint experience playback method. The method comprises an unmanned aerial vehicle air combat maneuver simulation preparation information setting step, an unmanned aerial vehicle air combat maneuver decision-making agent model building step, an unmanned aerial vehicle air combat maneuver decision-making agent model training step and a multi-time training and ending step. Compared with an existing priority experience playback method, the method has the advantages that the calculation logic for calculating the sample priority is improved, and the distance priority is introduced to relieve the inaccuracy of TD-error in the initial training stage, so that a sample close to a termination state is preferentially selected during intelligent agent learning in the initial training stage; therefore, meaningless learning of the intelligent agent at the initial stage of training is avoided, the training efficiency and the sample utilization rate are greatly improved, and the time cost of training is reduced.

Description

technical field [0001] The present invention relates to the field of virtual simulation of UAV air combat, in particular, it relates to a UAV air combat maneuver decision-making method based on distance-priority experience playback, which can use deep reinforcement learning methods to speed up the development of UAV air combat maneuver decision-making intelligent body models. The training in the air combat simulation, on the basis of the traditional priority experience playback method, further improves the sample utilization efficiency, avoids the meaningless learning of the UAV air combat maneuver decision-making agent model in the early stage of training, and improves the UAV air combat maneuver decision-making. The speed at which the agent model completes air combat maneuver training. Background technique [0002] With the development of unmanned and intelligent technology, the use of drones has become an important topic in the fields of civil and military science. The in...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G05D1/10
CPCG05D1/106Y02T10/40
Inventor 林旺群田成平王伟王锐华黄其旺陶蔚毕华军
Owner 中国人民解放军军事科学院战略评估咨询中心