Unlock instant, AI-driven research and patent intelligence for your innovation.

Unmanned aerial vehicle combat analog simulation method based on multistage post experience playback

A simulation and UAV technology, applied in the field of virtual simulation, can solve the problems of low sample utilization rate, insufficient utilization rate of transfer samples, and affecting the final performance of the UAV agent, so as to speed up the learning speed and improve the efficiency

Active Publication Date: 2022-04-19
中国人民解放军军事科学院战略评估咨询中心
View PDF7 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Although the deep reinforcement learning method based on post-event experience playback uses an automated learning method and improves the training efficiency of UAV navigation in a simulated simulation environment, this method uses random sampling to collect UAV flight samples for the intelligent agent. learning, resulting in low sample utilization and even affecting the final performance of the UAV agent
[0005] Therefore, how to solve the problem of insufficient utilization of UAVs in the process of multi-objective agent reinforcement learning training and improve the learning efficiency of UAV agents has become an urgent technical problem.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Unmanned aerial vehicle combat analog simulation method based on multistage post experience playback
  • Unmanned aerial vehicle combat analog simulation method based on multistage post experience playback
  • Unmanned aerial vehicle combat analog simulation method based on multistage post experience playback

Examples

Experimental program
Comparison scheme
Effect test

specific Embodiment

[0114] In a specific embodiment, the multi-target agent training request can be sent through a remote terminal, or can be sent through a pre-programmed script.

[0115] In the multi-target agent training request, the hardware resource is the hardware configuration selected by the user based on the confrontation training scale,

[0116] The initial assumption is the application environment when the agent model is executed, and the application environment needs to meet the requirement that multiple targets can be displayed and represented by the state of the environment. The maneuvering target position of the drone is represented by the coordinates of our drone;

[0117] The evaluation index is set according to the actual application scene, and can be the average reward sum of multiple episodes, or the target completion rate of multiple episodes. The example of the present invention mainly uses the second evaluation index, that is, the target completion rate of multiple episodes...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an unmanned aerial vehicle combat analog simulation method based on multistage post experience playback. The method comprises an unmanned aerial vehicle simulation preparation information setting step, an unmanned aerial vehicle training network construction step based on multi-target agent training, an unmanned aerial vehicle training network training step based on multi-target agent training, and a repeated simulation and ending step. According to the method, multiple levels of post experience playback pools are set, three plot samples with different priorities are stored and utilized, random sampling is carried out according to different priorities of the experience playback pools and different probabilities, the higher the priority is, the larger the sampling probability is, useful sample information is utilized, the training efficiency is greatly improved, the overall utilization rate of the samples is improved, and the training efficiency is improved. And learning of the multi-target agent model is accelerated.

Description

technical field [0001] The present invention relates to the field of virtual simulation, in particular, to a combat simulation method for unmanned aerial vehicles based on multi-level post-event experience playback, which can introduce a multi-level post-event experience playback mechanism in the deep reinforcement learning of multi-target agents, and quickly improve the intelligence of intelligent agents. The learning performance makes it possible to quickly improve the speed of the agent learning to complete multi-objective tasks in the UAV flight simulation simulation. Background technique [0002] With the development of unmanned and intelligent technology, the use of drones has become an important topic in the fields of civil and military science. The initial drones were mainly operated manually. With the development of intelligence and simulation, A variety of intelligent agent simulation control methods have been applied to UAV flight simulation operations. [0003] ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G05B17/02
CPCG05B17/02
Inventor 林旺群田成平王伟王锐华李妍张世杰
Owner 中国人民解放军军事科学院战略评估咨询中心