Fixed-wing UAV(unmanned aerial vehicle) cluster control method based on reinforcement learning

A technology of reinforcement learning and control methods, which is applied in the field of UAVs, can solve the problems of complex control of fixed-wing UAV clusters and few research results, and achieve strong real-time performance and adaptability, and reduce the workload.

Active Publication Date: 2019-11-26
NAT UNIV OF DEFENSE TECH
View PDF7 Cites 23 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Compared with rotorcraft, due to the non-holonomic constraints of the flight dynamics of fixed-wing UAVs, the swarm control of fixed-wing UAVs is more complicated, and the research results of applying reinforcement learning algorithms to the co-swarm control of fixed-wing UAVs are still less

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Fixed-wing UAV(unmanned aerial vehicle) cluster control method based on reinforcement learning
  • Fixed-wing UAV(unmanned aerial vehicle) cluster control method based on reinforcement learning
  • Fixed-wing UAV(unmanned aerial vehicle) cluster control method based on reinforcement learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0032] The present invention will be further described in detail below in conjunction with the accompanying drawings and specific embodiments.

[0033] Such as figure 1 and Figure 4 Shown, a kind of fixed-wing unmanned aerial vehicle cluster control method based on reinforcement learning of the present invention comprises:

[0034] Step S1, training phase: establish a random UAV dynamics model, an actuator deep neural network and an evaluator deep neural network, continuously collect the historical experience of the agent interacting with the environment, and store it in the experience pool; Perform batch sampling, continuously update the network parameters of the executor and evaluator, and finally form a network model for saving the evaluator;

[0035] Step S2, execution stage: the wingman obtains its own position and attitude information through the sensor, loads the network model of the evaluator, and the evaluator outputs the best roll action of the wingman according t...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a fixed-wing UAV(unmanned aerial vehicle) cluster control method based on reinforcement learning. The method comprises S1) training phase: establishing a stochastic UAV dynamicmodel, an actuator deep neural network and an evaluator deep neural network, and continuously collecting historical experience of interaction between an agent and environment and storing the historical experience in an experience pool; carrying out batch sampling randomly in the experience pool, continuously updating network parameters of an actuator and an evaluator and eventually forming and saving an evaluator network model; and S2)execution phase: for each wing plane, obtaining position and attitude information thereof through a sensor, loading the evaluator network model, outputting thebest rolling action of the wing plane by the evaluator based on a current system combined state, a lead plane roll angle set value being given by an operator, and repeating the steps above until finishing the flight mission. The method has the advantages of high real-time performance and adaptability and being capable of transferring a strategy obtained by training in simulation to real environment and the like.

Description

technical field [0001] The invention mainly relates to the technical field of unmanned aerial vehicles, in particular to a fixed-wing unmanned aerial vehicle cluster control method based on reinforcement learning. Background technique [0002] In recent years, with the continuous development of sensor technology, communication technology and intelligent control technology, UAV technology has made great progress. Fixed-wing UAVs have the characteristics of fast flight speed, strong endurance and large payload, and have been widely used in disaster search and rescue, border patrol, anti-terrorism and other fields. Due to the lack of performance of a single UAV, the above tasks usually require the cooperation of multiple UAVs to complete efficiently. However, operating multiple fixed-wing UAVs requires a lot of manpower to monitor the status of each aircraft, and coordinating multiple UAVs to perform tasks still faces certain challenges. [0003] Consensus theory is widely us...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G05D1/10
CPCG05D1/104Y02T10/40
Inventor 王菖闫超相晓嘉牛轶峰尹栋吴立珍陈紫叶
Owner NAT UNIV OF DEFENSE TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products