Fixed-wing UAV(unmanned aerial vehicle) cluster control method based on reinforcement learning

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A technology of reinforcement learning and control methods, which is applied in the field of UAVs, can solve the problems of complex control of fixed-wing UAV clusters and few research results, and achieve strong real-time performance and adaptability, and reduce the workload.

Active Publication Date: 2019-11-26

NAT UNIV OF DEFENSE TECH

View PDF7 Cites 23 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

Compared with rotorcraft, due to the non-holonomic constraints of the flight dynamics of fixed-wing UAVs, the swarm control of fixed-wing UAVs is more complicated, and the research results of applying reinforcement learning algorithms to the co-swarm control of fixed-wing UAVs are still less

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0032] The present invention will be further described in detail below in conjunction with the accompanying drawings and specific embodiments.

[0033] Such as figure 1 and Figure 4 Shown, a kind of fixed-wing unmanned aerial vehicle cluster control method based on reinforcement learning of the present invention comprises:

[0034] Step S1, training phase: establish a random UAV dynamics model, an actuator deep neural network and an evaluator deep neural network, continuously collect the historical experience of the agent interacting with the environment, and store it in the experience pool; Perform batch sampling, continuously update the network parameters of the executor and evaluator, and finally form a network model for saving the evaluator;

[0035] Step S2, execution stage: the wingman obtains its own position and attitude information through the sensor, loads the network model of the evaluator, and the evaluator outputs the best roll action of the wingman according t...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a fixed-wing UAV(unmanned aerial vehicle) cluster control method based on reinforcement learning. The method comprises S1) training phase: establishing a stochastic UAV dynamicmodel, an actuator deep neural network and an evaluator deep neural network, and continuously collecting historical experience of interaction between an agent and environment and storing the historical experience in an experience pool; carrying out batch sampling randomly in the experience pool, continuously updating network parameters of an actuator and an evaluator and eventually forming and saving an evaluator network model; and S2)execution phase: for each wing plane, obtaining position and attitude information thereof through a sensor, loading the evaluator network model, outputting thebest rolling action of the wing plane by the evaluator based on a current system combined state, a lead plane roll angle set value being given by an operator, and repeating the steps above until finishing the flight mission. The method has the advantages of high real-time performance and adaptability and being capable of transferring a strategy obtained by training in simulation to real environment and the like.

Description

technical field [0001] The invention mainly relates to the technical field of unmanned aerial vehicles, in particular to a fixed-wing unmanned aerial vehicle cluster control method based on reinforcement learning. Background technique [0002] In recent years, with the continuous development of sensor technology, communication technology and intelligent control technology, UAV technology has made great progress. Fixed-wing UAVs have the characteristics of fast flight speed, strong endurance and large payload, and have been widely used in disaster search and rescue, border patrol, anti-terrorism and other fields. Due to the lack of performance of a single UAV, the above tasks usually require the cooperation of multiple UAVs to complete efficiently. However, operating multiple fixed-wing UAVs requires a lot of manpower to monitor the status of each aircraft, and coordinating multiple UAVs to perform tasks still faces certain challenges. [0003] Consensus theory is widely us...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): G05D1/10

CPCG05D1/104Y02T10/40

Inventor 王菖闫超相晓嘉牛轶峰尹栋吴立珍陈紫叶

Owner NAT UNIV OF DEFENSE TECH

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Fixed-wing UAV(unmanned aerial vehicle) cluster control method based on reinforcement learning

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology