Aircraft route tracking method based on deep reinforcement learning

A technology of reinforcement learning and aircraft, applied in the direction of instruments, three-dimensional position/course control, vehicle position/route/altitude control, etc., can solve problems such as wind disturbance, poor performance, and few control methods

Active Publication Date: 2020-02-18
TSINGHUA UNIV
View PDF8 Cites 24 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Although PID control is sufficient for most control tasks, its performance is not satisfactory in dynamic environments, such as: wind disturbance, load changes, voltage dips, etc.
[0003] The flight path tracking control problem of aircraft is a very challenging research field, which has been developed for decades, but most of the research is still in the simulation stage, and there are very few control methods that can be put into practical application
However, it is difficult to obtain an accurate aircraft model in practical applications

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Aircraft route tracking method based on deep reinforcement learning
  • Aircraft route tracking method based on deep reinforcement learning
  • Aircraft route tracking method based on deep reinforcement learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0077] The aircraft route tracking method based on deep reinforcement learning proposed by the present invention comprises the following steps:

[0078] (1) Construct a Markov model for aircraft route tracking, including the following steps:

[0079] The Markov model of aircraft route tracking includes four components: state variables, control variables, transfer model, and one-step loss function;

[0080] (1-1) Determine the state variables of the Markov model:

[0081] Such as figure 1 As shown, use (x, y, z) to represent the horizontal plane coordinates x, y and height z of the aircraft in the inertial coordinate system, and use Indicates the heading angle, pitch angle and roll angle of the aircraft in the inertial coordinate system, (p, q, r) indicates the three-axis velocity of the aircraft in the body coordinate system, and (u, v, w) indicates the aircraft in the body coordinate system The three-axis angular velocity under ;

[0082] Let the target height of the air...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to an aircraft route tracking method based on deep reinforcement learning, and belongs to the field of robot intelligent control. The aircraft route tracking method comprises thesteps of: constructing a Markov decision process model for aircraft trajectory tracking control, and respectively obtaining expressions of a state variable, a control variable, a transfer model and aone-step loss function for the aircraft route tracking control; respectively establishing a strategy network and an evaluation network; through reinforcement learning, enabling an aircraft to continuously update the strategy network and the evaluation network when making a step forward every time in route tracking control training until convergence; and obtaining a final strategy network for theroute tracking control. The aircraft route tracking method has strong expansibility, is not only suitable for the problem of route tracking control of the aircraft, but also can be expanded to other complicated problems of aircraft control by only resetting the state variable, a control input and a loss function and adjusting the structure and parameters of a neural network.

Description

technical field [0001] The invention relates to an aircraft route tracking method based on deep reinforcement learning, which belongs to the field of robot intelligent control. Background technique [0002] The autopilot system of an aircraft often includes two control loops: the outer loop generates the desired attitude according to the specified route; the inner loop controls the aircraft to track the instructions generated by the outer loop. Different from the outer loop controller, the inner loop controller is often only effective for specific aircraft and specific flight tasks. At present, most of the aircraft autopilot systems adopt the classic PID control. Although PID control is sufficient for most control tasks, its performance in dynamic environments such as wind disturbances, load changes, and voltage dips is not satisfactory. [0003] The flight path tracking control problem of aircraft is a very challenging research field, which has been developed for decades,...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G05D1/10G05B13/02G05B13/04
CPCG05B13/027G05B13/042G05D1/101
Inventor 游科友董斐宋士吉
Owner TSINGHUA UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products