Unmanned surface ship optimal trajectory tracking control method based on reinforced learning method

A technology of reinforcement learning and trajectory tracking, applied in two-dimensional position/channel control, non-electric variable control, control/regulation system, etc. Problems such as reduced stickiness

Active Publication Date: 2019-07-16
DALIAN MARITIME UNIVERSITY
View PDF5 Cites 28 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0021] According to the above proposal, the existing optimal control method based on the reinforcement learning method mainly solves the optimal control of nonlinear systems with known system states, and does not consider unmanned surface vehicles with dead zones or completely unknown system dynamics. Control problems, which lead to the technical problems of reducing the accuracy and robustness of the actual control system, and provide an optimal trajectory tracking control method for unmanned surface vehicles based on reinforcement learning methods

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Unmanned surface ship optimal trajectory tracking control method based on reinforced learning method
  • Unmanned surface ship optimal trajectory tracking control method based on reinforced learning method
  • Unmanned surface ship optimal trajectory tracking control method based on reinforced learning method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0140] The basic principle of reinforcement learning technology is: if a certain state of the controlled system receives a "positive" reward from the environment, it is a reinforcement signal, and the trend of each subsequent action of the system will be strengthened; otherwise, the system will produce a trend of this state. weakened. Therefore, the goal of reinforcement learning is to learn a behavior policy that enables the state of the system output to obtain the maximum reward from the environment. In a standard reinforcement learning framework, it mainly has four elements, namely policy, reward and punishment feedback (reward), cost function (cost function) and environment model (model of environment).

[0141] Such as Figure 1-2 As shown, the present invention provides a kind of unmanned surface ship optimal trajectory tracking control method based on reinforcement learning method, comprises the following steps:

[0142] S1: Establish the mathematical model M1 of the un...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides an unmanned surface ship optimal trajectory tracking control method based on a reinforced learning method. The unmanned surface ship optimal trajectory tracking control method based on the reinforced learning method comprises the following steps: S1, establishing an unmanned surface ship system mathematical model and a desired trajectory system mathematical model without considering a disturbance condition; S2, establishing a dead zone mathematical model, so as to obtain an unmanned surface ship system mathematical model introducing the dead zone; and further obtaining an tracking error system; S3, establishing an identifier system; and S4, evaluating whether the control strategy meets the requirements or not through an optimal cost function; if the control strategymeets the requirements, outputting the control strategy to the unmanned surface ship system as an optimal control strategy; and if the control strategy does not meet the requirements, evaluating whether the regenerated control strategy meets the requirements or not through the optimal cost function, and repeating the above process until the optimal control strategy is obtained and output to the unmanned surface ship system. The invention solves the technical problem that the unmanned ship optimal control method in the prior art does not consider the dead zone or completely unknown system dynamics, and the accuracy and robustness of the control system are reduced.

Description

technical field [0001] The invention relates to the fields of ship control engineering and ship automation navigation, in particular, to an optimal trajectory tracking control method for an unmanned surface ship based on a reinforcement learning method. Background technique [0002] At present, in the field of ship trajectory tracking control, the control method designed to enable unmanned ships to achieve high-precision trajectory tracking is the core content of the research. It can not only enable the safe and effective operation of the unmanned surface vehicle, but also enhance its robustness and complete the established tasks accurately and stably. Common unmanned ship tracking control algorithms include PID, sliding mode, neural network, etc. [0003] Trajectory tracking control system of unmanned surface vessel is a typical nonlinear control system. The traditional solution to the optimal control problem of nonlinear control system is to solve the Hamilton-Jacobi-Bel...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G05D1/02
CPCG05D1/0206
Inventor 王宁高颖李贺杨忱
Owner DALIAN MARITIME UNIVERSITY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products