Unmanned surface ship optimal trajectory tracking control method based on reinforced learning method

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A technology of reinforcement learning and trajectory tracking, applied in two-dimensional position/channel control, non-electric variable control, control/regulation system, etc. Problems such as reduced stickiness

Active Publication Date: 2019-07-16

DALIAN MARITIME UNIVERSITY

View PDF5 Cites 28 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0021] According to the above proposal, the existing optimal control method based on the reinforcement learning method mainly solves the optimal control of nonlinear systems with known system states, and does not consider unmanned surface vehicles with dead zones or completely unknown system dynamics. Control problems, which lead to the technical problems of reducing the accuracy and robustness of the actual control system, and provide an optimal trajectory tracking control method for unmanned surface vehicles based on reinforcement learning methods

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment 1

[0140] The basic principle of reinforcement learning technology is: if a certain state of the controlled system receives a "positive" reward from the environment, it is a reinforcement signal, and the trend of each subsequent action of the system will be strengthened; otherwise, the system will produce a trend of this state. weakened. Therefore, the goal of reinforcement learning is to learn a behavior policy that enables the state of the system output to obtain the maximum reward from the environment. In a standard reinforcement learning framework, it mainly has four elements, namely policy, reward and punishment feedback (reward), cost function (cost function) and environment model (model of environment).

[0141] Such as Figure 1-2 As shown, the present invention provides a kind of unmanned surface ship optimal trajectory tracking control method based on reinforcement learning method, comprises the following steps:

[0142] S1: Establish the mathematical model M1 of the un...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention provides an unmanned surface ship optimal trajectory tracking control method based on a reinforced learning method. The unmanned surface ship optimal trajectory tracking control method based on the reinforced learning method comprises the following steps: S1, establishing an unmanned surface ship system mathematical model and a desired trajectory system mathematical model without considering a disturbance condition; S2, establishing a dead zone mathematical model, so as to obtain an unmanned surface ship system mathematical model introducing the dead zone; and further obtaining an tracking error system; S3, establishing an identifier system; and S4, evaluating whether the control strategy meets the requirements or not through an optimal cost function; if the control strategymeets the requirements, outputting the control strategy to the unmanned surface ship system as an optimal control strategy; and if the control strategy does not meet the requirements, evaluating whether the regenerated control strategy meets the requirements or not through the optimal cost function, and repeating the above process until the optimal control strategy is obtained and output to the unmanned surface ship system. The invention solves the technical problem that the unmanned ship optimal control method in the prior art does not consider the dead zone or completely unknown system dynamics, and the accuracy and robustness of the control system are reduced.

Description

technical field [0001] The invention relates to the fields of ship control engineering and ship automation navigation, in particular, to an optimal trajectory tracking control method for an unmanned surface ship based on a reinforcement learning method. Background technique [0002] At present, in the field of ship trajectory tracking control, the control method designed to enable unmanned ships to achieve high-precision trajectory tracking is the core content of the research. It can not only enable the safe and effective operation of the unmanned surface vehicle, but also enhance its robustness and complete the established tasks accurately and stably. Common unmanned ship tracking control algorithms include PID, sliding mode, neural network, etc. [0003] Trajectory tracking control system of unmanned surface vessel is a typical nonlinear control system. The traditional solution to the optimal control problem of nonlinear control system is to solve the Hamilton-Jacobi-Bel...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): G05D1/02

CPCG05D1/0206

Inventor 王宁高颖李贺杨忱

Owner DALIAN MARITIME UNIVERSITY

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Unmanned surface ship optimal trajectory tracking control method based on reinforced learning method

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment 1

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology