Autonomous underwater vehicle trajectory tracking control method based on deep reinforcement learning

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
An underwater vehicle and trajectory tracking technology, applied in adaptive control, general control system, control/regulation system, etc., can solve problems such as unstable learning process, low control accuracy, and inability to achieve continuous control

Active Publication Date: 2018-11-13

TSINGHUA UNIV

View PDF6 Cites 99 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0006] The purpose of the present invention is to propose a deep reinforcement learning-based AUV trajectory tracking control method, which uses a hybrid policy-evaluation network structure, and uses multiple quasi-Q learning and deterministic policy gradients to train the evaluation network and policy respectively Network, to overcome the problems of low control accuracy, inability to achieve continuous control and unstable learning process in previous methods based on reinforcement learning, and realize high-precision AUV trajectory tracking control and stable learning

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0101] An autonomous underwater vehicle trajectory tracking control method based on deep reinforcement learning proposed by the present invention will be further described in detail below in conjunction with the accompanying drawings and specific embodiments.

[0102] The present invention proposes an autonomous underwater vehicle tracking control algorithm based on deep reinforcement learning, which mainly includes four parts: defining the AUV trajectory tracking control problem, establishing a Markov decision process model for the AUV trajectory tracking problem, and constructing a hybrid strategy- Evaluate the network structure and solve the target policy for AUV trajectory tracking control.

[0103] 1) Define the AUV trajectory tracking control problem

[0104] Defining the AUV trajectory tracking control problem includes four components: determining the AUV system input, determining the AUV system output, defining the trajectory tracking control error, and establishing th...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention provides an autonomous underwater vehicle (AUV) trajectory tracking control method based on deep reinforcement learning, belonging to the field of deep reinforcement learning and intelligent control. The autonomous underwater vehicle trajectory tracking control method based on deep reinforcement learning includes the steps: defining an AUV trajectory tracking control problem; establishing a Markov decision-making process model of the AUV trajectory tracking problem; constructing a hybrid policy-evaluation network which consists of multiple policy networks and evaluation networks;and finally, solving the target policy of AUV trajectory tracking control by the constructed hybrid policy-evaluation network, for the multiple evaluation networks, evaluating the performance of eachevaluation network by defining an expected Bellman absolute error and updating only one evaluation network with the lowest performance at each time step, and for the multiple policy networks, randomly selecting one policy network at each time step and using a deterministic policy gradient to update, so that the finally learned policy is the mean value of all the policy networks. The autonomous underwater vehicle trajectory tracking control method based on deep reinforcement learning is not easy to be influenced by the bad AUV historical tracking trajectory, and has high precision.

Description

technical field [0001] The invention belongs to the field of deep reinforcement learning and intelligent control, and relates to an autonomous underwater vehicle (AUV) trajectory tracking control method based on deep reinforcement learning. Background technique [0002] The development of deep-sea submarine science is highly dependent on deep-sea exploration technology and equipment. Due to the complex deep-sea environment and extreme conditions, currently, deep-sea operating autonomous underwater vehicles are mainly used to replace or assist humans in deep-sea detection, observation and sampling. For mission scenarios where humans cannot reach the on-site operations, such as marine resource exploration, seabed survey, and oceanographic mapping, ensuring the autonomy and controllability of AUV underwater movement is the most basic and important functional requirement. premise of the task. However, many offshore applications of AUVs (such as trajectory tracking control, targ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G05B13/04

CPCG05B13/042

Inventor 宋士吉石文杰

Owner TSINGHUA UNIV

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Autonomous underwater vehicle trajectory tracking control method based on deep reinforcement learning

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology