Deep reinforcement learning control method for vertical path following of intelligent underwater robot

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
An underwater robot, reinforcement learning technology, applied in machine learning, vehicle position/route/altitude control, control of finding targets, etc., can solve problems such as control effect dependence, achieve stable learning process, good adaptability, and improve efficiency Effect

Active Publication Date: 2019-09-06

HARBIN ENG UNIV

View PDF5 Cites 12 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0003] Traditional path-following control methods such as fuzzy logic control, PID control, and S-plane control require artificial adjustment of control parameters, and the control effect depends on human experience. Intelligent underwater robots cannot actively interact with the environment

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0068] The following examples describe the present invention in more detail.

[0069] combine figure 1 Shown, is the general structural diagram of the present invention, mainly comprises:

[0070] Step 1: According to the path-following control requirements of the intelligent underwater robot, an intelligent underwater robot environment for interacting with the agent is established.

[0071] Step 2, establish a collection of agents.

[0072] Step 3: Establish an experience cache pool.

[0073] Step four, build learners.

[0074] Step 5, using distributed deterministic policy gradients for path-following control of the intelligent underwater vehicle.

[0075] The deep reinforcement learning control method for intelligent underwater robot vertical surface path following proposed by the present invention will be described in more detail below in conjunction with the accompanying drawings and specific embodiments.

[0076] Detailed implementation method of the present inventi...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention provides a deep reinforcement learning control method for vertical path following ofan intelligent underwater robot. The deep reinforcement learning control method comprises the following steps that firstly, according to path following control requirements of the intelligent underwater robot, an intelligent underwater robot environment is established to interact with an agent; secondly, an agent set is established; thirdly, an experience buffer pool is established; fourthly, a learner is established; and fifthly, intelligent underwater robot path following control is conducted byusing a distributed deterministic strategy gradient. According to the deep reinforcement learning control method for the vertical path following ofthe intelligent underwater robot, the deep reinforcement learning control method for the vertical path following ofthe intelligent underwater robot is designed to solve the problem that marine environment in which the intelligent underwater robot is located is complex and variable, thus a traditional control method can not interact with the environment. The path followingand control task of the intelligent underwater robot can be finished in a distribution mode by using a deterministic strategy gradient, and the deep reinforcement learning control method for the vertical path following ofthe intelligent underwater robot has the advantages of self-learning, high precision, good adaptability and stable learning process.

Description

technical field [0001] The invention relates to a control method of an underwater vehicle, in particular to a deep reinforcement learning control method for vertical surface path following of an intelligent underwater robot. Background technique [0002] With the deepening of marine development, intelligent underwater robots have been widely used in marine environmental protection and marine resource development due to their flexible movement, convenient portability, and autonomous operation, and their status has become increasingly important. In addition, the precise control of intelligent underwater robots makes some extremely dangerous tasks safe, such as exploring subsea oil, repairing subsea pipelines, and tracking and recording the location of explosive materials. [0003] Traditional path-following control methods such as fuzzy logic control, PID control, and S-plane control require artificial adjustment of control parameters, and the control effect depends on human e...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G05D1/00G05D1/12G06N7/00G06N20/00

CPCG05D1/0088G05D1/12G06N20/00G06N7/01

Inventor 李晔白德乾姜言清安力武皓微

Owner HARBIN ENG UNIV

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Deep reinforcement learning control method for vertical path following of intelligent underwater robot

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology