Deep reinforcement learning control method for vertical path following of intelligent underwater robot

An underwater robot, reinforcement learning technology, applied in machine learning, vehicle position/route/altitude control, control of finding targets, etc., can solve problems such as control effect dependence, achieve stable learning process, good adaptability, and improve efficiency Effect

Active Publication Date: 2019-09-06
HARBIN ENG UNIV
View PDF5 Cites 12 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] Traditional path-following control methods such as fuzzy logic control, PID control, and S-plane control require artificial adjustment of c

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Deep reinforcement learning control method for vertical path following of intelligent underwater robot
  • Deep reinforcement learning control method for vertical path following of intelligent underwater robot
  • Deep reinforcement learning control method for vertical path following of intelligent underwater robot

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0068] The following examples describe the present invention in more detail.

[0069] combine figure 1 Shown, is the general structural diagram of the present invention, mainly comprises:

[0070] Step 1: According to the path-following control requirements of the intelligent underwater robot, an intelligent underwater robot environment for interacting with the agent is established.

[0071] Step 2, establish a collection of agents.

[0072] Step 3: Establish an experience cache pool.

[0073] Step four, build learners.

[0074] Step 5, using distributed deterministic policy gradients for path-following control of the intelligent underwater vehicle.

[0075] The deep reinforcement learning control method for intelligent underwater robot vertical surface path following proposed by the present invention will be described in more detail below in conjunction with the accompanying drawings and specific embodiments.

[0076] Detailed implementation method of the present inventi...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a deep reinforcement learning control method for vertical path following ofan intelligent underwater robot. The deep reinforcement learning control method comprises the following steps that firstly, according to path following control requirements of the intelligent underwater robot, an intelligent underwater robot environment is established to interact with an agent; secondly, an agent set is established; thirdly, an experience buffer pool is established; fourthly, a learner is established; and fifthly, intelligent underwater robot path following control is conducted byusing a distributed deterministic strategy gradient. According to the deep reinforcement learning control method for the vertical path following ofthe intelligent underwater robot, the deep reinforcement learning control method for the vertical path following ofthe intelligent underwater robot is designed to solve the problem that marine environment in which the intelligent underwater robot is located is complex and variable, thus a traditional control method can not interact with the environment. The path followingand control task of the intelligent underwater robot can be finished in a distribution mode by using a deterministic strategy gradient, and the deep reinforcement learning control method for the vertical path following ofthe intelligent underwater robot has the advantages of self-learning, high precision, good adaptability and stable learning process.

Description

technical field [0001] The invention relates to a control method of an underwater vehicle, in particular to a deep reinforcement learning control method for vertical surface path following of an intelligent underwater robot. Background technique [0002] With the deepening of marine development, intelligent underwater robots have been widely used in marine environmental protection and marine resource development due to their flexible movement, convenient portability, and autonomous operation, and their status has become increasingly important. In addition, the precise control of intelligent underwater robots makes some extremely dangerous tasks safe, such as exploring subsea oil, repairing subsea pipelines, and tracking and recording the location of explosive materials. [0003] Traditional path-following control methods such as fuzzy logic control, PID control, and S-plane control require artificial adjustment of control parameters, and the control effect depends on human e...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G05D1/00G05D1/12G06N7/00G06N20/00
CPCG05D1/0088G05D1/12G06N20/00G06N7/01
Inventor 李晔白德乾姜言清安力武皓微
Owner HARBIN ENG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products