Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

A control method of underwater robot based on reinforcement learning and its tracking control method

A technology of underwater robot and control method, which is applied in the direction of adaptive control, general control system, control/adjustment system, etc., can solve problems such as unreachable and unconsidered underwater robots, and reduce computing cost and simulation Number of times, the effect of good trajectory tracking

Active Publication Date: 2020-08-11
芽米科技(广州)有限公司
View PDF9 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, this invention does not take into account the influence of the uncertainty in the underwater environment on the underwater robot. In the marine environment, the underwater robot will be subject to various disturbances during operation, such as the force of surge, swing and heave. Influence, if these uncertain factors are not taken into account in the algorithm, the ideal effect will not be achieved in actual operation

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A control method of underwater robot based on reinforcement learning and its tracking control method
  • A control method of underwater robot based on reinforcement learning and its tracking control method
  • A control method of underwater robot based on reinforcement learning and its tracking control method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0053] The present invention will be further described below in conjunction with accompanying drawing:

[0054] Such as figure 2 As shown, the present invention has a buoy relay on the water surface, uses it to self-position the underwater robot, and the control center gives the expected trajectory information of the underwater robot and sends it to the underwater robot; the underwater robot controller controls the driver according to the system control Drive to complete the motion of the underwater robot.

[0055] Such as figure 1 As shown, the present invention has set forth a kind of underwater robot control method based on reinforcement learning, and its method comprises the following steps:

[0056] Step 1 The underwater robot is affected by the surrounding environment underwater, and it is necessary to evaluate the uncertain factors in the underwater robot model in order to complete the control of the driver by the underwater robot controller; the underwater robot has...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an underwater robot control method based on reinforcement learning and a control method of tracking with an underwater robot, belonging to the technical field of underwater robot control. According to the methods provided by the invention, a control center gives desired trajectory information of the underwater robot, and sends the information to the underwater robot; according to a probability density function of uncertain parameters in an underwater robot model, sampling points are respectively selected for the uncertain parameters, and order reduction is executed foran original dynamical model based on the sampling points; the underwater robot performs interaction with surrounding environment so as to learn environment information, a one step cost function is calculated at different states and value updating is executed, weight of a value function corresponding to a control strategy is solved through a least square method, improvement of the control strategyis executed through a gradient descent method, loop iteration is executed for two processes of value updating and strategy improvement until convergence is realized, and consequently, an optimal control strategy of current position tracking desired trajectory is obtained; and steps above are repeated for obtaining the optimal control strategy for the rest desired trajectory tracking, and finally,a tracking task is completed.

Description

technical field [0001] The invention relates to the field of underwater robot control, in particular to an underwater robot control method based on reinforcement learning and a tracking control method thereof. Background technique [0002] With the application of marine resources more and more widely, underwater robots have also received more attention. An important application of underwater robots in the ocean is position tracking, but the underwater environment is complex and changeable, making it difficult to obtain the model parameters of underwater robots and difficult to control them. [0003] Searching in the prior art found that the patent application with publication number CN106708069A designed a coordinated planning and control method for an underwater mobile working robot. The method includes planning the current desired speed and state in real time through dynamic tracking of the differentiator, using iterative task priority method to transform the task plannin...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G05B13/04
CPCG05B13/042
Inventor 闫敬公雅迪罗小元杨晛李鑫
Owner 芽米科技(广州)有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products