A Trajectory Tracking Control Method of Baxter Manipulator Based on Reinforcement Learning

A reinforcement-learning-based control method, applicable to manipulators, program-controlled manipulators, manufacturing tools and similar equipment. It addresses problems such as difficult operation, degraded control accuracy and external disturbances of the system, and improves control accuracy.

Active Publication Date: 2022-06-17

ZHEJIANG UNIV OF TECH

Problems solved by technology

Traditional control techniques are limited by the uncertainty of the actual system, which includes both uncertainty in the system model and disturbances from outside the system.

Traditional methods require a model of the system, and the accuracy of that model directly determines the accuracy of the control. Even when a model is available, the state feedback controller derived from it is only valid for an approximate model of the real system dynamics.

In addition, optimal control of a time-varying system is difficult to carry out on a real system: the cost is high, the performance is mediocre, and the practical value is low. A data-driven method that computes the optimal control from the input and output data of the system is therefore needed.

Embodiment Construction

[0051] In order to make the objectives, technical solutions and advantages of the present invention clearer, the technical solutions of the present invention are further described below with reference to the accompanying drawings and simulation experiments.

[0052] Referring to Figures 1 to 8, a Baxter manipulator trajectory tracking control method based on reinforcement learning proceeds as follows. First, system identification is performed on the first three joints of the Baxter manipulator; the continuous-time state-space equation is determined and discretized to obtain a discrete state-space model. This step is used only in simulation, to obtain the position and velocity tracking errors of the first three joints at the next time step. Then an initial state of the first three joints of the manipulator is given, and the position and velocity tracking errors of the three joints at the next time step are measured and recorded at a fixed sampling time. After preprocessing the collected position and velocity info...
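The discretization step in [0052] can be illustrated with a minimal sketch. It assumes a zero-order hold on the input, and it uses a double integrator for a single joint as a stand-in model; the matrices A and B, the sampling time T and all function names are hypothetical and are not taken from the patent, whose identified model is not given in this excerpt.

```python
# Hedged sketch: zero-order-hold discretization of a continuous-time
# state-space model, then one simulated step of the tracking error.
# The double-integrator matrices below are illustrative placeholders,
# NOT the model identified in the patent.

def mat_mul(X, Y):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]

def mat_exp(M, terms=20):
    """Matrix exponential via a truncated Taylor series (adequate for small ||M||)."""
    n = len(M)
    result = [[float(i == j) for j in range(n)] for i in range(n)]
    term = [row[:] for row in result]
    for k in range(1, terms):
        # term now holds M^k / k!
        term = [[v / k for v in row] for row in mat_mul(term, M)]
        result = [[result[i][j] + term[i][j] for j in range(n)] for i in range(n)]
    return result

def discretize_zoh(A, B, T):
    """Return (Ad, Bd) with e[k+1] = Ad e[k] + Bd u[k] under a zero-order hold."""
    n, m = len(A), len(B[0])
    # exp([[A, B], [0, 0]] * T) = [[Ad, Bd], [0, I]]
    M = [[A[i][j] * T for j in range(n)] + [B[i][j] * T for j in range(m)]
         for i in range(n)]
    M += [[0.0] * (n + m) for _ in range(m)]
    E = mat_exp(M)
    Ad = [row[:n] for row in E[:n]]
    Bd = [row[n:] for row in E[:n]]
    return Ad, Bd

def step(Ad, Bd, e, u):
    """Tracking error at the next time step for error state e and input u."""
    return [sum(Ad[i][j] * e[j] for j in range(len(e))) +
            sum(Bd[i][j] * u[j] for j in range(len(u))) for i in range(len(e))]

# One joint: state = [position error, velocity error], input = acceleration command.
A = [[0.0, 1.0], [0.0, 0.0]]
B = [[0.0], [1.0]]
T = 0.01  # assumed sampling time in seconds
Ad, Bd = discretize_zoh(A, B, T)
e_next = step(Ad, Bd, [0.1, 0.0], [-1.0])
```

In this sketch the discrete model serves only to generate the next-step tracking error in simulation, which matches the role the text assigns to it; on the real arm that error would be measured instead.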

Abstract

A trajectory tracking control method for the Baxter manipulator based on reinforcement learning. First, system identification is performed on the first three joints of the Baxter manipulator, and the continuous-time state-space equation is determined and discretized to obtain a discrete state-space model. This step is used only in simulation, to obtain the position and velocity tracking errors of the first three joints at the next time step. Then, given an initial state of the first three joints of the manipulator, the position and velocity tracking errors of the three joints at the next time step are measured and recorded at a fixed sampling time. After the collected position and velocity information is preprocessed, the recursive least squares method is used to calculate the weight matrix H corresponding to the optimal control strategy, and the optimal feedback control at the next time step is calculated from that weight matrix. The invention adapts automatically to model errors caused by model changes and improves the accuracy of the robot in daily use.
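The loop of measuring tracking errors, estimating H by recursive least squares and deriving the feedback from H can be sketched as below. This is one common formulation (policy iteration on a quadratic Q-function) and is offered only as an illustration, not as the patent's exact procedure. The scalar plant parameters a and b, the cost weights q and r, the exploration noise and every function name are assumptions; the plant parameters are used only to generate "measurements", never by the learner.

```python
# Hedged sketch: policy iteration with a quadratic Q-function whose weight
# matrix H is estimated by recursive least squares (RLS) from measured data.
# The scalar plant (a, b), the cost weights and all tuning constants are
# illustrative assumptions, not values from the patent.
import random

def rls_update(theta, P, phi, y):
    """One RLS step for the linear model y = phi . theta."""
    n = len(theta)
    P_phi = [sum(P[i][j] * phi[j] for j in range(n)) for i in range(n)]
    denom = 1.0 + sum(phi[i] * P_phi[i] for i in range(n))
    gain = [v / denom for v in P_phi]
    err = y - sum(phi[i] * theta[i] for i in range(n))
    theta = [theta[i] + gain[i] * err for i in range(n)]
    P = [[P[i][j] - gain[i] * P_phi[j] for j in range(n)] for i in range(n)]
    return theta, P

def features(x, u):
    """Quadratic basis: Q(x, u) = [x u] H [x u]^T = features . theta."""
    return [x * x, x * u, u * u]  # theta = [H_xx, 2*H_xu, H_uu]

def learn_gain(a=0.9, b=0.5, q=1.0, r=1.0, iterations=8, samples=200, seed=0):
    """Learn the gain K of u = -K x from data; a and b only simulate the plant."""
    rng = random.Random(seed)
    K, x = 0.0, 1.0  # K = 0 is stabilizing here because |a| < 1
    for _ in range(iterations):
        theta = [0.0, 0.0, 0.0]
        P = [[1e6 * (i == j) for j in range(3)] for i in range(3)]
        for _ in range(samples):
            u = -K * x + rng.uniform(-1.0, 1.0)  # exploration noise
            x_next = a * x + b * u               # stands in for a measurement
            cost = q * x * x + r * u * u
            # Bellman equation for the current policy:
            # Q(x, u) - Q(x_next, -K * x_next) = cost
            f_now = features(x, u)
            f_next = features(x_next, -K * x_next)
            phi = [f_now[i] - f_next[i] for i in range(3)]
            theta, P = rls_update(theta, P, phi, cost)
            x = x_next
        K = (theta[1] / 2.0) / theta[2]  # policy improvement: K = H_uu^-1 H_ux
    return K

def riccati_gain(a=0.9, b=0.5, q=1.0, r=1.0, sweeps=500):
    """Model-based reference gain from the scalar Riccati recursion."""
    p = q
    for _ in range(sweeps):
        p = q + a * a * p - (a * b * p) ** 2 / (r + b * b * p)
    return a * b * p / (r + b * b * p)

if __name__ == "__main__":
    print("learned gain:", learn_gain(), "model-based gain:", riccati_gain())
```

The model-based `riccati_gain` is included only as a check on the learned gain. For the three-joint case in the text, x would become the stacked position and velocity tracking errors and H a larger symmetric matrix, with the same update structure.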

Description

Technical field

[0001] The invention belongs to the field of intelligent control of manipulators. Specifically, it provides a Baxter manipulator trajectory tracking control method based on reinforcement learning which, when the manipulator model is unknown, computes the optimal control strategy by the reinforcement learning policy iteration method, reducing the trajectory tracking error and thereby minimizing the loss function of the robotic system.

Background art

[0002] In recent years, reinforcement learning theory has received extensive attention and research in the field of robot control. Industrial robotic arms, a common tool in industrial production, are widely used in automated production lines. How to apply reinforcement learning theory to the motion control of industrial manipulators, so that they acquire some ability to learn by themselves, is of great significance for expanding the application of manipulators and reducing the diff...

Application Information

Patent Type & Authority: Patents (China)

IPC: IPC(8): B25J9/16

CPC: B25J9/1664, B25J9/1651

Inventor: 夏振浩, 朱俊威, 张恒, 董子源, 王波, 顾曹源, 梁朝阳

Owner: ZHEJIANG UNIV OF TECH
