Reinforced learning based robot joint motion control method and system

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A technology of robotic joints and reinforcement learning, applied in the direction of program control manipulators, manipulators, manufacturing tools, etc., can solve problems such as only consideration, value function influence, error accumulation, etc., to reduce influence, avoid accumulation, and compensate accurately and efficiently. Effect

Active Publication Date: 2019-04-12

XIAMEN UNIV

View PDF5 Cites 8 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

The disadvantage of this method is that the errors will continue to accumulate, and slight changes in the strategy will have a great impact on the value function

The other is a method based on strategy search, also known as actor-only. This method directly improves the strategy. This method performs well in solving continuous state problems and is widely used in the field of robotics. However, only the current strategy is considered in the training process. Or several adjacent strategies, therefore, there is a local optimal problem

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0049] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0050] The purpose of the present invention is to provide a robot joint motion control method and system based on reinforcement learning, which has the characteristics of small error and high efficiency.

[0051] In order to make the above objects, features and advantages of the present invention more comprehensible, the present invention will be further described in detail below in conjunction with the accompanying drawings and specific embodiments.

[0052] ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a reinforced learning based robot joint motion control method and system. The method comprises the following steps: obtaining to-be-operated track of a robot terminal; calculating position increment within each interpolation period of the robot joint according to the to-be-operated track of the robot terminal and a robot inverse kinematic model; determining position increment compensation within each interpolation period of the robot joint according to a policy network; taking sum of given position increment within each interpolation period and position increment compensation as motion parameters of the robot joint, inputting the motion parameters into a robot to obtain practical motion amount within each interpolation period of the robot joint; performing real-timetraining update on a value network according to the given position increment and the practical motion amount; after operation of the to-be-operated track is accomplished, performing training update on the policy network according to parameters updated according to each interpolation period, of the value network; and controlling motion, in next to-be-operated track, of the robot joint by adoptingthe updated policy network. The reinforced learning based robot joint motion control method has the characteristics of being small in errors and high in efficiency.

Description

technical field [0001] The invention relates to the field of robot control, in particular to a robot joint motion control method and system based on reinforcement learning. Background technique [0002] There is a problem of trajectory deviation at the end of the robot during operation, and reinforcement learning can be used to compensate and reduce the deviation. There are currently two main types of reinforcement learning applications in this field. One is the method based on the value function, also known as critic-only, which is to derive the corresponding optimal strategy by observing and evaluating the performance of the system. The disadvantage of this method is that the errors will continue to accumulate, and slight changes in the strategy will have a great impact on the value function. The other is a method based on strategy search, also known as actor-only. This method directly improves the strategy. This method performs well in solving continuous state problems ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): B25J9/16

CPCB25J9/1602B25J9/163B25J9/1664

Inventor 刘暾东贺苗吴晓敏高凤强王若宇

Owner XIAMEN UNIV

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Reinforced learning based robot joint motion control method and system

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology