Deep reinforcement learning dual-arm robot control method based on demonstration examples, and system

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A technology of reinforcement learning and control methods, applied in the direction of neural learning methods, program control manipulators, manipulators, etc., can solve the problems of unsatisfactory coordination and control of both arms, inability to receive positive rewards for both arms, and inability to carry out effective learning, etc. , to achieve the effect of improving control effect, maximizing performance, and solving exploration problems

Pending Publication Date: 2021-10-22

SHANDONG UNIV

View PDF0 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

In the prior art, only relying on the reinforcement learning algorithm to control the dual-arm robot, the robot arms as the intelligent body cannot receive positive rewards and cannot carry out effective learning, thus making the coordination control effect of the dual arms unsatisfactory and easy There is a problem that only one robot arm completes the grabbing action and the other robot arm does not move

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment 1

[0047] Such as Figure 1-10 As shown, the deep reinforcement learning dual-arm robot control method based on the demonstration example includes the following steps:

[0048] Obtain the initial coordinates of the end gripper of the dual-arm robot, the initial coordinates of the target object and the final coordinates of the target object;

[0049] Based on the shortest distance between the initial coordinates of the end gripper of the dual-arm robot and the initial coordinates of the target object, the three-dimensional coordinate motion increment and coordinate change coefficient of the position of the end gripper of each group of manipulators are obtained to form the first teaching method. a demonstration example;

[0050] After the gripper is closed and the object is grasped, based on the shortest distance between the initial coordinates of the target object and the final coordinates of the target object, the three-dimensional coordinate movement increment and coordinate chan...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention relates to a deep reinforcement learning dual-arm robot control method based on demonstration examples, and a system. The deep reinforcement learning dual-arm robot control method based on the demonstration examples comprises the steps of obtaining the three-dimensional coordinate motion increment of the position of a clamping jaw at the tail end of each group of mechanical arm based on the shortest distance between an initial coordinate of the clamping jaw at the tail end of a dual-arm robot and an initial coordinate of a target object, and forming a first demonstration example used for demonstration; after the clamping jaw closes to grab the object, obtaining the three-dimensional coordinate motion increment of the position of the clamping jaw at the tail end of each group of mechanical arm based on the shortest distance between the initial coordinate of the target object and a final coordinate of the target object, and forming a second demonstration example used for demonstration; and utilizing the demonstration examples of the two parts to form a complete expert demonstration track, adopting an experience playback pool of a double-agent deep deterministic policy gradient algorithm to randomly extract a small batch of samples to form training data, training a neural network, and achieving control over the dual-arm robot. The reinforcement learning algorithm reinforces the action sequence of the demonstration examples, and fast learning is carried out until the task is completed.

Description

technical field [0001] The invention relates to the field of robot control, in particular to a demonstration example-based deep reinforcement learning dual-arm robot control method and system. Background technique [0002] The statements in this section merely provide background information related to the present invention and do not necessarily constitute prior art. [0003] The grasping task of the robot is a multi-step sequential decision-making task in the continuous action space, which needs to execute multiple steps continuously within a period of time, and any problem in any step will lead to the failure of the task. A dual-arm robot is a robot that completes the grasping task through two sets of robotic arms. There is a certain motion constraint relationship between the grasping arms, and it is not a simple superposition of two single-arm robots. [0004] The coordinated control of the dual-arm robot means that the two robotic arms adapt to the environment and coope...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): B25J9/16G06N3/04G06N3/08

CPCB25J9/163B25J9/1669G06N3/08G06N3/045Y02P90/02

Inventor 宋勇刘鲁钰庞豹袁宪锋许庆阳巩志

Owner SHANDONG UNIV

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Deep reinforcement learning dual-arm robot control method based on demonstration examples, and system

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment 1

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology