Supercharge Your Innovation With Domain-Expert AI Agents!

Deep reinforcement learning dual-arm robot control method based on demonstration examples, and system

A technology of reinforcement learning and control methods, applied in the direction of neural learning methods, program control manipulators, manipulators, etc., can solve the problems of unsatisfactory coordination and control of both arms, inability to receive positive rewards for both arms, and inability to carry out effective learning, etc. , to achieve the effect of improving control effect, maximizing performance, and solving exploration problems

Pending Publication Date: 2021-10-22
SHANDONG UNIV
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

In the prior art, only relying on the reinforcement learning algorithm to control the dual-arm robot, the robot arms as the intelligent body cannot receive positive rewards and cannot carry out effective learning, thus making the coordination control effect of the dual arms unsatisfactory and easy There is a problem that only one robot arm completes the grabbing action and the other robot arm does not move

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Deep reinforcement learning dual-arm robot control method based on demonstration examples, and system
  • Deep reinforcement learning dual-arm robot control method based on demonstration examples, and system
  • Deep reinforcement learning dual-arm robot control method based on demonstration examples, and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0047] Such as Figure 1-10 As shown, the deep reinforcement learning dual-arm robot control method based on the demonstration example includes the following steps:

[0048] Obtain the initial coordinates of the end gripper of the dual-arm robot, the initial coordinates of the target object and the final coordinates of the target object;

[0049] Based on the shortest distance between the initial coordinates of the end gripper of the dual-arm robot and the initial coordinates of the target object, the three-dimensional coordinate motion increment and coordinate change coefficient of the position of the end gripper of each group of manipulators are obtained to form the first teaching method. a demonstration example;

[0050] After the gripper is closed and the object is grasped, based on the shortest distance between the initial coordinates of the target object and the final coordinates of the target object, the three-dimensional coordinate movement increment and coordinate chan...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a deep reinforcement learning dual-arm robot control method based on demonstration examples, and a system. The deep reinforcement learning dual-arm robot control method based on the demonstration examples comprises the steps of obtaining the three-dimensional coordinate motion increment of the position of a clamping jaw at the tail end of each group of mechanical arm based on the shortest distance between an initial coordinate of the clamping jaw at the tail end of a dual-arm robot and an initial coordinate of a target object, and forming a first demonstration example used for demonstration; after the clamping jaw closes to grab the object, obtaining the three-dimensional coordinate motion increment of the position of the clamping jaw at the tail end of each group of mechanical arm based on the shortest distance between the initial coordinate of the target object and a final coordinate of the target object, and forming a second demonstration example used for demonstration; and utilizing the demonstration examples of the two parts to form a complete expert demonstration track, adopting an experience playback pool of a double-agent deep deterministic policy gradient algorithm to randomly extract a small batch of samples to form training data, training a neural network, and achieving control over the dual-arm robot. The reinforcement learning algorithm reinforces the action sequence of the demonstration examples, and fast learning is carried out until the task is completed.

Description

technical field [0001] The invention relates to the field of robot control, in particular to a demonstration example-based deep reinforcement learning dual-arm robot control method and system. Background technique [0002] The statements in this section merely provide background information related to the present invention and do not necessarily constitute prior art. [0003] The grasping task of the robot is a multi-step sequential decision-making task in the continuous action space, which needs to execute multiple steps continuously within a period of time, and any problem in any step will lead to the failure of the task. A dual-arm robot is a robot that completes the grasping task through two sets of robotic arms. There is a certain motion constraint relationship between the grasping arms, and it is not a simple superposition of two single-arm robots. [0004] The coordinated control of the dual-arm robot means that the two robotic arms adapt to the environment and coope...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): B25J9/16G06N3/04G06N3/08
CPCB25J9/163B25J9/1669G06N3/08G06N3/045Y02P90/02
Inventor 宋勇刘鲁钰庞豹袁宪锋许庆阳巩志
Owner SHANDONG UNIV
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More