Mechanical arm six-degree-of-freedom real-time grabbing method based on deep reinforcement learning

A technology of reinforcement learning and manipulators, applied in the direction of manipulators, program-controlled manipulators, manufacturing tools, etc., can solve problems such as instability, achieve a high grasping success rate, solve time-consuming calculations, and overcome cumbersome calculations

Active Publication Date: 2021-12-07
ZHEJIANG UNIV OF TECH
View PDF5 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, due to potential failures such as sensor storage and transmission, the 3D point cloud depth data input to the network is not stable compared with traditional 2D RGB image data.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Mechanical arm six-degree-of-freedom real-time grabbing method based on deep reinforcement learning
  • Mechanical arm six-degree-of-freedom real-time grabbing method based on deep reinforcement learning
  • Mechanical arm six-degree-of-freedom real-time grabbing method based on deep reinforcement learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0043] Further illustrate the present invention below in conjunction with accompanying drawing.

[0044] The real-time grasping method of the mechanical arm based on YOLOv5 pruning network and reinforcement learning of the present invention, the specific process is as follows:

[0045] Step 1: Use the binocular camera to collect the image information of the objects on the grasping platform: First, fix the Intel D415 depth camera vertically at the end of the robotic arm so that it can capture the complete image information of the objects on the grasping platform.

[0046] Step 2, use the YOLOv5 pruning network model to perform target detection training on the image;

[0047] Step 2.1: Since theoretically, the deeper the network, the better its performance. However, the experiment shows that the activation HAN function needs to be derived during the backpropagation process. If the derivative is greater than 1, then as the number of network layers increases, the gradient update ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a mechanical arm six-degree-of-freedom real-time grabbing method based on deep reinforcement learning. The mechanical arm six-degree-of-freedom real-time grabbing method based on deep reinforcement learning comprises the following steps of 1, acquiring image information of an object on a grabbing operation table through a binocular camera; 2, performing target detection training on the image by using a YOLOv5 pruning network model; 3, establishing a reinforcement learning network model; 4, completing mechanical arm grabbing movement through forward and inverse kinematics of a robot; and 5, carrying out reinforcement learning model training, so that a mechanical arm completes the grabbing action. The invention overcomes the defects of the prior art, provides a real-time object detection system which is easy to implement and high in applicability and is based on the YOLOv5 pruning network and the Policy Gradient reinforcement learning method, and the system can realize rapid and real-time target detection and complete the grabbing action while ensuring high precision.

Description

technical field [0001] The invention belongs to a real-time grasping object technology of a manipulator based on deep reinforcement learning, and specifically relates to YOLOv5 pruning network, Kinevt forward and reverse kinematics, CoppeliaSim Edu simulation software and Policy Gradient reinforcement learning strategy. Background technique [0002] Grasping is a fundamental and important problem in robotics. Despite its importance, the solutions to this problem have been unsatisfactory. However, with the rapid development of deep learning and reinforcement learning in recent years, it has provided many feasible ideas for the intelligent grasping method of the manipulator. Real-time object detection technology is a research hotspot in the field of computer vision in recent years. This technology includes the design of lightweight object detection networks, the production of target data sets, and the research on model deployment carriers. Among them, one of the most obvious ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): B25J9/16
CPCB25J9/16B25J9/1697B25J9/161B25J9/1628
Inventor 禹鑫燚徐靖黄睿邹超欧林林陈磊
Owner ZHEJIANG UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products