Mechanical arm dense object temperature priority grabbing method based on deep reinforcement learning

A reinforcement learning, dense object technology, applied in manipulators, program-controlled manipulators, manufacturing tools, etc., can solve problems such as difficulty, general grasping effect of densely stacked objects, and complex modeling process

Active Publication Date: 2021-02-26
CHANGSHA UNIVERSITY OF SCIENCE AND TECHNOLOGY
View PDF7 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The characteristics of the above grasping scenarios have brought difficulties to the grasping work of the robotic arm; the actual modeling process of the model-based method is usually complicated, and it is difficult to apply to unstructured scenarios; Grasping has some effect, but it is not effective for densely stacked objects, and it cannot give priority to hazard conditions

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Mechanical arm dense object temperature priority grabbing method based on deep reinforcement learning
  • Mechanical arm dense object temperature priority grabbing method based on deep reinforcement learning
  • Mechanical arm dense object temperature priority grabbing method based on deep reinforcement learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0141] The present invention uses a deep reinforcement learning algorithm to enable the robotic arm to learn the optimal grasping strategy faster under training, and has the ability to preferentially grasp objects with higher temperatures; the present invention uses the UR5 robotic arm and the RG2 robotic arm as examples In detail, the RG2 manipulator is the end effector of the manipulator, which moves in the horizontal and vertical directions; the image information is captured by the RGB-D camera and the infrared thermal imager, and the image is rendered by OpenGl;

[0142] The task scenario designed in this embodiment is to use the robotic arm to grab 10 objects of random temperature, color, and shape. These objects are stacked irregularly and densely until the robotic arm grabs all the objects.

[0143] Such as figure 2 As shown, the temperature-first grasping method for dense objects with a robotic arm based on deep reinforcement learning described in this embodiment incl...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a mechanical arm dense object temperature priority grabbing method based on deep reinforcement learning. The method comprises the following steps of S1, constructing a mechanical arm working space, and constructing a real-time state; S2, preprocessing state information; S3, carrying out forward transmission on the preprocessed information through a Q network to obtain a corresponding Q value; S4, instructing a mechanical arm to act according to the Q value and an epsilon-greedy strategy, and obtaining rewards through a reward function; S5, continuously updating the weight through a target Q network to realize training of the Q network; and S6, recording related data in the training process and a model finally formed after training to obtain an optimal grabbing strategy of the mechanical arm. The method has the advantages that the method is specifically applied to the grabbing scenes where objects have irregular shapes and are densely stacked and temperature factors need to be preferentially considered; the mechanical arm action is designed according to the deep reinforcement learning algorithm, so that the grabbing performance of the mechanical arm is improved; and an infrared image is introduced, so that the mechanical arm has the capacity of preferentially grabbing objects with higher temperature.

Description

technical field [0001] The present invention relates to a temperature-priority grabbing method for dense objects with a manipulator based on deep reinforcement learning. Deep reinforcement learning is applied to the grasping task of a manipulator, and pushing and grasping are put into a joint action within a reinforcement learning framework to promote Promote grasping and set temperature rewards, so that the robotic arm can better grasp dense objects and have the ability to preferentially grasp high-temperature objects. Background technique [0002] At present, the application and function of robotic arms are becoming more and more perfect; with the rapid development of robotic arm technology, robotic arms have been widely used in industrial tasks such as handling, palletizing, cutting, welding, etc., which not only liberate manpower, but also improve industrial production. The efficiency and quality of the robot arm; among them, the grasping task of the robotic arm is the b...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): B25J9/16
CPCB25J9/163
Inventor 陈满李茂军李宜伟赖志强李俊日熊凯飞
Owner CHANGSHA UNIVERSITY OF SCIENCE AND TECHNOLOGY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products