Mechanical arm dense object temperature priority grabbing method based on deep reinforcement learning

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A reinforcement learning, dense object technology, applied in manipulators, program-controlled manipulators, manufacturing tools, etc., can solve problems such as difficulty, general grasping effect of densely stacked objects, and complex modeling process

Active Publication Date: 2021-02-26

CHANGSHA UNIVERSITY OF SCIENCE AND TECHNOLOGY

View PDF7 Cites 4 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0004] The characteristics of the above grasping scenarios have brought difficulties to the grasping work of the robotic arm; the actual modeling process of the model-based method is usually complicated, and it is difficult to apply to unstructured scenarios; Grasping has some effect, but it is not effective for densely stacked objects, and it cannot give priority to hazard conditions

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment 1

[0141] The present invention uses a deep reinforcement learning algorithm to enable the robotic arm to learn the optimal grasping strategy faster under training, and has the ability to preferentially grasp objects with higher temperatures; the present invention uses the UR5 robotic arm and the RG2 robotic arm as examples In detail, the RG2 manipulator is the end effector of the manipulator, which moves in the horizontal and vertical directions; the image information is captured by the RGB-D camera and the infrared thermal imager, and the image is rendered by OpenGl;

[0142] The task scenario designed in this embodiment is to use the robotic arm to grab 10 objects of random temperature, color, and shape. These objects are stacked irregularly and densely until the robotic arm grabs all the objects.

[0143] Such as figure 2 As shown, the temperature-first grasping method for dense objects with a robotic arm based on deep reinforcement learning described in this embodiment incl...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a mechanical arm dense object temperature priority grabbing method based on deep reinforcement learning. The method comprises the following steps of S1, constructing a mechanical arm working space, and constructing a real-time state; S2, preprocessing state information; S3, carrying out forward transmission on the preprocessed information through a Q network to obtain a corresponding Q value; S4, instructing a mechanical arm to act according to the Q value and an epsilon-greedy strategy, and obtaining rewards through a reward function; S5, continuously updating the weight through a target Q network to realize training of the Q network; and S6, recording related data in the training process and a model finally formed after training to obtain an optimal grabbing strategy of the mechanical arm. The method has the advantages that the method is specifically applied to the grabbing scenes where objects have irregular shapes and are densely stacked and temperature factors need to be preferentially considered; the mechanical arm action is designed according to the deep reinforcement learning algorithm, so that the grabbing performance of the mechanical arm is improved; and an infrared image is introduced, so that the mechanical arm has the capacity of preferentially grabbing objects with higher temperature.

Description

technical field [0001] The present invention relates to a temperature-priority grabbing method for dense objects with a manipulator based on deep reinforcement learning. Deep reinforcement learning is applied to the grasping task of a manipulator, and pushing and grasping are put into a joint action within a reinforcement learning framework to promote Promote grasping and set temperature rewards, so that the robotic arm can better grasp dense objects and have the ability to preferentially grasp high-temperature objects. Background technique [0002] At present, the application and function of robotic arms are becoming more and more perfect; with the rapid development of robotic arm technology, robotic arms have been widely used in industrial tasks such as handling, palletizing, cutting, welding, etc., which not only liberate manpower, but also improve industrial production. The efficiency and quality of the robot arm; among them, the grasping task of the robotic arm is the b...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): B25J9/16

CPCB25J9/163

Inventor 陈满李茂军李宜伟赖志强李俊日熊凯飞

Owner CHANGSHA UNIVERSITY OF SCIENCE AND TECHNOLOGY

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Mechanical arm dense object temperature priority grabbing method based on deep reinforcement learning

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment 1

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology