Mechanical arm motion planning method based on deep reinforcement learning

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A technology of reinforcement learning and motion planning, applied in manipulators, program-controlled manipulators, manufacturing tools, etc., can solve the problems of large nonlinear function approximators, difficult to deal with high-dimensional continuous action space, instability and other problems

Pending Publication Date: 2020-09-11

NANJING UNIV +1

View PDF5 Cites 12 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

Before DQN, it was widely believed that learning value functions using large nonlinear function approximators was difficult and unstable

The algorithm combines deep learning with reinforcement learning, and can use a function-like approximator to learn the value function in a stable manner. In order to minimize the correlation between samples, the network uses samples in the buffer for offline training, but it is difficult to cope with High-dimensional continuous action space

The Actor-Critic method can handle continuous action space problems, but the use of random strategies makes it difficult for the network to converge

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0036] The present invention will be further explained below in conjunction with the accompanying drawings and specific embodiments.

[0037] This embodiment provides a method for motion planning of a robotic arm based on deep reinforcement learning, taking a 6-DOF robotic arm as an example for illustration, specifically including the following steps:

[0038] Step 1. The image acquisition device acquires an environment image before the movement of the manipulator. The environment image includes the manipulator in the initial state, moving target points and intermediate obstacles to obtain the initial planning space. The simplified schematic diagram is shown in 1.

[0039] Step 2, according to the collected environmental image, use the target segmentation algorithm to separate the forbidden area ( figure 1 Middle gray area), the working area is the movement space and target position of the end of the manipulator except the forbidden area in the planning space, and the initial...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a mechanical arm motion planning method based on deep reinforcement learning. The method comprises the following steps of 1, acquiring an environment image once before a mechanical arm moves, wherein the environment image comprises the mechanical arm in an initial state, a moving target point and an intermediate obstacle; 2, according to the acquired environment image, utilizing the target partitioning algorithm to separate a prohibited area, a working area and a target position from one another, and reconstructing a planning space; 3, dividing the reconstructed planning space into three-dimensional grid spaces, and establishing binarized grid spaces; 4, solving the corresponding analytical solution of each joint of the mechanical arm under known terminal coordinates by utilizing the robot inverse kinematics, and determining the relative positional relationships between the mechanical arm and the planned space boundary, the prohibited area boundary and the moving target under the global coordinate system; and 5, planning the motion strategy for the mechanical arm and acquiring the optimal motion strategy, so that the mechanical arm moves to the target position at the minimum cost under the premise of avoiding the obstacle.

Description

technical field [0001] The invention relates to a dynamic intelligent planning method based on reinforcement learning, in particular to a motion planning method of a 6-axis cooperative mechanical arm. Background technique [0002] The traditional trajectory planning of the manipulator mostly adopts the method of trajectory interpolation, combined with high-order polynomials to smooth the position, speed and acceleration of the manipulator. In this method, the motion scheme of the manipulator is relatively fixed, and it is impossible to actively avoid the environment. of obstacles. As the performance requirements of the robotic arm in the industrial field are getting higher and higher, and the robotic arm is required to complete more interactive or dynamic complex space tasks, the traditional method is no longer applicable. [0003] In recent years, reinforcement learning methods have been increasingly used in robot control tasks. Traditional reinforcement learning methods s...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): B25J9/16

CPCB25J9/1666B25J9/163B25J9/1697

Inventor 辛博傅汇乔陈春林程旭马晶

Owner NANJING UNIV

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Mechanical arm motion planning method based on deep reinforcement learning

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology