Path Planning Method for Manipulator Based on Velocity Smooth Deterministic Policy Gradient

A deterministic, robotic arm technology, applied in manipulators, program-controlled manipulators, manufacturing tools, etc., can solve problems such as excessive acceleration, manipulator jitter, manipulator arm damage, etc., to reduce joint acceleration, improve training speed, and improve space. The effect of search efficiency

Active Publication Date: 2022-03-22
NANJING UNIV OF SCI & TECH
View PDF5 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, since there is no smoothing process, the output motion vector will be quite different at the front and rear moments, resulting in excessive acceleration, which will cause the mechanical arm to vibrate and cause damage to the mechanical arm

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Path Planning Method for Manipulator Based on Velocity Smooth Deterministic Policy Gradient
  • Path Planning Method for Manipulator Based on Velocity Smooth Deterministic Policy Gradient
  • Path Planning Method for Manipulator Based on Velocity Smooth Deterministic Policy Gradient

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0024] The solutions of the present invention will be further described below in conjunction with the accompanying drawings and specific embodiments.

[0025] In the present invention, the path planning method of the manipulator based on the velocity smooth deterministic strategy gradient takes the end of the manipulator to reach the specified pose as the task, and divides the path planning into a training phase and a testing phase. The training phase flow is as follows: figure 1 As shown in (a), it mainly includes the following steps:

[0026] Step 1. Construct a simulation environment of the manipulator with job task feedback. The specific steps are as follows:

[0027] Use virtual simulation technology to build a simulation environment for manipulators with job task feedback, such as figure 2 shown. The simulation environment has input and output interfaces. The input interface can input the initial state of the manipulator (the initial angle of each joint of the manipul...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a path planning method of a manipulator based on a smooth deterministic strategy gradient of speed, and constructs a manipulator simulation environment with job task feedback in the training stage; introduces the previous step manipulator action vector into the deterministic strategy gradient network input, Construct a reinforcement learning network framework based on velocity smooth deterministic policy gradient; initialize network training parameters and manipulator simulation environment; obtain samples based on velocity smooth deterministic policy gradient network and simulation environment, and build a training sample library. If the number of training samples reaches the maximum number of samples number, the training samples are drawn from the training sample library according to the number of single training samples, and the training speed smoothes the deterministic policy gradient network; otherwise, proceed to the next step or the next simulation. On the basis of the deterministic strategy gradient network, the present invention adds the velocity vector of the previous step as the network input, effectively reduces the joint acceleration, and reduces the vibration of the mechanical arm.

Description

technical field [0001] The invention relates to a path planning technology of a manipulator, in particular to a path planning method of a manipulator based on velocity smoothing deterministic policy gradient. Background technique [0002] With the continuous development of robot technology, robotic arms have gradually replaced humans in special industries such as construction, palletizing, medical treatment, and live work. The traditional method of teaching and controlling the manipulator cannot meet the requirements of complex application scenarios and application tasks because it relies on manually given paths. Therefore, the autonomous path planning technology of the manipulator is a key technology for the development of intelligent robots. [0003] At present, the autonomous path planning method of the manipulator based on sampling path planning and multi-objective optimization can adapt to different operating environments, but it cannot be planned online, and the plann...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): B25J9/16
CPCB25J9/1664
Inventor 吴巍郭毓郭健肖潇蔡梁吴益飞吴钧浩郭飞张冕
Owner NANJING UNIV OF SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products