Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Industrial robot-oriented reinforcement learning reward value calculation method

An industrial robot and reinforcement learning technology, which is applied in the field of reinforcement learning reward value calculation for industrial robots, can solve problems such as high state and behavior dimensions of industrial robots, and achieve the effect of speeding up the exploration process

Active Publication Date: 2022-08-05
GUANGDONG POLYTECHNIC NORMAL UNIV
View PDF21 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The biggest problem with this manual planning method is that it is impossible to automatically adjust the movement of the robot according to the change of the job task.
[0004] However, due to the high dimensionality of the state and behavior of industrial robots, and the collision between the robot and obstacles needs to be considered in the motion trajectory planning

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Industrial robot-oriented reinforcement learning reward value calculation method
  • Industrial robot-oriented reinforcement learning reward value calculation method
  • Industrial robot-oriented reinforcement learning reward value calculation method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0052] Embodiment 1: The present invention provides a reinforcement learning reward value calculation method for industrial robots, including the following steps:

[0053] S1: Initial calculation of industrial robot state parameters:

[0054] Specifically include steps:

[0055] (1) Obtain the 3D point cloud distribution of obstacles in the working environment of industrial robots {p 1 ,p 2 ,…,p n }, and the target pose matrix of the industrial robot (In the matrix, R is a 3×3 rotation matrix representing the attitude, and P is a 3×1 translation matrix representing the position);

[0056] (2) According to the D-H parameters of the industrial robot, calculate the pose matrix of the robot end effector at the initial moment: The pose matrix at the previous moment: and the pose matrix at the current moment

[0057] (3) Calculate the shortest distance from the initial moment of the end effector of the industrial robot to the target according to the translation matrix P ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an industrial robot-oriented reinforcement learning reward value calculation method. The method comprises the following steps of S1, carrying out initialization calculation on state parameters of an industrial robot; s2, calculating a pose reward value of an end execution mechanism of the industrial robot; s3, calculating a collision reward value of the industrial robot; s4, calculating an exploration reward value of the industrial robot; s5, calculating a target reward value; according to the method, the target nearby area and the non-target nearby area are divided, so that the tail end of the industrial robot can be quickly close to the target position in the early stage and can be adjusted to be in a proper posture while being close to the target position in the later stage, and the exploration process of the industrial robot is accelerated; various state information (positions, postures, collision and the like) of the industrial robot are comprehensively considered, and the problem that postures of a motion track finally planned by the industrial robot cannot meet actual production requirements is solved.

Description

technical field [0001] The invention belongs to the technical field of industrial robots, and in particular relates to a reinforcement learning reward value calculation method for industrial robots. Background technique [0002] Industrial robots are multi-joint manipulators or multi-degree-of-freedom machine devices for industrial fields. They have been widely used in important industries such as automobile manufacturing, electrical industry, and metal products. However, before the industrial robot is put into actual production, it is necessary to plan the motion trajectory planning of the industrial robot through manual teaching programming or manual offline programming. The biggest problem with this manual planning method is that it cannot automatically adjust the motion of the robot according to the change of the task. [0003] The emergence of reinforcement learning provides a new method for intelligent motion trajectory planning of industrial robots. Through the "exp...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): B25J9/16
CPCB25J9/1664B25J9/163Y02P90/02
Inventor 徐金雄班勃岑健熊建斌
Owner GUANGDONG POLYTECHNIC NORMAL UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products