Industrial mechanical arm motion planning method based on reinforcement learning algorithm

A technology of reinforcement learning and motion planning, applied in the direction of manipulators, program control manipulators, claw arms, etc., can solve problems such as falling into local optimum, and achieve the effect of improving adaptability, good environmental adaptability and stability

Pending Publication Date: 2021-10-19
青岛博晟优控智能科技有限公司
View PDF0 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The value evaluation algorithm of reinforcement learning is based on the value iteration method, which is conducive to the convergence of the value function to the optimum but is not suitable for continuous motion processes. The policy evaluation algorithm based on parameter optimization is more suitable for high-dimensional and continuous action control, and has better convergence. attributes, but single-use policy evaluation is prone to fall into local optimum

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Industrial mechanical arm motion planning method based on reinforcement learning algorithm
  • Industrial mechanical arm motion planning method based on reinforcement learning algorithm
  • Industrial mechanical arm motion planning method based on reinforcement learning algorithm

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0067] Below in conjunction with accompanying drawing and specific embodiment the present invention is described in further detail:

[0068] Robotic arms are commonly used machinery and equipment in industrial production. The six-degree-of-freedom full-rotation joint manipulator is a relatively common manipulator structure in the actual production environment. This type of manipulator can basically meet the needs of general industrial production. Such as figure 1 It is a classic industrial mechanical watch structure. Six of the joints are rotary joints and the axes of the rear three axes intersect at one point. This type of structure has strong kinematic solvability. Moreover, this kind of mechanical arm structure can basically meet the positioning and grasping tasks in three-dimensional space in industrial production. Therefore, this kind of mechanical arm with classic structure has been widely used in industrial production.

[0069] Such as figure 2 As shown, reinforc...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an industrial mechanical arm motion planning method based on a reinforcement learning algorithm, and belongs to the field of application of reinforcement learning to mechanical arm motion planning. The reinforcement learning Actor-Critic algorithm is applied to the motion planning of a mechanical arm, so that an interactive relationship is established between the mechanical arm and the environment, and the adaptive capacity of the mechanical arm to the environment is improved through real-time interaction with the environment for training, so that autonomous learning control is realized; and firstly, a simulation environment of a mechanical arm hand-eye system is built, then a reinforcement learning algorithm model is built according to the simulation environment, finally, motion planning training of the mechanical arm is completed, and intelligent control over the mechanical arm is achieved. The mechanical arm motion planning algorithm based on reinforcement learning has good environmental adaptability and stability.

Description

technical field [0001] The invention belongs to the field of reinforcement learning applied to motion planning of a mechanical arm, and in particular relates to a motion planning method for an industrial mechanical arm based on a reinforcement learning algorithm. Background technique [0002] Motion planning of manipulators to accomplish specific complex tasks in an uncertain environment has always been a very challenging problem. Traditional control methods often rely heavily on the system model. However, the model often has the characteristics of high-order, nonlinear, multi-variable, and strong coupling. It is difficult to make the manipulator system have good adaptability and certain autonomy. In recent years, artificial intelligence technology has developed vigorously, which provides new ideas for autonomous learning and control of robotic arms. The core idea of ​​artificial intelligence is to introduce an online learning mechanism in the planning and control of the ma...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): B25J9/16B25J18/00
CPCB25J9/1664B25J18/00
Inventor 聂君李强卢晓盛春阳张治国宋诗斌梁笑张焕水王倩
Owner 青岛博晟优控智能科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products