Industrial mechanical arm motion planning method based on reinforcement learning algorithm

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A technology of reinforcement learning and motion planning, applied in the direction of manipulators, program control manipulators, claw arms, etc., can solve problems such as falling into local optimum, and achieve the effect of improving adaptability, good environmental adaptability and stability

Pending Publication Date: 2021-10-19

青岛博晟优控智能科技有限公司

View PDF0 Cites 6 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

The value evaluation algorithm of reinforcement learning is based on the value iteration method, which is conducive to the convergence of the value function to the optimum but is not suitable for continuous motion processes. The policy evaluation algorithm based on parameter optimization is more suitable for high-dimensional and continuous action control, and has better convergence. attributes, but single-use policy evaluation is prone to fall into local optimum

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0067] Below in conjunction with accompanying drawing and specific embodiment the present invention is described in further detail:

[0068] Robotic arms are commonly used machinery and equipment in industrial production. The six-degree-of-freedom full-rotation joint manipulator is a relatively common manipulator structure in the actual production environment. This type of manipulator can basically meet the needs of general industrial production. Such as figure 1 It is a classic industrial mechanical watch structure. Six of the joints are rotary joints and the axes of the rear three axes intersect at one point. This type of structure has strong kinematic solvability. Moreover, this kind of mechanical arm structure can basically meet the positioning and grasping tasks in three-dimensional space in industrial production. Therefore, this kind of mechanical arm with classic structure has been widely used in industrial production.

[0069] Such as figure 2 As shown, reinforc...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses an industrial mechanical arm motion planning method based on a reinforcement learning algorithm, and belongs to the field of application of reinforcement learning to mechanical arm motion planning. The reinforcement learning Actor-Critic algorithm is applied to the motion planning of a mechanical arm, so that an interactive relationship is established between the mechanical arm and the environment, and the adaptive capacity of the mechanical arm to the environment is improved through real-time interaction with the environment for training, so that autonomous learning control is realized; and firstly, a simulation environment of a mechanical arm hand-eye system is built, then a reinforcement learning algorithm model is built according to the simulation environment, finally, motion planning training of the mechanical arm is completed, and intelligent control over the mechanical arm is achieved. The mechanical arm motion planning algorithm based on reinforcement learning has good environmental adaptability and stability.

Description

technical field [0001] The invention belongs to the field of reinforcement learning applied to motion planning of a mechanical arm, and in particular relates to a motion planning method for an industrial mechanical arm based on a reinforcement learning algorithm. Background technique [0002] Motion planning of manipulators to accomplish specific complex tasks in an uncertain environment has always been a very challenging problem. Traditional control methods often rely heavily on the system model. However, the model often has the characteristics of high-order, nonlinear, multi-variable, and strong coupling. It is difficult to make the manipulator system have good adaptability and certain autonomy. In recent years, artificial intelligence technology has developed vigorously, which provides new ideas for autonomous learning and control of robotic arms. The core idea of artificial intelligence is to introduce an online learning mechanism in the planning and control of the ma...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): B25J9/16B25J18/00

CPCB25J9/1664B25J18/00

Inventor 聂君李强卢晓盛春阳张治国宋诗斌梁笑张焕水王倩

Owner 青岛博晟优控智能科技有限公司

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Industrial mechanical arm motion planning method based on reinforcement learning algorithm

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology