Grinding constant force control method based on deep reinforcement learning PPO algorithm

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A technology of reinforcement learning and control methods, which is applied in the field of grinding machinery control, can solve problems such as difficult models, nonlinear grinding of robots, and difficulty in achieving satisfactory results with control methods, so as to achieve good robust performance and simplify the process of constant force control Effect

Pending Publication Date: 2022-06-24

LIAOCHENG UNIV

View PDF0 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

Although the traditional force control strategy can achieve a certain control effect, it is difficult to achieve satisfactory results due to nonlinearity and a large number of uncertainties in robot grinding.

The force control based on the intelligent strategy can realize the constant force control through the intelligent algorithm, but the traditional intelligent algorithm generally needs to establish a priori model in advance, such as the relationship model between the grinding force difference and the compensation displacement. Modeling is very difficult

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0057] Below in conjunction with embodiment, the present invention is further described.

[0058] The PPO algorithm adopts the standard Actor-Critic framework, the actor uses the method based on the policy function, and its network outputs the corresponding action after receiving the system state. Critic uses a value function-based approach, where the critic evaluates the actions produced by the actor and makes recommendations to the actor. As training progresses, the critic improves the prediction accuracy of the reward, and the actor improves the control strategy based on the reviewer's suggestion. In order to specifically illustrate the algorithm, the drawing parameter table is shown in Table 1:

[0059] Table 1: PPO algorithm parameter description table

[0060]

[0061]

[0062] The essence of the reinforcement learning algorithm is to make the agent learn the optimal strategy and maximize the cumulative reward that can be obtained on a complete trajectory, that i...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention provides a grinding constant force control method based on a deep reinforcement learning PPO algorithm, and the method comprises the steps: firstly carrying out the stress analysis of an end effector during the grinding process of a spherical grinding wheel, and obtaining the real grinding force through the force processing; a relation model between the difference between the current grinding force and the preset grinding force and the normal compensation displacement of the end effector is trained through a deep reinforcement learning PPO algorithm, and training data normalization preprocessing, Euclidean distance-based reward function design, targeted deep neural network structure design and algorithm convergence promotion design are included; training a controller based on a PPO algorithm and using a grinding robot constant force controller. According to the method, a prior model between the grinding force error and the normal displacement does not need to be established in advance, the task that the grinding force is controlled within the target grinding force during plane grinding and curved surface grinding can be completed, and meanwhile the good self-adaptive capacity is achieved.

Description

technical field [0001] The invention relates to grinding machine control, specifically a grinding constant force control method based on a deep reinforcement learning PPO algorithm. Background technique [0002] The traditional manual grinding not only has uneven grinding quality and low efficiency, but also the flying dust produced by grinding has caused harm to the health of workers. The use of grinding robots can solve the problem very well. Installing a grinding end effector at the end of a multi-degree-of-freedom robot can machine workpieces with complex surface geometries, and can also avoid the problems caused by manual grinding. [0003] Robot grinding force control can be divided into passive force control and active force control. Passive power control mainly relies on some auxiliary compliance mechanisms, so that the robot can naturally comply with the grinding force when it is in contact with the abrasive belt wheel. Although this control method can effectively...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): G05B13/04

CPCG05B13/042

Inventor 王启超陈林林孙群贾浩磊

Owner LIAOCHENG UNIV

Grinding constant force control method based on deep reinforcement learning PPO algorithm

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology