Unlock instant, AI-driven research and patent intelligence for your innovation.

Grinding constant force control method based on deep reinforcement learning PPO algorithm

A technology of reinforcement learning and control methods, which is applied in the field of grinding machinery control, can solve problems such as difficult models, nonlinear grinding of robots, and difficulty in achieving satisfactory results with control methods, so as to achieve good robust performance and simplify the process of constant force control Effect

Pending Publication Date: 2022-06-24
LIAOCHENG UNIV
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Although the traditional force control strategy can achieve a certain control effect, it is difficult to achieve satisfactory results due to nonlinearity and a large number of uncertainties in robot grinding.
The force control based on the intelligent strategy can realize the constant force control through the intelligent algorithm, but the traditional intelligent algorithm generally needs to establish a priori model in advance, such as the relationship model between the grinding force difference and the compensation displacement. Modeling is very difficult

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Grinding constant force control method based on deep reinforcement learning PPO algorithm
  • Grinding constant force control method based on deep reinforcement learning PPO algorithm
  • Grinding constant force control method based on deep reinforcement learning PPO algorithm

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0057] Below in conjunction with embodiment, the present invention is further described.

[0058] The PPO algorithm adopts the standard Actor-Critic framework, the actor uses the method based on the policy function, and its network outputs the corresponding action after receiving the system state. Critic uses a value function-based approach, where the critic evaluates the actions produced by the actor and makes recommendations to the actor. As training progresses, the critic improves the prediction accuracy of the reward, and the actor improves the control strategy based on the reviewer's suggestion. In order to specifically illustrate the algorithm, the drawing parameter table is shown in Table 1:

[0059] Table 1: PPO algorithm parameter description table

[0060]

[0061]

[0062] The essence of the reinforcement learning algorithm is to make the agent learn the optimal strategy and maximize the cumulative reward that can be obtained on a complete trajectory, that i...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a grinding constant force control method based on a deep reinforcement learning PPO algorithm, and the method comprises the steps: firstly carrying out the stress analysis of an end effector during the grinding process of a spherical grinding wheel, and obtaining the real grinding force through the force processing; a relation model between the difference between the current grinding force and the preset grinding force and the normal compensation displacement of the end effector is trained through a deep reinforcement learning PPO algorithm, and training data normalization preprocessing, Euclidean distance-based reward function design, targeted deep neural network structure design and algorithm convergence promotion design are included; training a controller based on a PPO algorithm and using a grinding robot constant force controller. According to the method, a prior model between the grinding force error and the normal displacement does not need to be established in advance, the task that the grinding force is controlled within the target grinding force during plane grinding and curved surface grinding can be completed, and meanwhile the good self-adaptive capacity is achieved.

Description

technical field [0001] The invention relates to grinding machine control, specifically a grinding constant force control method based on a deep reinforcement learning PPO algorithm. Background technique [0002] The traditional manual grinding not only has uneven grinding quality and low efficiency, but also the flying dust produced by grinding has caused harm to the health of workers. The use of grinding robots can solve the problem very well. Installing a grinding end effector at the end of a multi-degree-of-freedom robot can machine workpieces with complex surface geometries, and can also avoid the problems caused by manual grinding. [0003] Robot grinding force control can be divided into passive force control and active force control. Passive power control mainly relies on some auxiliary compliance mechanisms, so that the robot can naturally comply with the grinding force when it is in contact with the abrasive belt wheel. Although this control method can effectively...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G05B13/04
CPCG05B13/042
Inventor 王启超陈林林孙群贾浩磊
Owner LIAOCHENG UNIV