Grinding constant force control method based on deep reinforcement learning PPO algorithm
A technology of reinforcement learning and control methods, which is applied in the field of grinding machinery control, can solve problems such as difficult models, nonlinear grinding of robots, and difficulty in achieving satisfactory results with control methods, so as to achieve good robust performance and simplify the process of constant force control Effect
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0057] Below in conjunction with embodiment, the present invention is further described.
[0058] The PPO algorithm adopts the standard Actor-Critic framework, the actor uses the method based on the policy function, and its network outputs the corresponding action after receiving the system state. Critic uses a value function-based approach, where the critic evaluates the actions produced by the actor and makes recommendations to the actor. As training progresses, the critic improves the prediction accuracy of the reward, and the actor improves the control strategy based on the reviewer's suggestion. In order to specifically illustrate the algorithm, the drawing parameter table is shown in Table 1:
[0059] Table 1: PPO algorithm parameter description table
[0060]
[0061]
[0062] The essence of the reinforcement learning algorithm is to make the agent learn the optimal strategy and maximize the cumulative reward that can be obtained on a complete trajectory, that i...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


