Optimal path planning method for cleaning robot based on multi-step optimization of approximate model
A technology for optimal path planning and cleaning robots, applied to instruments, two-dimensional position/course control, vehicle position/route/altitude control, etc., can solve problems such as hindering strategy acquisition, accelerating strategy and algorithm convergence
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0037] The present invention will be further described below in conjunction with the examples, but not as a limitation of the present invention.
[0038] Please combine figure 1 As shown, the cleaning robot optimal path planning method based on the multi-step optimization of the approximate model involved in this embodiment includes the following steps:
[0039] Step 1), initialize the model, set the environmental state space X as the limit value of the horizontal and vertical coordinates of the two rooms, and the action in the action space U is a fixed value for the robot to move along the angle [-π,+π];
[0040] Step 2), initialize the hyperparameters, set the discount rate γ=0.9, the decay factor λ=0.9, the number of episodes=200, the exploration variance of the Gaussian function is 0.2, the maximum time step included in each episode is 200, and the learning of the value function The rate is 0.6, the learning rate of the strategy is 0.6, the learning rate of the model is 0...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


