Unlock instant, AI-driven research and patent intelligence for your innovation.

Optimal path planning method for cleaning robot based on multi-step optimization of approximate model

A technology for optimal path planning and cleaning robots, applied to instruments, two-dimensional position/course control, vehicle position/route/altitude control, etc., can solve problems such as hindering strategy acquisition, accelerating strategy and algorithm convergence

Active Publication Date: 2019-11-08
CHANGSHU INSTITUTE OF TECHNOLOGY
View PDF10 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

When the learned model is accurate enough, it can accelerate the convergence of strategies and algorithms. When the learned model is not accurate enough, using the model for planning will hinder the acquisition of the optimal solution of the strategy.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Optimal path planning method for cleaning robot based on multi-step optimization of approximate model
  • Optimal path planning method for cleaning robot based on multi-step optimization of approximate model
  • Optimal path planning method for cleaning robot based on multi-step optimization of approximate model

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0037] The present invention will be further described below in conjunction with the examples, but not as a limitation of the present invention.

[0038] Please combine figure 1 As shown, the cleaning robot optimal path planning method based on the multi-step optimization of the approximate model involved in this embodiment includes the following steps:

[0039] Step 1), initialize the model, set the environmental state space X as the limit value of the horizontal and vertical coordinates of the two rooms, and the action in the action space U is a fixed value for the robot to move along the angle [-π,+π];

[0040] Step 2), initialize the hyperparameters, set the discount rate γ=0.9, the decay factor λ=0.9, the number of episodes=200, the exploration variance of the Gaussian function is 0.2, the maximum time step included in each episode is 200, and the learning of the value function The rate is 0.6, the learning rate of the strategy is 0.6, the learning rate of the model is 0...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a cleaning robot optimal path planning method based on approximate model multi-step optimization. The method comprises steps: a model, hyperparameters and an environment are initialized; an exploration strategy is selected; the model is updated with the current sample; a value function, a strategy and the current state are updated; whether the current plot is ended is judged and further, the current sampling trajectory is used to update a trajectory pool; a reconstructed sampling trajectory is used to update the trajectory pool; all trajectories in the trajectory pool are used to update the model; the model is adopted for planning; whether to reach the maximum plot number is judged; and finally, according to a learnt optimal strategy, the optimal cleaning robot planning path is acquired. Through adopting the sampling trajectory and the single sample to update the model, the model learning precision is improved; the model is then used for planning, the learning speeds of the value function, the strategy and the whole algorithm are improved; and the sample utilization efficiency is also improved, and thus, the optimal cleaning robot planning path is acquired by adopting the fewer samples in the shorter time.

Description

technical field [0001] The invention relates to a path planning method for a cleaning robot, in particular to an optimal path planning method for a cleaning robot based on multi-step optimization of an approximate model. Background technique [0002] The problem of autonomous path planning for cleaning robots is a common problem in the field of control. This problem can model all possible states of the robot as a state space, model all actions that can occur as an action space, and model the next possible state reached after the current state takes an action as a transition function, and will reach the next The immediate reward obtained by a state is modeled as a reward function, which converts the problem into a Markov decision process. The conventional way to solve this problem is to use discrete reinforcement learning methods, such as Q-learning and SARSA algorithm to solve it, such as directly discretizing the state space and action space, that is, dividing the state sp...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G05D1/02
CPCG05D1/0221
Inventor 钟珊龚声蓉董瑞志姚宇峰马帅
Owner CHANGSHU INSTITUTE OF TECHNOLOGY