Single robot path planning method based on Q-Learning algorithm
A path planning and robotics technology, applied in computer parts, instruments, calculations, etc., to solve problems such as affecting the effect of path planning, insufficient exploration completeness, and missing optimal solutions.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0102] Such as figure 1 Shown: A single-robot path planning method based on the Q-Learning algorithm, including the following steps:
[0103] S1: Initialize the action set A, the state set S, the maximum number of iterations n, the maximum number of exploration steps m, the minimum number of paths MinPathNum, the maximum number of successful pathfinding MaxSuccessNum, the exploration factor ε, the single update step size of the exploration factor eSize, and the change of the exploration factor Period eCycle, maximum counting threshold h, start update time B(s, a), finish update time, action value function Q(s, a), state-action access times C(s, a), reward function storage U( s, a), the number of successful pathfinding SuccessNum, the number of successful paths PathNum, the PathList of the successful path, the successful path storage table List, the number of iterations i and the current time t.
[0104] S2: Determine whether the number of iterations i is greater than the maxi...
Embodiment 2
[0129] This embodiment discloses a simulation experiment of path planning for a single robot.
[0130] 1. Description of the simulation experiment
[0131] 1) During the simulation experiment, the software platform uses Windows 10 operating system, the CPU uses Inter Core I5-8400, and the size of the running memory is 16GB. The path planning algorithm of the single robot system will use Python language and TensorFlow deep learning tool to complete the simulation experiment, and the multi-robot path planning algorithm will be written on the matlab2016a simulation software using matlab language.
[0132] 2) This paper will use the grid method to describe the environment, divide the working space of the robot system into small grids one by one, and each small grid can represent a state of the robot system. In the map, the white grid indicates the safe area, and the black grid indicates the existence of obstacles.
[0133] The target state and obstacles in the environment are st...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


