Path planning method for single robot based on q-learning algorithm
A technology of path planning and robotics, applied in computer parts, instruments, computing, etc., can solve problems such as missing the optimal solution, affecting the effect of path planning, and insufficient exploration completeness
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0094] Such as figure 1 Shown: A single-robot path planning method based on the Q-Learning algorithm, including the following steps:
[0095] S1: Initialize the action set A, the state set S, the maximum number of iterations n, the maximum number of exploration steps m, the minimum number of paths MinPathNum, the maximum number of successful pathfinding MaxSuccessNum, the exploration factor ε, the single update step size of the exploration factor eSize, and the change of the exploration factor Period eCycle, maximum counting threshold h, start update time B(s, a), finish update time, action value function Q(s, a), state-action access times C(s, a), reward function storage U( s, a), the number of times of successful pathfinding SuccessNum, the number of successful paths PathNum, the PathList of successful paths, the storage table List of successful paths, the number of iterations i, the current time t and the target state parameters.
[0096] S2: Determine whether the number o...
Embodiment 2
[0121] This embodiment discloses a simulation experiment of path planning for a single robot.
[0122] 1. Description of the simulation experiment
[0123] 1) During the simulation experiment, the software platform uses Windows 10 operating system, the CPU uses Inter Core I5-8400, and the size of the running memory is 16GB. The path planning algorithm of the single robot system will use Python language and TensorFlow deep learning tool to complete the simulation experiment, and the multi-robot path planning algorithm will be written on the matlab2016a simulation software using matlab language.
[0124] 2) This paper will use the grid method to describe the environment, divide the working space of the robot system into small grids one by one, and each small grid can represent a state of the robot system. In the map, the white grid indicates the safe area, and the black grid indicates the existence of obstacles.
[0125] The target state and obstacles in the environment are st...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


