Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Robot optimal path planning method based on partially observable Markov decision process

A technology for optimal path planning and robotics, applied in instruments, two-dimensional position/channel control, vehicle position/route/altitude control, etc., which can solve problems such as poor algorithm performance and observation that does not consider the important impact of algorithm performance. , to achieve the effect of improving the efficiency of the algorithm

Active Publication Date: 2018-10-19
SUZHOU UNIV
View PDF3 Cites 11 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the trial-based search selects the optimal action and observation each time, without considering other observations that are very close to the optimal observation and have a significant impact on future algorithm performance
In large-scale observation space problems, the performance of the algorithm is poor

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Robot optimal path planning method based on partially observable Markov decision process
  • Robot optimal path planning method based on partially observable Markov decision process
  • Robot optimal path planning method based on partially observable Markov decision process

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0030] Below in conjunction with principle of the present invention, accompanying drawing and embodiment the present invention is further described

[0031] see figure 1 As shown, the sweeping robot is in the living room on the right. Its task is to clean the bedroom on the left. According to the layout of the room, it needs to go around the dining table and pass through the door in the middle to enter the bedroom smoothly. Distance sensors are evenly installed on the robot’s head , each sensor can detect whether there is an obstacle within 1 unit length directly in front of it. There are 256 detection results of the sensor. The probability of each sensor receiving the correct detection result is 0.9, and the probability of receiving the wrong detection result is 0.1. The initial position of the sweeping robot in the room is random. Its goal is to reach the bedroom on the left as quickly as possible. The reward for the sweeping robot to reach the target position is .

[0032]...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a robot optimal path planning method based on a partially observable Markov decision process. A robot searches an optimal path to a target position, a POMDP model and a SARSOPalgorithm are considered as a basis, and a GLS search method is utilized as a heuristic condition during searching. For a continuous state large-scale observation space problem, through usage of the robot optimal path planning method the number of times of belief upper and lower bound updating is reduced than that of early classical algorithms adopting test as a heuristic condition to repeat updating of multiple similar paths, and final optimal strategy is not affected, thereby improving the algorithm efficiency; and in the same time, the robot can get a better strategy and find a better path.

Description

technical field [0001] The invention relates to the field of robot control, in particular to a robot optimal path planning method based on a partially perceptual Markov decision process. Background technique [0002] Machine Learning (ML) is a discipline that studies how to simulate or realize human learning behavior, and constantly reorganize and improve its original knowledge structure. Reinforcement learning is an important research branch of machine learning. It is a machine learning method that maps state to action through the interaction between agent and environment, so as to obtain the maximum long-term cumulative discount reward. Usually reinforcement learning uses Markov decision processes (Markov Decision Processes, MDPs) as a model, that is, the environment is completely observable. In the real world, however, uncertainty is ubiquitous. For example, the agent's sensor has its own limitations: (1) the sensor can only detect local limited environments, and the ag...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G01C21/20G05D1/02
CPCG05D1/0219G05D1/0221G01C21/206G01C21/3446G05D1/0217
Inventor 刘全朱斐钱炜晟章宗长
Owner SUZHOU UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products