
Optimal Path Planning Method for Robots Based on a Partially Observable Markov Decision Process

An optimal path planning technology, applied in fields such as instruments, two-dimensional position/course control, and vehicle position/route/altitude control. It addresses the poor performance of existing algorithms, which ignore observations that have an important impact on algorithm performance, and it improves the efficiency of the algorithm.

Active Publication Date: 2020-09-08
SUZHOU UNIV
3 Cites, 0 Cited by

AI Technical Summary

Problems solved by technology

However, trial-based search selects only the optimal action and the optimal observation at each step. It does not consider other observations that are very close to the optimal observation and that have a significant impact on future algorithm performance.
As a result, the algorithm performs poorly on problems with large observation spaces.
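
The limitation described above can be illustrated with a minimal sketch. It assumes an HSVI/SARSOP-style score in which each observation is weighted by its probability times the gap between the upper and lower bounds at the resulting belief. The function name `rank_observations`, the `tolerance` parameter, and the sample numbers are hypothetical and do not come from the patent.

```python
def rank_observations(obs_probs, gaps, tolerance=0.05):
    """Score observations and separate the best one from near-best ones.

    obs_probs: observation -> probability of seeing it after the chosen action
    gaps:      observation -> (upper bound - lower bound) at the resulting belief
    tolerance: relative score difference under which an observation counts
               as "very close" to the best one (hypothetical parameter)
    """
    scores = {o: obs_probs[o] * gaps[o] for o in obs_probs}
    best = max(scores, key=scores.get)
    near = [
        o for o in scores
        if o != best and scores[best] - scores[o] <= tolerance * scores[best]
    ]
    return best, sorted(near), scores


# Made-up numbers: o2 scores almost as high as o1.
obs_probs = {"o1": 0.40, "o2": 0.38, "o3": 0.22}
gaps = {"o1": 2.0, "o2": 2.1, "o3": 0.5}

best, near, scores = rank_observations(obs_probs, gaps)
print("explored by a single trial:", best)
print("near-optimal but skipped:", near)
```

A purely greedy trial would follow only `best`; the observations in `near` are the ones the text says are left unexplored.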



Examples


Embodiment Construction

[0027] The present invention is further described below in conjunction with its principle, the accompanying drawings, and the embodiments.

[0028] As shown in Figure 1, the sweeping robot is in the living room on the right, and its task is to clean the bedroom on the left. Given the layout of the room, it needs to go around the dining table and pass through the door in the middle to enter the bedroom. Distance sensors are evenly installed on the robot's head, and each sensor can detect whether there is an obstacle within 1 unit length directly in front of it. The sensors have 256 possible detection results in total. Each sensor receives the correct detection result with probability 0.9 and a wrong one with probability 0.1. The initial position of the sweeping robot in the room is random, and its goal is to reach the bedroom on the left as quickly as possible. The reward for reaching the target position is +10.
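
The sensor model in this paragraph can be sketched as an observation probability function. The sketch assumes 8 binary sensors (which is consistent with 256 = 2^8 detection results, although the text does not state the sensor count) and assumes that sensor errors are independent. The names `N_SENSORS`, `P_CORRECT`, and `observation_prob` are illustrative only.

```python
from itertools import product

N_SENSORS = 8    # assumption: 2**8 = 256 matches the stated number of results
P_CORRECT = 0.9  # probability that a single sensor reports correctly (from the text)


def observation_prob(true_reading, observed_reading, p_correct=P_CORRECT):
    """Probability of an observed reading given the true obstacle pattern.

    Each reading is a tuple of 0/1 flags, one per sensor. Sensors are
    assumed to fail independently of each other.
    """
    prob = 1.0
    for truth, seen in zip(true_reading, observed_reading):
        prob *= p_correct if truth == seen else 1.0 - p_correct
    return prob


# Enumerate every possible joint reading of the sensors.
all_readings = list(product((0, 1), repeat=N_SENSORS))

# Hypothetical true obstacle pattern around the robot.
true_reading = (1, 0, 0, 0, 0, 0, 1, 0)

# The probabilities over all readings should form a distribution.
total = sum(observation_prob(true_reading, o) for o in all_readings)
print(len(all_readings), round(total, 6))
```

Under these assumptions, the probability that all sensors report correctly at once is 0.9^8, so a fully correct joint reading occurs less than half the time.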

[00...



Abstract

The invention discloses an optimal path planning method for robots based on a partially observable Markov decision process (POMDP). The robot searches for the optimal path to the target position using the POMDP model and the SARSOP algorithm, with the GLS search method as the heuristic during search. In continuous-state problems with large observation spaces, the invention avoids the repeated updating of multiple similar paths that occurs in earlier classic algorithms, which use trial-based search as the heuristic. It also reduces the number of updates to the upper and lower bounds of belief states without affecting the final optimal policy, which improves the efficiency of the algorithm. In the same amount of time, the robot can learn a better policy and find a better path.
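
The belief states mentioned in the abstract are probability distributions over the robot's possible states. As a minimal sketch, the standard Bayesian belief update for a discrete POMDP is shown here; it is not the patent's GLS search or SARSOP itself, and the two-state numbers are made up for illustration.

```python
def belief_update(belief, transition, obs_likelihood):
    """Standard discrete POMDP belief update for one fixed action.

    belief:         list, belief[s] = probability of being in state s
    transition:     nested list, transition[s][s2] = P(s2 | s, action)
    obs_likelihood: list, obs_likelihood[s2] = P(observation | s2, action)
    """
    n = len(belief)
    # Prediction step: push the belief through the transition model.
    predicted = [
        sum(belief[s] * transition[s][s2] for s in range(n)) for s2 in range(n)
    ]
    # Correction step: weight by how well each state explains the observation.
    unnormalized = [obs_likelihood[s2] * predicted[s2] for s2 in range(n)]
    norm = sum(unnormalized)
    if norm == 0.0:
        raise ValueError("observation has zero probability under this belief")
    return [u / norm for u in unnormalized]


# Hypothetical two-state example.
belief = [0.5, 0.5]
transition = [[0.8, 0.2], [0.1, 0.9]]
obs_likelihood = [0.9, 0.1]

new_belief = belief_update(belief, transition, obs_likelihood)
print(new_belief)
```

Point-based solvers such as SARSOP maintain upper and lower bounds on the value of beliefs like `new_belief`, which is why the number of bound updates matters for efficiency.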

Description

Technical field

[0001] The invention relates to the field of robot control, in particular to an optimal path planning method for robots based on a partially observable Markov decision process.

Background

[0002] Machine learning (ML) is a discipline that studies how to simulate or realize human learning behavior and how to continually reorganize and improve an existing knowledge structure. Reinforcement learning is an important research branch of machine learning. It is a machine learning method that maps states to actions through the interaction between an agent and its environment, so as to obtain the maximum long-term cumulative discounted reward. Reinforcement learning usually uses Markov decision processes (MDPs) as the model, which assumes that the environment is fully observable. In the real world, however, uncertainty is ubiquitous. For example, the agent's sensor has its own limitations: (1) the sensor can only detect a limited local part of the environment, and the agent cannot accurately disti...


Application Information

Patent Type & Authority: Patents (China)
IPC(8): G01C21/20, G05D1/02
CPC: G05D1/0219, G05D1/0221, G01C21/206, G01C21/3446, G05D1/0217
Inventor: 刘全, 朱斐, 钱炜晟, 章宗长
Owner: SUZHOU UNIV