Robot optimal path planning method based on partially observable Markov decision process

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A technology for optimal path planning and robotics, applied in instruments, two-dimensional position/channel control, vehicle position/route/altitude control, etc., which can solve problems such as poor algorithm performance and observation that does not consider the important impact of algorithm performance. , to achieve the effect of improving the efficiency of the algorithm

Active Publication Date: 2018-10-19

SUZHOU UNIV

View PDF3 Cites 11 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

However, the trial-based search selects the optimal action and observation each time, without considering other observations that are very close to the optimal observation and have a significant impact on future algorithm performance

In large-scale observation space problems, the performance of the algorithm is poor

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0030] Below in conjunction with principle of the present invention, accompanying drawing and embodiment the present invention is further described

[0031] see figure 1 As shown, the sweeping robot is in the living room on the right. Its task is to clean the bedroom on the left. According to the layout of the room, it needs to go around the dining table and pass through the door in the middle to enter the bedroom smoothly. Distance sensors are evenly installed on the robot’s head , each sensor can detect whether there is an obstacle within 1 unit length directly in front of it. There are 256 detection results of the sensor. The probability of each sensor receiving the correct detection result is 0.9, and the probability of receiving the wrong detection result is 0.1. The initial position of the sweeping robot in the room is random. Its goal is to reach the bedroom on the left as quickly as possible. The reward for the sweeping robot to reach the target position is .

[0032]...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a robot optimal path planning method based on a partially observable Markov decision process. A robot searches an optimal path to a target position, a POMDP model and a SARSOPalgorithm are considered as a basis, and a GLS search method is utilized as a heuristic condition during searching. For a continuous state large-scale observation space problem, through usage of the robot optimal path planning method the number of times of belief upper and lower bound updating is reduced than that of early classical algorithms adopting test as a heuristic condition to repeat updating of multiple similar paths, and final optimal strategy is not affected, thereby improving the algorithm efficiency; and in the same time, the robot can get a better strategy and find a better path.

Description

technical field [0001] The invention relates to the field of robot control, in particular to a robot optimal path planning method based on a partially perceptual Markov decision process. Background technique [0002] Machine Learning (ML) is a discipline that studies how to simulate or realize human learning behavior, and constantly reorganize and improve its original knowledge structure. Reinforcement learning is an important research branch of machine learning. It is a machine learning method that maps state to action through the interaction between agent and environment, so as to obtain the maximum long-term cumulative discount reward. Usually reinforcement learning uses Markov decision processes (Markov Decision Processes, MDPs) as a model, that is, the environment is completely observable. In the real world, however, uncertainty is ubiquitous. For example, the agent's sensor has its own limitations: (1) the sensor can only detect local limited environments, and the ag...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G01C21/20G05D1/02

CPCG05D1/0219G05D1/0221G01C21/206G01C21/3446G05D1/0217

Inventor 刘全朱斐钱炜晟章宗长

Owner SUZHOU UNIV

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Robot optimal path planning method based on partially observable Markov decision process

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology