Pedestrian-sensing obstacle avoidance method for service robot based on deep reinforcement learning

The technology relates to service robots and reinforcement learning, and is applied to pedestrian perception and obstacle avoidance for service robots based on deep reinforcement learning. It addresses problems such as the difficulty of modeling the pedestrian obstacle avoidance mechanism and training that is difficult or slow to converge, and it achieves easier and faster convergence together with improved intelligence and sociability.

Active Publication Date: 2018-07-06

SHANGHAI JIAO TONG UNIV

10 Cites, 48 Cited by


AI Technical Summary

Problems solved by technology

[0006] In view of the above-mentioned defects of the prior art, the technical problem to be solved by the present invention is twofold: the obstacle avoidance mechanism of pedestrians is difficult to model, and end-to-end training methods are usually difficult to converge or converge very slowly.



Embodiment Construction

[0048] The following describes several preferred embodiments of the present invention with reference to the accompanying drawings, so as to make the technical content clearer and easier to understand. The present invention can be embodied in many different forms of embodiments, and the protection scope of the present invention is not limited to the embodiments mentioned herein.

[0049] In the drawings, components with the same structure are denoted by the same numerals, and components with similar structures or functions are denoted by similar numerals. The size and thickness of each component shown in the drawings are shown arbitrarily, and the present invention does not limit the size and thickness of each component. In order to make the illustration clearer, the thickness of parts is appropriately exaggerated in some places in the drawings.

[0050] As shown in Figure 1, Figure 2, Figure 3, Figure 4 and Figure 5, the present invention proposes a service ro...


Abstract

The invention discloses a pedestrian-sensing obstacle avoidance method for a service robot based on deep reinforcement learning, and relates to the fields of deep learning and service robot obstacle avoidance. In the training phase of the method, training data is first generated using the ORCA algorithm; then experimental scenes are randomly generated, and an initialized reinforcement learning model interacts with the environment to generate new training data, which is merged into the original training data; finally, the SGD algorithm is used to train a network on the new training data to obtain the final network model. In the execution phase of the method, the states of the surrounding pedestrians are obtained by lidar, the predicted state is calculated from the trained model and the reward function, and the action that yields the maximum reward is selected as the output and executed. The method offers strong real-time performance and adaptability. In a pedestrian environment, the robot complies with the pedestrian rule of walking on the right and plans an efficient, safe and natural path, which improves the intelligence and sociability of the service robot.
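The execution phase described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the abstract does not give the reward function, the network, the action set or the pedestrian motion model, so `Agent`, `advance`, `reward`, `select_action`, `toy_value`, the constant-velocity pedestrian prediction, the time step `dt` and the discount `gamma` are all hypothetical choices made for illustration. A simple distance-to-goal function stands in for the trained model.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class Agent:
    """Observed state of the robot or a pedestrian (hypothetical layout)."""
    x: float
    y: float
    vx: float
    vy: float
    radius: float


def advance(agent, vx, vy, dt):
    """Predict the agent's state after dt seconds at constant velocity (vx, vy)."""
    return Agent(agent.x + vx * dt, agent.y + vy * dt, vx, vy, agent.radius)


def reward(robot, pedestrians, goal, goal_tol=0.1):
    """Assumed reward: -1 on collision, +1 at the goal, 0 otherwise."""
    for p in pedestrians:
        gap = math.hypot(robot.x - p.x, robot.y - p.y) - robot.radius - p.radius
        if gap < 0.0:
            return -1.0
    if math.hypot(goal[0] - robot.x, goal[1] - robot.y) < goal_tol:
        return 1.0
    return 0.0


def select_action(robot, pedestrians, goal, value_fn, actions, dt=0.25, gamma=0.9):
    """Return the candidate velocity with the highest one-step lookahead score."""
    # Assumption: pedestrians keep their current velocity over the next step.
    next_peds = [advance(p, p.vx, p.vy, dt) for p in pedestrians]
    best_action, best_score = None, -math.inf
    for vx, vy in actions:
        next_robot = advance(robot, vx, vy, dt)
        score = (reward(next_robot, next_peds, goal)
                 + gamma * value_fn(next_robot, next_peds, goal))
        if score > best_score:
            best_action, best_score = (vx, vy), score
    return best_action


def toy_value(robot, pedestrians, goal):
    """Stand-in for the trained model: closer to the goal is better."""
    return -math.hypot(goal[0] - robot.x, goal[1] - robot.y)


ACTIONS = [(1.0, 0.0), (0.0, 1.0), (0.0, -1.0), (0.0, 0.0)]
robot = Agent(0.0, 0.0, 0.0, 0.0, 0.25)
blocker = Agent(0.6, 0.0, 0.0, 0.0, 0.25)  # stationary pedestrian on the path
goal = (2.0, 0.0)

free_choice = select_action(robot, [], goal, toy_value, ACTIONS)
blocked_choice = select_action(robot, [blocker], goal, toy_value, ACTIONS)
```

With no pedestrians the sketch heads straight for the goal; with the stationary pedestrian in the way, the straight-ahead action is penalized as a collision and a different action is chosen. In the method itself, the value estimate would come from the network trained on ORCA-generated and interaction-generated data rather than from `toy_value`.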

Description

Technical field

[0001] The invention relates to the fields of deep learning and obstacle avoidance for service robots, and in particular to a pedestrian-aware obstacle avoidance method for service robots based on deep reinforcement learning.

Background art

[0002] With the increase in labor costs, robots have begun to replace human workers in various fields, especially in public services, for example takeaway robots, express delivery robots and shopping guide robots. The scenes these robots face generally contain many highly dynamic obstacles, such as pedestrians. How to enable service robots to navigate autonomously in a pedestrian environment and avoid pedestrians efficiently, safely and naturally has become a key issue limiting the wider application of service robots. In a pedestrian environment, traditional obstacle avoidance algorithms adapt poorly, and sometimes show unsafe behaviors such as sudden stop and sha...


Application Information


IPC(8): G05D1/02, G06N3/04

CPC: G05D1/0231, G06N3/045

Inventor: 赵忠华, 鲁兴龙, 曹一文, 晏懿琳

Owner: SHANGHAI JIAO TONG UNIV
