Pedestrian-sensing obstacle avoidance method for service robot based on deep reinforcement learning

A service robot and reinforcement learning technology, applied in the field of service robot pedestrian perception and obstacle avoidance based on deep reinforcement learning, can solve problems such as difficult convergence, difficult pedestrian obstacle avoidance mechanism modeling, slow convergence, etc., to achieve easy convergence and faster convergence Speed, increased intelligence and sociability effects

Active Publication Date: 2018-07-06
SHANGHAI JIAO TONG UNIV
View PDF10 Cites 48 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] In view of the above-mentioned defects of the prior art, the technical problem to be solved by the present invention is to overcome the problem existing in the prior art that i...

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Pedestrian-sensing obstacle avoidance method for service robot based on deep reinforcement learning
  • Pedestrian-sensing obstacle avoidance method for service robot based on deep reinforcement learning
  • Pedestrian-sensing obstacle avoidance method for service robot based on deep reinforcement learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0048] The following describes several preferred embodiments of the present invention with reference to the accompanying drawings, so as to make the technical content clearer and easier to understand. The present invention can be embodied in many different forms of embodiments, and the protection scope of the present invention is not limited to the embodiments mentioned herein.

[0049] In the drawings, components with the same structure are denoted by the same numerals, and components with similar structures or functions are denoted by similar numerals. The size and thickness of each component shown in the drawings are shown arbitrarily, and the present invention does not limit the size and thickness of each component. In order to make the illustration clearer, the thickness of parts is appropriately exaggerated in some places in the drawings.

[0050] Such as figure 1 , figure 2 , image 3 , Figure 4 and Figure 5 As shown, the present invention proposes a service ro...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a pedestrian-sensing obstacle avoidance method for a service robot based on deep reinforcement learning, and relates to the field of deep learning and service robot obstacle avoidance. In a training phase of the method, firstly, training data is generated by using an ORCA algorithm; then, an experimental scene is randomly generated, and an initialized reinforcement learningmodel is interacted with the environment to generate new training data to be merged into the original training data; finally, an SGD algorithm is used for training a network on the new training datato obtain a final network model. In an execution stage of the method, the state of the surrounding pedestrians is obtained by a laser radar, the predicted state is calculated according to the trainedmodel and the bonus function, and the action of obtaining the maximum reward is selected as the output and executed. The method has strong instantaneity and adaptability, in the pedestrian environment, the robot can comply with rules for pedestrians to walk on the right lines, an efficient, safe and natural path is planned, and the intelligence and sociality of the service robot are improved.

Description

technical field [0001] The invention relates to the field of deep learning and obstacle avoidance of service robots, in particular to a pedestrian-aware obstacle avoidance method for service robots based on deep reinforcement learning. Background technique [0002] With the increase in labor costs, robots have begun to replace human workers in various fields, especially in the field of public services, such as takeaway robots, express delivery robots, shopping guide robots, etc. The scenes faced by these robots generally have many highly dynamic obstacles, such as pedestrians. How to enable service robots to navigate autonomously in a pedestrian environment and avoid pedestrian obstacles efficiently, safely and naturally has become a key issue that limits the wider application of service robots. In the pedestrian environment, the adaptability of the traditional obstacle avoidance algorithm becomes poor, and sometimes it will show unsafe behaviors such as sudden stop and sha...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G05D1/02G06N3/04
CPCG05D1/0231G05D2201/02G06N3/045
Inventor 赵忠华鲁兴龙曹一文晏懿琳
Owner SHANGHAI JIAO TONG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products