Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Mobile robot collision avoidance planning method based on deep reinforcement learning in static environment

A mobile robot, static environment technology, applied in the field of mobile robot navigation, can solve the problems of unsuitable dynamic unknown environment, large limitations, etc., to achieve rich exploration strategies, smooth obstacle avoidance trajectory, and high portability.

Active Publication Date: 2020-02-04
HARBIN ENG UNIV
View PDF6 Cites 11 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Generally speaking, traditional obstacle avoidance methods have relatively large limitations, especially for complex and dynamic unknown environments; while intelligent obstacle avoidance algorithms, especially the popular obstacle avoidance algorithms combined with deep learning and reinforcement learning in recent years, are suitable for continuous High-dimensional complex dynamic unknown environment has great advantages

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Mobile robot collision avoidance planning method based on deep reinforcement learning in static environment
  • Mobile robot collision avoidance planning method based on deep reinforcement learning in static environment
  • Mobile robot collision avoidance planning method based on deep reinforcement learning in static environment

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0034] The present invention will be further described below in conjunction with accompanying drawings and cases.

[0035] The invention discloses a mobile robot collision avoidance planning method based on deep reinforcement learning in a static environment, which can be used for effective obstacle avoidance when a mobile robot works in a complex static obstacle environment including a loop-shaped obstacle environment. The present invention performs corresponding data processing on the basis of the original data collected by the laser range finder, takes the processed data as the state S of the A3C algorithm, constructs the corresponding A3C-LSTM neural network, takes the state S as the network input, and passes the A3C Algorithm, the neural network outputs the corresponding parameters, and uses the relevant parameters to select the actions performed by the mobile robot at each step through the normal distribution. The overall obstacle avoidance algorithm flow chart is as foll...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention belongs to the technical field of mobile robot navigation, and particularly relates to a mobile robot collision avoidance planning method based on deep reinforcement learning in a staticenvironment. The method comprises the following steps of: acquiring original data by using a laser range finder, taking the processed data as a state S of an A3C algorithm, constructing an A3C-LSTM neural network, taking the state S as network input, outputting corresponding parameters by the neural network through the A3C algorithm, and selecting actions executed by a mobile robot in each step through normal distribution by utilizing the parameters. According to the method, the environment does not need to be modeled, and the mobile robot successfully avoids obstacles in a complex static obstacle environment through a deep reinforcement learning algorithm. According to the method, a continuous action space model with bow turning constraint is designed, multi-thread asynchronous learningis adopted, and compared with a common deep reinforcement learning method, the learning training time is greatly prolonged, the sample correlation is reduced, the high utilization of the exploration space and the diversity of the exploration strategy are guaranteed, and the algorithm convergence, stability and obstacle avoidance success rate are improved.

Description

technical field [0001] The invention belongs to the technical field of mobile robot navigation, and in particular relates to a mobile robot collision avoidance planning method based on deep reinforcement learning in a static environment. Background technique [0002] The application of mobile robots has penetrated into all areas of life. In industrial production lines, robots can replace humans in heavy and repetitive tasks, improving production efficiency and liberating human labor. [0003] Obstacle avoidance is a key problem in robotics research in the process of multi-robots performing tasks. It is a dynamic, real-time, uncertain typical environment model. Furthermore, it requires the robot to quickly and independently adapt to changes in the environment. For obstacle avoidance problems, behavior-based response control methods and rule-based control strategies are often used. Both methods are simple, highly deterministic, and the robot responds better to the environme...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G05D1/02G06N3/04G06N3/08
CPCG05D1/0221G05D1/0214G06N3/08G06N3/044G06N3/045
Inventor 王宏健何姗姗张宏瀚袁建亚于丹贺巨义
Owner HARBIN ENG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products