Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Collision avoidance planning method for mobile robots based on deep reinforcement learning in static environment

A mobile robot and static environment technology, applied in the field of mobile robot navigation, can solve the problems that the dynamic unknown environment is not suitable and has large limitations, and achieve the effects of rich exploration strategies, smooth obstacle avoidance trajectory, and good environmental adaptability

Active Publication Date: 2022-08-02
HARBIN ENG UNIV
View PDF6 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Generally speaking, traditional obstacle avoidance methods have relatively large limitations, especially for complex and dynamic unknown environments; while intelligent obstacle avoidance algorithms, especially the popular obstacle avoidance algorithms combined with deep learning and reinforcement learning in recent years, are suitable for continuous High-dimensional complex dynamic unknown environment has great advantages

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Collision avoidance planning method for mobile robots based on deep reinforcement learning in static environment
  • Collision avoidance planning method for mobile robots based on deep reinforcement learning in static environment
  • Collision avoidance planning method for mobile robots based on deep reinforcement learning in static environment

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0034] The present invention will be further described below in conjunction with the accompanying drawings and cases.

[0035] The invention discloses a collision avoidance planning method for a mobile robot based on deep reinforcement learning in a static environment, which can be used for effective obstacle avoidance when a mobile robot works in a complex static obstacle environment including a looped obstacle environment. The present invention performs corresponding data processing on the basis of the original data collected by the laser rangefinder, takes the processed data as the state S of the A3C algorithm, constructs a corresponding A3C-LSTM neural network, takes the state S as the network input, and uses the A3C-LSTM neural network as the network input. Algorithm, the neural network outputs the corresponding parameters, and uses the relevant parameters to select the action performed by each step of the mobile robot through the normal distribution. The overall obstacle ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention belongs to the technical field of mobile robot navigation, in particular to a collision avoidance planning method for mobile robots based on deep reinforcement learning in a static environment. The present invention uses a laser rangefinder to collect raw data, takes the processed data as the state S of the A3C algorithm, constructs an A3C-LSTM neural network, takes the state S as the network input, and uses the A3C algorithm to output the corresponding parameters from the neural network. The actions performed by the mobile robot at each step are selected by the normal distribution. The invention does not need to model the environment, and finally realizes the successful obstacle avoidance of the mobile robot in the complex static obstacle environment through the deep reinforcement learning algorithm. The present invention designs a continuous action space model with turning constraints, and adopts multi-threaded asynchronous learning. Compared with the general deep reinforcement learning method, the learning and training time is greatly improved, the sample correlation is reduced, and the high utilization of the exploration space and the exploration strategy are guaranteed. The diversity of the algorithm improves the convergence, stability and obstacle avoidance success rate of the algorithm.

Description

technical field [0001] The invention belongs to the technical field of mobile robot navigation, in particular to a collision avoidance planning method for mobile robots based on deep reinforcement learning in a static environment. Background technique [0002] The application of mobile robots has penetrated into all fields of life. In industrial production lines, robots can replace humans to perform heavy and repetitive work, improve production efficiency and liberate human labor. [0003] Obstacle avoidance is a key issue in robotics research during multi-robot task execution. It is a dynamic, real-time, uncertain typical environment model. Furthermore, it requires robots to adapt quickly and independently to changes in the environment. For obstacle avoidance problems, behavior-based response control methods and rule-based control strategies are often used. Both methods are simple, deterministic, and the robot responds better to the environment. But when tasks and envir...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G05D1/02G06N3/04G06N3/08
CPCG05D1/0221G05D1/0214G06N3/08G06N3/044G06N3/045
Inventor 王宏健何姗姗张宏瀚袁建亚于丹贺巨义
Owner HARBIN ENG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products