Collision avoidance planning method for mobile robot based on deep reinforcement learning in dynamic environment

A mobile robot, dynamic environment technology, applied in neural learning methods, instruments, electromagnetic wave re-radiation and other directions, can solve the problems of unsuitable dynamic unknown environment, large limitations, etc., to achieve adaptable high efficiency, reduce difficulty, The effect of improving the success rate of obstacle avoidance

Active Publication Date: 2022-06-21
HARBIN ENG UNIV
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Generally speaking, traditional obstacle avoidance methods have relatively large limitations, especially for complex and dynamic unknown environments; while intelligent obstacle avoidance algorithms, especially the popular obstacle avoidance algorithms combined with deep learning and reinforcement learning in recent years, are suitable for continuous High-dimensional complex dynamic unknown environment has great advantages

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Collision avoidance planning method for mobile robot based on deep reinforcement learning in dynamic environment
  • Collision avoidance planning method for mobile robot based on deep reinforcement learning in dynamic environment
  • Collision avoidance planning method for mobile robot based on deep reinforcement learning in dynamic environment

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0041] The present invention will be further described below with reference to the accompanying drawings and cases.

[0042] The invention discloses a collision avoidance planning method for a mobile robot based on deep reinforcement learning in a dynamic environment, belongs to the technical field of mobile robot navigation, and can be used for effective obstacle avoidance when a mobile robot works in a multi-type dynamic obstacle environment. The invention collects the original data through a laser range finder, uses the original data as the input of the neural network after corresponding processing, establishes the LSTM neural network, outputs the corresponding parameters through the A3C algorithm, and obtains the action of each step of the robot through processing. The flow chart of the overall obstacle avoidance algorithm is as follows figure 1 shown. The present invention does not need to model the environment, is more suitable for unknown obstacle environment, adopts a...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a mobile robot collision avoidance planning method based on deep reinforcement learning in a dynamic environment, and belongs to the technical field of mobile robot navigation. The invention collects original data through a laser rangefinder, processes the original data as the input of a neural network, and establishes an LSTM neural network. Through the A3C algorithm, the neural network outputs corresponding parameters, and obtains the action of each step of the robot after processing. The present invention does not need to model the environment, and is more suitable for environments with unknown obstacles. It adopts the actor-critic framework and time difference algorithm, realizes low variance and is more suitable for continuous action spaces, and realizes the effect of learning while training. Design a continuous action space with a heading angle limit, and use 4 threads for parallel learning and training. Compared with the general deep reinforcement learning method, it greatly improves the learning and training time, reduces sample correlation, and ensures the high utilization of the exploration space and exploration strategies. Diversity, thereby improving algorithm convergence, stability and success rate of obstacle avoidance.

Description

technical field [0001] The invention belongs to the technical field of mobile robot navigation, in particular to a collision avoidance planning method for mobile robots based on deep reinforcement learning in a dynamic environment. Background technique [0002] The research of mobile robots began in the 1960s, and so far this field has become one of the important research directions of robotics. With the continuous advancement and development of science and technology, autonomous mobile robots have a high level of intelligence and can complete tasks autonomously without supervision. Therefore, mobile robots have good development space and prospects in the fields of search, detection, fire protection, and reconnaissance. In the above-mentioned special fields, the environment is usually complex and changeable, and there are not only dynamic obstacles but also static obstacles. In order to successfully realize the function of a mobile robot, an appropriate obstacle avoidance a...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G01S17/93G05D1/02G06N3/04G06N3/08
CPCG05D1/024G05D1/0221G05D1/0214
Inventor 王宏健何姗姗严浙平付忠健阮力刘超伟
Owner HARBIN ENG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products