Collision avoidance planning method for mobile robots based on deep reinforcement learning in dynamic environment

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A mobile robot and dynamic environment technology, applied in the direction of instruments, non-electric variable control, two-dimensional position/channel control, etc., can solve the problems of inappropriate and large limitations in dynamic unknown environments, and achieve high adaptability and efficiency , Reduce the difficulty and improve the success rate of obstacle avoidance

Active Publication Date: 2019-12-31

HARBIN ENG UNIV

View PDF4 Cites 40 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

Generally speaking, traditional obstacle avoidance methods have relatively large limitations, especially for complex and dynamic unknown environments; while intelligent obstacle avoidance algorithms, especially the popular obstacle avoidance algorithms combined with deep learning and reinforcement learning in recent years, are suitable for continuous High-dimensional complex dynamic unknown environment has great advantages

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0041] The present invention will be further described below in conjunction with accompanying drawings and cases.

[0042] The invention discloses a mobile robot collision avoidance planning method based on deep reinforcement learning in a dynamic environment, belongs to the technical field of mobile robot navigation, and can be used for effective obstacle avoidance when a mobile robot works in a multi-type dynamic obstacle environment. The invention collects raw data through a laser rangefinder, processes the raw data as input to a neural network, and establishes an LSTM neural network. Through the A3C algorithm, the neural network outputs corresponding parameters, and obtains the action of each step of the robot after processing. The flow chart of the overall obstacle avoidance algorithm is as follows: figure 1 shown. The present invention does not need to model the environment, and is more suitable for environments with unknown obstacles. It adopts the actor-critic framewo...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a collision avoidance planning method for mobile robots based on deep reinforcement learning in a dynamic environment, and belongs to the technical field of mobile robot navigation. The method of the invention includes the following steps of: collecting raw data through a laser rangefinder, processing the raw data as input of a neural network, and building an LSTM neural network; through an A3C algorithm, outputting corresponding parameters by the neural network, and processing the corresponding parameters to obtain the action of each step of the robot. The scheme of the invention does not need to model the environment, is more suitable for an unknown obstacle environment, adopts an actor-critic framework and a temporal difference algorithm, is more suitable for a continuous motion space while realizing low variance, and realizes the effect of learning while training. The scheme of the invention designs the continuous motion space with a heading angle limitationand uses 4 threads for parallel learning and training, so that compared with general deep reinforcement learning methods, the learning and training time is greatly improved, the sample correlation isreduced, the high utilization of exploration spaces and the diversity of exploration strategies are guaranteed, and thus the algorithm convergence, stability and the success rate of obstacle avoidance can be improved.

Description

technical field [0001] The invention belongs to the technical field of mobile robot navigation, and in particular relates to a mobile robot collision avoidance planning method based on deep reinforcement learning in a dynamic environment. Background technique [0002] The research on mobile robots began in the 1960s, and so far this field has become one of the important research directions of robotics. With the continuous progress and development of science and technology, autonomous mobile robots have a high level of intelligence and can complete tasks autonomously without supervision. Therefore, in the fields of search, detection, fire protection, and investigation, mobile robots have good development space and prospects. In the above-mentioned special fields, the environment is usually complex and changeable. There are not only dynamic obstacles but also static obstacles. In order to successfully realize the functions of mobile robots, it is necessary to design appropria...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): G05D1/02

CPCG05D1/024G05D1/0221G05D1/0214

Inventor 王宏健何姗姗严浙平付忠健阮力刘超伟

Owner HARBIN ENG UNIV

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Collision avoidance planning method for mobile robots based on deep reinforcement learning in dynamic environment

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology