Robot map-free navigation method based on deep safety reinforcement learning

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A technology of reinforcement learning and navigation method, applied in two-dimensional position/channel control, instrument, vehicle position/route/altitude control and other directions, it can solve the problem that the strategy cannot guarantee safety, etc., and achieve a shortened navigation path, a better path, The effect of improving the success rate of navigation

Active Publication Date: 2021-07-09

HARBIN INST OF TECH SHENZHEN GRADUATE SCHOOL

View PDF8 Cites 18 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0005]Although reinforcement learning has been successfully applied in the fields of games and control, most of the reinforcement learning work is developed based on simulation platforms, and it is more used in real mobile robots. The reason is that the strategies learned by reinforcement learning cannot be guaranteed to be safe, which may cause serious consequences

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0057] In order to further describe the technical solution of the present invention in detail, this embodiment is implemented on the premise of the technical solution of the present invention, and provides detailed implementation methods and specific steps.

[0058] Such as figure 1 Shown is the overall flowchart of the method of the present invention, a method for robot map-free navigation based on deep security reinforcement learning, comprising the following steps:

[0059] S1: Initialize the training environment, set the parameters of the mobile robot, including the maximum linear velocity, minimum linear velocity, maximum angular velocity, minimum angular velocity, and maximum number of steps of the mobile robot, set the distance from the mobile robot to the target point, laser radar information and image information input According to the dimension of the training environment, the mobile robot reward function and safety risk cost function are designed;

[0060] In one e...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention particularly relates to a robot map-free navigation method based on deep safety reinforcement learning, which specifically comprises the following steps: initializing a training environment, and designing a mobile robot reward function and a safety risk cost function; combining image information and laser radar information detected by a sensor with target information and motion information of the mobile robot, processing state information and then outputting decision actions to the robot through an Actor network, so the robot executes the actions output by the Actor network, and new state observation and reward information at the next moment is obtained from the environment; storing experience obtained by interaction between the robot and the environment in an experience pool, and updating network parameters regularly; and judging whether the training is finished or not, and applying the trained model to a real mobile robot for navigation. According to the method, based on deep safety reinforcement learning of an actor-critic-safety (ACS) framework, by introducing a constraint strategy optimization (CPO) algorithm, the safety of reinforcement learning for a map-free navigation task is improved.

Description

technical field [0001] The invention relates to the field of map-free navigation of robots, in particular to a map-free navigation method for robots based on deep safety reinforcement learning. Background technique [0002] Robot navigation refers to the technology that the robot reaches the target position from the current initial position without colliding with other static or dynamic obstacles during the process. In recent years, robot navigation technology has been widely used in the fields of sweeping robots, service robots, logistics robots, special rescue robots and Mars exploration robots. Safety in navigation is mainly reflected in the ability to avoid collisions. [0003] Comparing with maps and without maps: the current relatively mature navigation technology is basically based on SLAM mapping with map navigation. However, when special robots such as field search and rescue are performing tasks, the environment is often unknown, and maps cannot be built at this t...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): G05D1/02

CPCG05D1/0223G05D1/0214G05D1/0221G05D1/0276

Inventor 吕少华李衍杰许运鸿刘奇陈美玲赵威龙刘悦丞庞玺政

Owner HARBIN INST OF TECH SHENZHEN GRADUATE SCHOOL

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Robot map-free navigation method based on deep safety reinforcement learning

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology