Robot map-free navigation method based on deep safety reinforcement learning

A technology of reinforcement learning and navigation method, applied in two-dimensional position/channel control, instrument, vehicle position/route/altitude control and other directions, it can solve the problem that the strategy cannot guarantee safety, etc., and achieve a shortened navigation path, a better path, The effect of improving the success rate of navigation

Active Publication Date: 2021-07-09
HARBIN INST OF TECH SHENZHEN GRADUATE SCHOOL
View PDF8 Cites 18 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005]Although reinforcement learning has been successfully applied in the fields of games and control, most of the reinforcement learning work is developed based on simulation platforms, and it is more used in real mobile robots. The reason is that the strategies learned by reinforcement learning cannot be guaranteed to be safe, which may cause serious consequences

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Robot map-free navigation method based on deep safety reinforcement learning
  • Robot map-free navigation method based on deep safety reinforcement learning
  • Robot map-free navigation method based on deep safety reinforcement learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0057] In order to further describe the technical solution of the present invention in detail, this embodiment is implemented on the premise of the technical solution of the present invention, and provides detailed implementation methods and specific steps.

[0058] Such as figure 1 Shown is the overall flowchart of the method of the present invention, a method for robot map-free navigation based on deep security reinforcement learning, comprising the following steps:

[0059] S1: Initialize the training environment, set the parameters of the mobile robot, including the maximum linear velocity, minimum linear velocity, maximum angular velocity, minimum angular velocity, and maximum number of steps of the mobile robot, set the distance from the mobile robot to the target point, laser radar information and image information input According to the dimension of the training environment, the mobile robot reward function and safety risk cost function are designed;

[0060] In one e...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention particularly relates to a robot map-free navigation method based on deep safety reinforcement learning, which specifically comprises the following steps: initializing a training environment, and designing a mobile robot reward function and a safety risk cost function; combining image information and laser radar information detected by a sensor with target information and motion information of the mobile robot, processing state information and then outputting decision actions to the robot through an Actor network, so the robot executes the actions output by the Actor network, and new state observation and reward information at the next moment is obtained from the environment; storing experience obtained by interaction between the robot and the environment in an experience pool, and updating network parameters regularly; and judging whether the training is finished or not, and applying the trained model to a real mobile robot for navigation. According to the method, based on deep safety reinforcement learning of an actor-critic-safety (ACS) framework, by introducing a constraint strategy optimization (CPO) algorithm, the safety of reinforcement learning for a map-free navigation task is improved.

Description

technical field [0001] The invention relates to the field of map-free navigation of robots, in particular to a map-free navigation method for robots based on deep safety reinforcement learning. Background technique [0002] Robot navigation refers to the technology that the robot reaches the target position from the current initial position without colliding with other static or dynamic obstacles during the process. In recent years, robot navigation technology has been widely used in the fields of sweeping robots, service robots, logistics robots, special rescue robots and Mars exploration robots. Safety in navigation is mainly reflected in the ability to avoid collisions. [0003] Comparing with maps and without maps: the current relatively mature navigation technology is basically based on SLAM mapping with map navigation. However, when special robots such as field search and rescue are performing tasks, the environment is often unknown, and maps cannot be built at this t...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G05D1/02
CPCG05D1/0223G05D1/0214G05D1/0221G05D1/0276
Inventor 吕少华李衍杰许运鸿刘奇陈美玲赵威龙刘悦丞庞玺政
Owner HARBIN INST OF TECH SHENZHEN GRADUATE SCHOOL
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products