Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Mobile robot navigation method and device, computer equipment and storage medium

A technology of mobile robots and navigation methods, applied in mechanical equipment, combustion engines, neural learning methods, etc., can solve problems such as low data utilization, high complexity and cost, weak generalization performance in different scenarios, etc., and reduce computing power. volume effect

Active Publication Date: 2021-11-05
NAT UNIV OF DEFENSE TECH
View PDF2 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the complexity and uncertainty of the environment, the low utilization of data and difficult generalization of reinforcement learning itself, and the diversity of tasks all pose challenges to the robot navigation of reinforcement learning.
[0003] The traditional robot navigation technology is mainly based on the understanding of the entire environment scene. It needs to use external auxiliary equipment and its own sensors to achieve accurate pose solution. However, it generally needs to rely on equipment and is complex and costly. Difficulty adapting to changes in the environment
[0004] The robot navigation technology using reinforcement learning has some obstacles of reinforcement learning itself, the training efficiency is low, the utilization rate of data is relatively low, and it needs a lot of interaction with the environment to get a better model; it is difficult for the unknown environment The reward function is well set, and the generalization performance for different scenarios is weak

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Mobile robot navigation method and device, computer equipment and storage medium
  • Mobile robot navigation method and device, computer equipment and storage medium
  • Mobile robot navigation method and device, computer equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0064] In order to make the purpose, technical solution and advantages of the present application clearer, the present application will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present application, and are not intended to limit the present application.

[0065] The mobile robot navigation method provided in this application can be applied in the following application environments. Among them, the terminal implements a mobile robot navigation method, and extracts the features of the target point image and the scene image through the feature extraction module to obtain the state characteristics of the current state; solves the preset expert trajectory through the inverse reinforcement learning module, and obtains the reward function; The predicted execution action of the robot is output through the policy network in the A3C ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a mobile robot navigation method and device, computer equipment and a storage medium. The method comprises the following steps of: extracting features of a target point image and a scene image through a feature extraction module to obtain state features of a current state; resolving a preset expert track through an inverse reinforcement learning module to obtain a reward function; outputting a predicted execution action of a robot through a strategy network in an A3C reinforcement learning network, obtaining a predicted value function through a value network, and after the execution action obtains a next state, calculating a TD error according to the current state, the next state and the execution action to obtain a first loss function; obtaining an expert reward value according to the state features and a weight parameter, and obtaining a second loss function according to the network reward value and the expert reward value; and training the A3C reinforcement learning network and a reward network to obtain a trained mobile robot navigation model for navigation. According to the invention, the accuracy and efficiency of indoor navigation of the robot can be improved, and the generalization ability is high.

Description

technical field [0001] The present application relates to the field of computer technology, in particular to a mobile robot navigation method, device, computer equipment and storage medium based on reward network and inverse reinforcement learning. Background technique [0002] Mobile robot navigation is one of the most basic tasks of robots. All kinds of robots must have perfect navigation capabilities before they can be put into practical applications. Robots under traditional navigation tasks will be equipped with RGB cameras or depth cameras, radar, GPS and other sensors, and some even rely on high-precision prior maps, etc., which require a large cost of manpower and material resources. In recent years, it has been realized through reinforcement learning. Robot navigation only needs visual sensors to achieve better results and reduces the dependence on other sensors, which has become a hot research topic. However, the complexity and uncertainty of the environment, the ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F30/27G06N3/04G06N3/08G06F111/04G06F111/08
CPCG06F30/27G06N3/04G06N3/08G06F2111/04G06F2111/08Y02T10/40
Inventor 方强王熙童徐昕曾宇俊
Owner NAT UNIV OF DEFENSE TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products