Supercharge Your Innovation With Domain-Expert AI Agents!

Robot path navigation method and system based on deep reinforcement learning

A technology of reinforcement learning and path navigation, applied in navigation calculation tools, general control systems, control/regulation systems, etc., can solve problems such as poor flexibility, inapplicability to dynamic and unknown environments, and large limitations

Active Publication Date: 2020-08-04
SHANDONG NORMAL UNIV
View PDF8 Cites 12 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] In addition, the inventor believes that the traditional robot navigation method mainly depends on the obstacle map of the navigation environment, and the navigation process requires all or part of the prior environmental knowledge, which has poor flexibility and large limitations, and is not suitable for complex and dynamic unknowns. environment; and due to the huge difference between the simulated environment and the highly complex real environment, it is difficult to convert the trained model into real robot navigation

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Robot path navigation method and system based on deep reinforcement learning
  • Robot path navigation method and system based on deep reinforcement learning
  • Robot path navigation method and system based on deep reinforcement learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0028] Such as figure 1 As shown, this embodiment provides a robot path navigation method based on deep reinforcement learning, including:

[0029] S1: Construct a dual Actor-Critic neural network based on deep reinforcement learning, and use the first Actor-Critic neural network to output the robot's initial movement action and the evaluation value of the initial movement action according to the acquired robot's current motion state;

[0030] S2: Use the current motion state of the robot and the evaluation value of the initial movement action as the training set to train the second Actor-Critic neural network, and update the first Actor-Critic neural network according to the trained second Actor-Critic neural network, The updated first Actor-Critic neural network is used to output the optimal movement action according to the current motion state of the robot, so as to navigate the optimal path of the robot.

[0031] In this embodiment, the real environment is simulated, and ...

Embodiment 2

[0073] This embodiment provides a robot path navigation system based on deep reinforcement learning, including:

[0074] The initial path navigation module is used to construct a dual Actor-Critic neural network based on deep reinforcement learning, and uses the first Actor-Critic neural network to output the initial movement action of the robot and the evaluation value of the initial movement action according to the acquired current motion state of the robot;

[0075] The path navigation update module is used to train the second Actor-Critic neural network with the current motion state of the robot and the evaluation value of the initial movement action as a training set, and to train the first Actor-Critic neural network according to the trained second Actor-Critic neural network. The neural network is updated, and the updated first Actor-Critic neural network outputs an optimal movement action according to the current motion state of the robot, so as to navigate the optimal ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a robot path navigation method and system based on deep reinforcement learning. The method comprises the following steps: constructing a double Actor-Critic neural network based on deep reinforcement learning, and outputting an initial movement action of a robot and an evaluation value of the initial movement action by adopting a first Actor-Critic neural network accordingto an obtained current movement state of the robot; training a second Actor-Critic neural network by taking the evaluation values of the current movement state and the initial movement action of the robot as a training set; and updating the first Actor-Critic neural network according to the trained second Actor-Critic neural network, and outputting an optimal moving action by using the updated first Actor-Critic neural network according to the current movement state of the robot, thereby performing optimal path navigation on the robot. The optimal action strategy under the current movement state of the robot is found in combination with the perception capability of the deep learning method and the strategy capability of the reinforcement learning method, and the limitation that traditionalrobot navigation depends on an obstacle map is eliminated in a highly complex scene.

Description

technical field [0001] The invention relates to the technical field of path planning, in particular to a robot path navigation method and system based on deep reinforcement learning. Background technique [0002] The statements in this section merely provide background information related to the present invention and do not necessarily constitute prior art. [0003] Motion planning consists of path planning and trajectory planning. The sequence of points or curves connecting the starting point and the end point is called a path, and the strategy that forms the path is called path planning. The non-collision movement of the robot, that is, the robot navigation is also a kind of path planning. Traditional path planning algorithms include: simulated annealing algorithm, artificial potential field method, fuzzy logic algorithm, tabu search algorithm, etc. [0004] Deep reinforcement learning combines the perception ability of deep learning and the decision-making ability of re...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G05B13/02G05B13/04G05D1/02G01C21/20
CPCG05B13/027G05B13/042G05D1/0088G05D1/0221G05D1/024G05D1/0274G01C21/20Y02T10/40
Inventor 吕蕾周青林丁昊张凤军刘翔
Owner SHANDONG NORMAL UNIV
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More