Robot path navigation method and system based on deep reinforcement learning

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A technology of reinforcement learning and path navigation, applied in navigation calculation tools, general control systems, control/regulation systems, etc., can solve problems such as poor flexibility, inapplicability to dynamic and unknown environments, and large limitations

Active Publication Date: 2020-08-04

SHANDONG NORMAL UNIV

View PDF8 Cites 12 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0005] In addition, the inventor believes that the traditional robot navigation method mainly depends on the obstacle map of the navigation environment, and the navigation process requires all or part of the prior environmental knowledge, which has poor flexibility and large limitations, and is not suitable for complex and dynamic unknowns. environment; and due to the huge difference between the simulated environment and the highly complex real environment, it is difficult to convert the trained model into real robot navigation

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment 1

[0028] Such as figure 1 As shown, this embodiment provides a robot path navigation method based on deep reinforcement learning, including:

[0029] S1: Construct a dual Actor-Critic neural network based on deep reinforcement learning, and use the first Actor-Critic neural network to output the robot's initial movement action and the evaluation value of the initial movement action according to the acquired robot's current motion state;

[0030] S2: Use the current motion state of the robot and the evaluation value of the initial movement action as the training set to train the second Actor-Critic neural network, and update the first Actor-Critic neural network according to the trained second Actor-Critic neural network, The updated first Actor-Critic neural network is used to output the optimal movement action according to the current motion state of the robot, so as to navigate the optimal path of the robot.

[0031] In this embodiment, the real environment is simulated, and ...

Embodiment 2

[0073] This embodiment provides a robot path navigation system based on deep reinforcement learning, including:

[0074] The initial path navigation module is used to construct a dual Actor-Critic neural network based on deep reinforcement learning, and uses the first Actor-Critic neural network to output the initial movement action of the robot and the evaluation value of the initial movement action according to the acquired current motion state of the robot;

[0075] The path navigation update module is used to train the second Actor-Critic neural network with the current motion state of the robot and the evaluation value of the initial movement action as a training set, and to train the first Actor-Critic neural network according to the trained second Actor-Critic neural network. The neural network is updated, and the updated first Actor-Critic neural network outputs an optimal movement action according to the current motion state of the robot, so as to navigate the optimal ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a robot path navigation method and system based on deep reinforcement learning. The method comprises the following steps: constructing a double Actor-Critic neural network based on deep reinforcement learning, and outputting an initial movement action of a robot and an evaluation value of the initial movement action by adopting a first Actor-Critic neural network accordingto an obtained current movement state of the robot; training a second Actor-Critic neural network by taking the evaluation values of the current movement state and the initial movement action of the robot as a training set; and updating the first Actor-Critic neural network according to the trained second Actor-Critic neural network, and outputting an optimal moving action by using the updated first Actor-Critic neural network according to the current movement state of the robot, thereby performing optimal path navigation on the robot. The optimal action strategy under the current movement state of the robot is found in combination with the perception capability of the deep learning method and the strategy capability of the reinforcement learning method, and the limitation that traditionalrobot navigation depends on an obstacle map is eliminated in a highly complex scene.

Description

technical field [0001] The invention relates to the technical field of path planning, in particular to a robot path navigation method and system based on deep reinforcement learning. Background technique [0002] The statements in this section merely provide background information related to the present invention and do not necessarily constitute prior art. [0003] Motion planning consists of path planning and trajectory planning. The sequence of points or curves connecting the starting point and the end point is called a path, and the strategy that forms the path is called path planning. The non-collision movement of the robot, that is, the robot navigation is also a kind of path planning. Traditional path planning algorithms include: simulated annealing algorithm, artificial potential field method, fuzzy logic algorithm, tabu search algorithm, etc. [0004] Deep reinforcement learning combines the perception ability of deep learning and the decision-making ability of re...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G05B13/02G05B13/04G05D1/02G01C21/20

CPCG05B13/027G05B13/042G05D1/0088G05D1/0221G05D1/024G05D1/0274G01C21/20Y02T10/40

Inventor 吕蕾周青林丁昊张凤军刘翔

Owner SHANDONG NORMAL UNIV

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Robot path navigation method and system based on deep reinforcement learning

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment 1

Embodiment 2

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology