Path planning method based on reinforcement learning algorithm in dynamic environment

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A dynamic environment and reinforcement learning technology, applied in the directions of road network navigators, navigation, instruments, etc., can solve the problems of inability to deal with dynamic environments, low efficiency of heuristic algorithm search, etc., and achieve a wide range of applications.

Active Publication Date: 2020-09-11

EAST CHINA NORMAL UNIV

View PDF5 Cites 4 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

But the general algorithm cannot deal with the problem of dynamic environment

And in a complex environment, the search efficiency of the heuristic algorithm will become lower

At the same time, for obstacle avoidance planning in unknown dynamic environments, such as D*, Lifelong A* and other algorithms are just a timely planning

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0039] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0040] like figure 1 , figure 2 , image 3 , Figure 4 and Figure 5 As shown, a path planning method based on a reinforcement learning algorithm under a dynamic environment, the path planning method based on a reinforcement learning algorithm under the dynamic environment includes the following steps:

[0041]Step S1: Model the working environment by using the grid method according to the dynamic known environment, and set the starting point and target p...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a path planning method based on a reinforcement learning algorithm in a dynamic environment. The method comprises the following steps of carrying out the modeling of an operation environment through a grid method according to a dynamic known environment, and setting a starting point and a target point according to an operation task, constructing an improved Q-learning algorithm based on the time variable t, and endowing the intelligent agent with a stop action,learning a dynamic known environment based on an improved Q-learning algorithm according to the coordinates ofthe starting point and the target point, and outputting a planned path,exploring and learning a dynamic unknown environment state through the intelligent agent and obtaining an output optimal path. After training is completed, the speed is very high, which is attributed to learning attributes of a machine learning algorithm. In the aspect of effect, no matter what conditions the environment is, the algorithm cannot generate collision, and the application of the algorithm in path planning is very wide; and as a result, since the intelligent agent is endowed with a new stop action, a better pathcan be found in the planning.

Description

technical field [0001] The invention relates to a path planning method, in particular to a path planning method based on a reinforcement learning algorithm in a dynamic environment. Background technique [0002] Path planning problems generally deal with known and unknown environmental conditions. Many existing algorithms can solve its path planning problem, but for the path planning problem in a dynamic environment (moving obstacles), the algorithm cannot quickly solve the problem in this scenario, and the given planned route may collide. [0003] For example, the general heuristic algorithm combines the advantages of depth search and breadth search, so that the pathfinding algorithm can find the optimal solution with a high probability while ensuring the speed. But the general algorithm cannot deal with the problem of dynamic environment. And in a complex environment, the search efficiency of the heuristic algorithm will become lower. At the same time, for obstacle avoi...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): G01C21/34

CPCG01C21/3446

Inventor 吕长虹朱玥炜

Owner EAST CHINA NORMAL UNIV

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Path planning method based on reinforcement learning algorithm in dynamic environment

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology