A real-time online path planning method for deep reinforcement learning

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A technology of reinforcement learning and path planning, applied in two-dimensional position/channel control, etc., can solve problems affecting decision-making optimization, poor generalization ability, state-space combination explosion, etc., to solve path planning problems, and has wide applicability , method design reasonable effect

Active Publication Date: 2019-10-22

NORTHWESTERN POLYTECHNICAL UNIV

View PDF4 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0013] Neural network and reinforcement learning each have certain problems. Neural network has excellent learning ability, but poor generalization ability is its fatal shortcoming. For reinforcement learning, when the system becomes complex, a large number of parameters are required to describe it. This causes a combinatorial explosion of state-space-to-action-space mappings, which in turn affects the optimization problem of action decisions

There are no research reports on path planning through deep reinforcement learning using image parsing information

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0048] Embodiments of the present invention are described in detail below, and the embodiments are exemplary and intended to explain the present invention, but should not be construed as limiting the present invention.

[0049] The deep reinforcement learning path planning method in this embodiment includes the following steps:

[0050] Step 1: The camera image is collected, and then the image is input into the scene analysis network to obtain the corresponding analysis result map. The scene parsing network includes a convolutional neural network feature learning layer (Feature Learning Layers), a conditional random field structured learning layer (Structural Learning Layer), and a feature fusion layer based on a deep belief network (Feature Fusion Layers).

[0051] Step 1.1: Feature Learning Layers: The convolutional neural network performs feature learning on the images collected by the camera, and generates high-level information features corresponding to each pixel of the ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The present invention provides a deep and reinforcement learning-based real-time online path planning method. According to the method, the high-level semantic information of an image is obtained through using a deep learning method, the path planning of the end-to-end real-time scenes of an environment can be completed through using a reinforcement learning method. In a training process, image information collected in the environment is brought into a scene analysis network as a current state, so that an analytical result can be obtained; the analytical result is inputted into a designed deep cyclic neural network; and the decision-making action of each step of an intelligent body in a specific scene can be obtained through training, so that an optimal complete path can be obtained. In an actual application process, image information collected by a camera is inputted into a trained deep and reinforcement learning network, so that the direction information of the walking of the intelligent body can be obtained. With the method of the invention, obtained image information can be utilized to the greatest extent under a premise that the robustness of the method is ensured and the method slightly depends on the environment, and real-time scene walking information path planning can be realized.

Description

technical field [0001] The invention relates to the fields of computer image processing and machine learning, and specifically relates to a real-time online path planning method of deep reinforcement learning, which implements path planning of real-time scenes by applying deep learning and reinforcement learning. Background technique [0002] Traditional path planning methods include simulated annealing algorithm, artificial potential field method, fuzzy logic algorithm, tabu search algorithm, etc.; intelligent bionics methods include ant colony algorithm, neural network algorithm, particle swarm algorithm, genetic algorithm, etc.; there are also some artificially invented Algorithms are widely used because of their excellent characteristics. These algorithms generally have strong path search capabilities and can play a good role in discrete path topology networks, including: A* algorithm, Dijkstra algorithm, Floyd algorithm, etc. With the continuous development of science a...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Patents(China)

IPC IPC(8): G05D1/02

Inventor 布树辉孙林杰

Owner NORTHWESTERN POLYTECHNICAL UNIV

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

A real-time online path planning method for deep reinforcement learning

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology