Deep and reinforcement learning-based real-time online path planning method of

A technology of reinforcement learning and path planning, applied in two-dimensional position/channel control and other directions, it can solve problems such as poor generalization ability, explosion of state-space combinations, and optimization of action decision-making, achieving wide applicability, reasonable method design, The effect of solving the path planning problem

Active Publication Date: 2017-07-21
NORTHWESTERN POLYTECHNICAL UNIV
View PDF4 Cites 178 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0013] Neural network and reinforcement learning each have certain problems. Neural network has excellent learning ability, but poor generalization ability is its fatal shortcoming. For reinforcement learning, when the system becomes complex, a large number of parameters are required to describe it. This causes a combinatorial explosion of state-space-to-action-space mappings, which in turn affects the optimization problem of action decisions
There are no research reports on path planning through deep reinforcement learning using image parsing information

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Deep and reinforcement learning-based real-time online path planning method of
  • Deep and reinforcement learning-based real-time online path planning method of
  • Deep and reinforcement learning-based real-time online path planning method of

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0048] Embodiments of the present invention are described in detail below, and the embodiments are exemplary and intended to explain the present invention, but should not be construed as limiting the present invention.

[0049] The deep reinforcement learning path planning method in this embodiment includes the following steps:

[0050] Step 1: The camera image is collected, and then the image is input into the scene analysis network to obtain the corresponding analysis result map. The scene parsing network includes a convolutional neural network feature learning layer (Feature Learning Layers), a conditional random field structured learning layer (Structural Learning Layer), and a feature fusion layer based on a deep belief network (Feature Fusion Layers).

[0051] Step 1.1: Feature Learning Layers: The convolutional neural network performs feature learning on the images collected by the camera, and generates high-level information features corresponding to each pixel of the ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The present invention provides a deep and reinforcement learning-based real-time online path planning method. According to the method, the high-level semantic information of an image is obtained through using a deep learning method, the path planning of the end-to-end real-time scenes of an environment can be completed through using a reinforcement learning method. In a training process, image information collected in the environment is brought into a scene analysis network as a current state, so that an analytical result can be obtained; the analytical result is inputted into a designed deep cyclic neural network; and the decision-making action of each step of an intelligent body in a specific scene can be obtained through training, so that an optimal complete path can be obtained. In an actual application process, image information collected by a camera is inputted into a trained deep and reinforcement learning network, so that the direction information of the walking of the intelligent body can be obtained. With the method of the invention, obtained image information can be utilized to the greatest extent under a premise that the robustness of the method is ensured and the method slightly depends on the environment, and real-time scene walking information path planning can be realized.

Description

technical field [0001] The invention relates to the fields of computer image processing and machine learning, and specifically relates to a real-time online path planning method of deep reinforcement learning, which implements path planning of real-time scenes by applying deep learning and reinforcement learning. Background technique [0002] Traditional path planning methods include simulated annealing algorithm, artificial potential field method, fuzzy logic algorithm, tabu search algorithm, etc.; intelligent bionics methods include ant colony algorithm, neural network algorithm, particle swarm algorithm, genetic algorithm, etc.; there are also some artificially invented Algorithms are widely used because of their excellent characteristics. These algorithms generally have strong path search capabilities and can play a good role in discrete path topology networks, including: A* algorithm, Dijkstra algorithm, Floyd algorithm, etc. With the continuous development of science a...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G05D1/02
Inventor 布树辉孙林杰
Owner NORTHWESTERN POLYTECHNICAL UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products