A method for solving the shortest path problem based on an improved Q-learning algorithm

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A shortest path and solution technology, applied in the field of reinforcement learning, can solve problems such as difficult convergence of Q tables, and achieve the effect of alleviating the shortest path problem and accelerating convergence

Inactive Publication Date: 2019-05-28

NANJING UNIV OF AERONAUTICS & ASTRONAUTICS

View PDF0 Cites 4 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0004] The technical problem to be solved by the present invention is to provide a solution to the shortest path problem based on the improved Q-learning algorithm, which solves the problem that the Q table is difficult to converge in the process of using the Q-learning algorithm to find the shortest path

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0028] Embodiments of the invention are described in detail below, examples of which are illustrated in the accompanying drawings. The embodiments described below by referring to the figures are exemplary only for explaining the present invention and should not be construed as limiting the present invention.

[0029] Such as figure 1 Shown, be a kind of flow chart of the solution method of the shortest path problem based on improved Q-learning algorithm of the present invention, concrete steps are:

[0030] Step 1. Establish a position-action value table. The position refers to the position of the agent in space, and the action refers to the moving direction of the agent. Hereinafter, it is referred to as the Q table. Q(x,a) represents the value that can be obtained by taking action a at position x, initialize all the Q tables to 0, and select the appropriate update step size α, greedy parameter ε, reward discount γ and smoothing parameter s;

[0031] Step 2. Set the initial...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a method for solving the shortest path problem based on an improved Q-learning algorithm. The method specifically comprises the following steps of: establishing a position-action value table, completely initializing the Q table to be 0, and selecting an appropriate updating step length, greedy parameters, reward discount and smooth parameters; setting an initial position tobe recorded as x0 and a target position to be recorded as xtf according to the requirement of the shortest path problem; The current position is marked as X, the initial position is assigned to the current position, and epson-is used for marking the current position as X; selecting an action according to a greedy method, and recording the action as A; Taking an action A at the X position, recording the obtained earnings r and the position X'transferred after the action A is taken, if X 'is equal to the target position xtf, assigning the initial position x0 to X, and otherwise assigning the X'to X; Updating the Q table; All positions adjacent to the position X are marked as X1, X2,..., Xn, and a Q table is updated; And repeating the above steps until the Q table converges. According to the invention, the existing Q-use at present is relieved; And the problem that the Q table convergence time is long when the problem of the shortest path is solved by the leader is solved.

Description

technical field [0001] The invention relates to a method for solving the shortest path problem based on an improved Q-learning algorithm, and belongs to the technical field of reinforcement learning. Background technique [0002] The shortest path problem widely exists in navigation, urban planning, video games and other fields. Traditional solutions to shortest path problems generally rely on shortest path algorithms, such as Floyd's algorithm, Dijkstra's algorithm, or A * Algorithms and other search algorithms. In recent years, with the rise of reinforcement learning, the use of reinforcement learning in the shortest path problem has become more and more extensive. The Q-Learning algorithm is a classic reinforcement learning algorithm, which was first proposed in the early 1990s. The earliest Q-Learning is based on the Q table, that is, a table that records any action taken in any state. This value is generally recorded as Q(x,a), where x represents the state and a repr...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): G06N20/00

Inventor 魏小辉廖玮李旭波高天驰李龙

Owner NANJING UNIV OF AERONAUTICS & ASTRONAUTICS

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

A method for solving the shortest path problem based on an improved Q-learning algorithm

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology