Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

A method for solving the shortest path problem based on an improved Q-learning algorithm

A shortest path and solution technology, applied in the field of reinforcement learning, can solve problems such as difficult convergence of Q tables, and achieve the effect of alleviating the shortest path problem and accelerating convergence

Inactive Publication Date: 2019-05-28
NANJING UNIV OF AERONAUTICS & ASTRONAUTICS
View PDF0 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The technical problem to be solved by the present invention is to provide a solution to the shortest path problem based on the improved Q-learning algorithm, which solves the problem that the Q table is difficult to converge in the process of using the Q-learning algorithm to find the shortest path

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A method for solving the shortest path problem based on an improved Q-learning algorithm
  • A method for solving the shortest path problem based on an improved Q-learning algorithm
  • A method for solving the shortest path problem based on an improved Q-learning algorithm

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0028] Embodiments of the invention are described in detail below, examples of which are illustrated in the accompanying drawings. The embodiments described below by referring to the figures are exemplary only for explaining the present invention and should not be construed as limiting the present invention.

[0029] Such as figure 1 Shown, be a kind of flow chart of the solution method of the shortest path problem based on improved Q-learning algorithm of the present invention, concrete steps are:

[0030] Step 1. Establish a position-action value table. The position refers to the position of the agent in space, and the action refers to the moving direction of the agent. Hereinafter, it is referred to as the Q table. Q(x,a) represents the value that can be obtained by taking action a at position x, initialize all the Q tables to 0, and select the appropriate update step size α, greedy parameter ε, reward discount γ and smoothing parameter s;

[0031] Step 2. Set the initial...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a method for solving the shortest path problem based on an improved Q-learning algorithm. The method specifically comprises the following steps of: establishing a position-action value table, completely initializing the Q table to be 0, and selecting an appropriate updating step length, greedy parameters, reward discount and smooth parameters; setting an initial position tobe recorded as x0 and a target position to be recorded as xtf according to the requirement of the shortest path problem; The current position is marked as X, the initial position is assigned to the current position, and epson-is used for marking the current position as X; selecting an action according to a greedy method, and recording the action as A; Taking an action A at the X position, recording the obtained earnings r and the position X'transferred after the action A is taken, if X 'is equal to the target position xtf, assigning the initial position x0 to X, and otherwise assigning the X'to X; Updating the Q table; All positions adjacent to the position X are marked as X1, X2,..., Xn, and a Q table is updated; And repeating the above steps until the Q table converges. According to the invention, the existing Q-use at present is relieved; And the problem that the Q table convergence time is long when the problem of the shortest path is solved by the leader is solved.

Description

technical field [0001] The invention relates to a method for solving the shortest path problem based on an improved Q-learning algorithm, and belongs to the technical field of reinforcement learning. Background technique [0002] The shortest path problem widely exists in navigation, urban planning, video games and other fields. Traditional solutions to shortest path problems generally rely on shortest path algorithms, such as Floyd's algorithm, Dijkstra's algorithm, or A * Algorithms and other search algorithms. In recent years, with the rise of reinforcement learning, the use of reinforcement learning in the shortest path problem has become more and more extensive. The Q-Learning algorithm is a classic reinforcement learning algorithm, which was first proposed in the early 1990s. The earliest Q-Learning is based on the Q table, that is, a table that records any action taken in any state. This value is generally recorded as Q(x,a), where x represents the state and a repr...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06N20/00
Inventor 魏小辉廖玮李旭波高天驰李龙
Owner NANJING UNIV OF AERONAUTICS & ASTRONAUTICS
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products