Fast routing decision algorithm based on Q learning and LSTM neural network

A neural network and decision-making algorithm technology, applied in the field of fast routing decision-making algorithm, can solve the problems of long training process and slow convergence, and achieve the effect of saving time and cost

Active Publication Date: 2018-10-16
NANJING UNIV OF POSTS & TELECOMM
View PDF4 Cites 41 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] In order to solve the deficiencies in the prior art, the purpose of the present invention is to provide a fast routing decision algorithm based on Q-learning and LSTM neural network, which solves the problems of slow convergence and long training process of traditional heuristic algorithms, and can save a lot of time and cost

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Fast routing decision algorithm based on Q learning and LSTM neural network
  • Fast routing decision algorithm based on Q learning and LSTM neural network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0029] The present invention will be further described below in conjunction with the accompanying drawings. The following examples are only used to illustrate the technical solution of the present invention more clearly, but not to limit the protection scope of the present invention.

[0030] This method uses reinforcement learning to continuously try in the environment, adjust the strategy according to the feedback information obtained from the trial, and finally generate an optimal strategy. According to this optimal strategy, the machine can know what action to perform in what state.

[0031] First, the state variables and action variables are selected to establish a Markov decision model, and then Q reinforcement learning is used to solve it. In order to establish an optimal routing policy model, it is necessary to consider and set more network state parameters as variables and constraints for routing optimization problems, such as link utilization, node hops, delay, packe...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a fast routing decision algorithm based on Q learning and a LSTM neural network. The algorithm is mainly divided into two stages including model training and dynamic routing decision; and the model training stage is mainly used for calculating an optimal or a relatively better path which satisfies a constraint condition according to different QoS requests through a heuristic algorithm. Then, input and corresponding output of the heuristic algorithm are combined to form a training set of a machine learning model, and the training set is taken as a target Q value of different routing to train a decision model. On the basis, when a controller receives a new QoS request, correspondingly, the machine learning model jointly takes the current network state and the constraint condition in the request as input of the model, corresponding Q value can be calculated quickly through the routing decision model in which LSTM and Q learning are combined, and prediction is completed and an optimal path can be output. Time for the process is greatly reduced in comparison with the heuristic algorithm, but the result is very similar.

Description

technical field [0001] The invention relates to a fast routing decision algorithm based on Q learning and LSTM neural network, belonging to the technical field of wireless communication. Background technique [0002] Traditional IP networks integrate control and forwarding in one device, while software-defined networking (Software Defined Networking, SDN) separates control and forwarding. The advantage of this structure is that on the one hand, developers can program the controller through the open northbound interface, which can quickly realize personalized control of the network and meet the different needs of the business on the network; on the other hand, the controller can And the standard OpenFlow protocol communicates with the switch at the data forwarding layer, which reduces the dependence on the underlying forwarding device and makes the deployment more flexible. The present invention mainly utilizes the characteristic that the network control plane and the data p...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): H04L12/751H04L12/725H04L45/02
CPCH04L45/08H04L45/302
Inventor 朱晓荣陈必康王树同韩嗣诚
Owner NANJING UNIV OF POSTS & TELECOMM
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products