Network traffic scheduling method and system based on deep reinforcement learning

A technology of reinforcement learning and network traffic, applied in the network field, can solve problems such as link congestion, low traversal efficiency, waste of storage space, etc., and achieve a logical and clear effect

Active Publication Date: 2022-05-13
HOHAI UNIV
View PDF15 Cites 7 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Traditional research ideas use OSPF and other traditional algorithms for link selection in the control layer of the SDN architecture, but there are great limitations. The traditional algorithm only selects the link with the shortest number of hops, not the link with the smallest delay or the largest bandwidth. road, it is easy to cause link congestion
On this basis, Q-learning provides a good help for pathfinding. Q-learning based on Markov decision traverses all possible actions for each state, so as to select the optimal action. The limitation is that when the state and action When gradually increasing, the traversal efficiency becomes low and a lot of storage space is wasted

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Network traffic scheduling method and system based on deep reinforcement learning
  • Network traffic scheduling method and system based on deep reinforcement learning
  • Network traffic scheduling method and system based on deep reinforcement learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0037] The scheme of the present invention will be further described below in conjunction with the accompanying drawings and specific embodiments.

[0038] like figure 1 As shown, a network traffic scheduling method based on deep reinforcement learning provided by an embodiment of the present invention includes the following steps:

[0039] Step 1. Network information collection. The topology discovery module in the SDN controller obtains real-time network information for a period of time through the southbound protocol open flow, including port information, bandwidth information, and delay information. The controller collects these information for backup and accumulates information for subsequent information processing.

[0040] Step 2: The data processing module of the management layer processes the collected network information, and obtains status indicators of all links through calculation.

[0041] 1. Bandwidth index: For any link k∈M (M is the link set in the real-ti...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a network traffic scheduling method based on deep reinforcement learning, and aims to provide intelligent routing and explore, utilize and learn an optimal path when a routing decision is made by utilizing network state information. According to the method, all link state indexes in the whole network are calculated by collecting network information, a reward function related to bandwidth, time delay and packet loss rate is designed, path state information is explored and learned by adopting a double deep Q learning network DDQN, and an optimal path between each pair of source and destination nodes is obtained. According to the method, an optimal route is installed on a switch of a data layer in advance by utilizing a network global view provided by an SDN and interaction between a DRL intelligent agent and an environment; the invention also provides a traffic scheduling system based on the method, which adopts a four-layer SDN structure comprising a knowledge layer, a management layer, a control layer and a data layer, and realizes dynamic scheduling of network traffic along with an actual environment.

Description

technical field [0001] The invention relates to the field of network technology, in particular to a network traffic scheduling method and system based on deep reinforcement learning. Background technique [0002] With the rapid development of the Internet, people have higher requirements for WAN data transmission. On the one hand, greater bandwidth guarantee is required, and on the other hand, the reliability of data transmission has been raised. In this case, how to select links that meet the QoS requirements has become a current research hotspot. [0003] Based on the earlier statement that "innovation to promote the network requires programming on the hardware data path", that is, the dynamic network, the researchers' initial idea was to layer and separate data from control. With the research going on, the three-layer SDN (Software Defined Network) came into being. The decoupling of the underlying data is realized, and the data is separated from the control layer. Tra...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): H04L45/125H04L45/02H04L41/12H04L41/142G06N3/04G06N3/08
CPCH04L45/125H04L45/02H04L41/142H04L41/12G06N3/08G06N3/045
Inventor 江志远廖小平
Owner HOHAI UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products