Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

A Method of Controlling Routing Action Based on Multi-agent Reinforcement Learning Routing Policy

A reinforcement learning, multi-agent technology, applied in the information field, can solve the problem of reducing the average delivery time of data packets, and achieve the effect of reducing the average delivery time

Active Publication Date: 2022-02-25
SHENZHEN RES INST OF BIG DATA +1
View PDF8 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

These simulated network models ignore many important network characteristics, such as dynamically changing network loads and mobile users, so the routing choices made under these models often cannot minimize the average delivery time of data packets

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A Method of Controlling Routing Action Based on Multi-agent Reinforcement Learning Routing Policy
  • A Method of Controlling Routing Action Based on Multi-agent Reinforcement Learning Routing Policy
  • A Method of Controlling Routing Action Based on Multi-agent Reinforcement Learning Routing Policy

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0046] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be described in further detail below in conjunction with specific embodiments and with reference to the accompanying drawings.

[0047] In the present disclosure, it should be understood that terms such as "comprising" or "having" are intended to indicate the presence of features, numbers, steps, acts, components, parts or combinations thereof disclosed in the specification, and are not intended to exclude one or a plurality of other features, numbers, steps, acts, parts, parts or combinations thereof exist or are added.

[0048] In addition, it should be noted that, in the case of no conflict, the embodiments in the present disclosure and the features in the embodiments can be combined with each other. The present disclosure will be described in detail below with reference to the accompanying drawings and embodiments.

[0049] figure 1 A flowcha...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The present invention relates to the field of information technology, and discloses a method for controlling routing actions based on multi-agent reinforcement learning routing strategies, including: training reinforcement learning models, the reinforcement learning models update the decision values ​​of routing nodes using Q-learning algorithms, and combine The updated decision value uses a policy gradient algorithm to update policy parameters; according to the target node where the routing node forwards the data packet and the network load in the communication network where the routing node is located, the policy parameter is determined using the reinforcement learning model; according to the The policy parameter is used to determine the outgoing link of the routing node. The present invention aims at dynamically changing network connection modes and network loads, and routing nodes can adjust routing strategies in time, select the appropriate shortest path according to the target node of the data packet, and finally greatly reduce the average delivery time of the data packet.

Description

technical field [0001] The invention relates to the field of information technology, in particular to a method for controlling routing actions based on multi-agent reinforcement learning routing strategies. Background technique [0002] Packet routing in communication networks is an important application problem in sequential decision making. A communication network consists of a set of nodes and the links connecting these nodes. Data center networks and the Internet can be seen as real-world examples of communication networks. In a communication network, information is passed between nodes in the form of data packets. Routing is the decision-making process that guides how data packets pass through a series of intermediate nodes from the initial node to the target node. Usually, there are multiple paths for a data packet to choose from in the communication network, and the choice of the path usually determines the average delivery time of the data packet. [0003] At pres...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): H04L45/00H04L45/02H04L45/037H04L45/30
CPCH04L45/08H04L45/02H04L45/3065H04L45/38
Inventor 陈怿曾思亮许行飞
Owner SHENZHEN RES INST OF BIG DATA
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products