Network selection method based on Q-learning algorithm

A learning algorithm and network selection technology, applied in access restriction, electrical components, wireless communication, etc., can solve the problems of poor adaptability to the network environment, inability to dynamically select access networks, etc., and reduce voice blocking rate and data packet loss rate , the effect of increasing the average throughput

Active Publication Date: 2018-02-13
NANJING NARI GROUP CORP +1
View PDF3 Cites 17 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Although the multi-attribute decision-making method can consider network parameters more comprehensively, because it needs to determine the network weight in advance based on experience or expert judgment, this will lead to poor adaptability of this type of network selection algorithm to the network environment. The environment cannot dynamically select the appropriate access network

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Network selection method based on Q-learning algorithm
  • Network selection method based on Q-learning algorithm
  • Network selection method based on Q-learning algorithm

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0059] In order to make the technical means, creative features, goals and effects achieved by the present invention easy to understand, the present invention will be further described below in conjunction with specific embodiments.

[0060] The invention combines the Q-learning algorithm with the selection problem of the power wireless communication network, constructs a Q-value function step by step through iteration according to the current network load status and business type, uses the Q-learning algorithm to find the optimal strategy, and selects the best access network. The advantage of this algorithm is that it can adapt to dynamically changing network selection problems; in addition, the optimal strategy is set as a function of the network load state, and through the online learning ability of the Q-learning algorithm, the algorithm can effectively improve network throughput and reduce business blocking rate. Improve network selection performance.

[0061] 1. System mo...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a network selection method based on a Q-learning algorithm. The network selection method includes the steps of (1) initializing a Q value table and setting a discount factor gamma and a learning rate alpha; (2) determining a business type k and load rates of two current networks when a set time is up so as to obtain a current state sn; (3) selecting a useable action from anaction set A and recording such action as well as next network state sn+1; (4) computing an immediate return function r according to the network state after the selected action is implemented; (5), updating a Q value function Qn (s, a), and gradually decreasing the learning rate alpha to 0 according to rules of an inverse proportional function; (6), repeating the steps (2)-(5) until Q values areconverged, in other words, a difference value of the Q values before and after updating is smaller than a threshold value; (7) returning to the step (3) to select the action and accessing to an optimal network. The network selection method based on the Q-learning algorithm is capable of decreasing a voice business block rate and a data service packet loss rate and increasing average network throughout.

Description

technical field [0001] The invention relates to a network selection method based on a Q learning algorithm, and belongs to the technical field of electric power wireless communication. Background technique [0002] At present, with the development of the power industry, the scale of the power grid is gradually expanding, and the network topology is becoming more and more complex. The current power communication backbone network based on optical fiber communication can no longer meet the needs of various services. For long-distance access nodes, directly laying optical fiber lines is expensive and lacks practical value; under sudden disasters, it is difficult to repair fiber optic line failures in time; Substation commissioning. In these fields where wired communication cannot play a key role, the application of wireless communication network technology and its networking system can provide high-quality communication guarantee. [0003] Therefore, wireless communication tec...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): H04W48/18H04W24/06
CPCH04W24/06H04W48/18
Inventor 李洋冯宝刘金锁赵高峰张立武蔡世龙刘文贵完颜绍澎卞宇翔马涛丁晨阳胡阳蒯本链
Owner NANJING NARI GROUP CORP
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products