Distributed reinforcement learning stable topology generation method based on adaptive boundary

A technology of reinforcement learning and topology generation, applied in the field of communication, can solve problems such as long execution time, no consideration of the comprehensive impact of links, network communication congestion, etc., to improve network communication quality, enhance link connection time, and avoid network overhead. Effect

Active Publication Date: 2020-02-04
XIAN UNIV OF POSTS & TELECOMM
View PDF5 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The existing methods are divided into the following aspects: 1.) Predict the stability degree and network topology of the link connection in the network through the mobility characteristics of the node, and predict the trajectory of the node based on the adaptive neuro-fuzzy system to select the link The node transmits, but a large amount of control information generated between nodes in the prediction process causes excessive energy consumption and large computing overhead; 2.) Collect the received signal strength of the node, and perform deep learning training on it to predict the node's Movement, to build a stable link connection according to the movement trajectory, only considering the relative movement characteristics of the nodes in the process of predicting the position can not reflect the changes of the movement characteristics of the nodes in time, and the collected data only use the movement parameters of a certain period can not reflect well The current movement characteristics of the node; 3.) The method of selecting a stable path according to the received signal strength, dividing the link into two types of strong connection and weak connection according to the average value of the received signal strength of the node within a period of time, and setting the threshold to select a certain threshold However, this method does not consider the comprehensive influence of other factors on the link
[0004] In the information collection process of the existing method, when there are many mobile nodes, there will be disadvantages such as network communication blockage, large amount of node calculation, and high energy consumption of nodes. In the path stability determination link, the information cannot be transmitted in time or the node transmission information is lost due to the unsmooth information communication, resulting in the distributed MANET topology not being able to efficiently predict the link stability, or the link prediction can be made but the stability is not Guaranteed, poor reliability, and long method execution time
[0005] The above defects limit the performance of MANET, resulting in increased energy consumption, shortened life cycle and increased network delay, thus affecting the link stability prediction method in MANET Applications

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Distributed reinforcement learning stable topology generation method based on adaptive boundary
  • Distributed reinforcement learning stable topology generation method based on adaptive boundary
  • Distributed reinforcement learning stable topology generation method based on adaptive boundary

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0035] The mobile ad hoc network plays an important role in the communication network without infrastructure. The network has no infrastructure support. Each mobile node has both router and host functions, and can form any network topology through wireless connection. Mobile ad hoc networks have broad application prospects in military communications, mobile networks, connecting personal area networks, emergency services and disaster recovery, and wireless sensor networks. Therefore, the mobile ad hoc network has also become one of the current research hotspots. The mobility of nodes in the mobile ad hoc network causes the network topology formed by the entire wireless channel to change at any time. In order to effectively reduce the impact of dynamic topology changes, the existing methods use the mobility of nodes to predict the link connection in the network. The degree of stability and network topology to reduce the impact of dynamic topology changes. However, the existing ...

Embodiment 2

[0055] The distributed reinforcement learning stable topology generation method based on adaptive boundaries is the same as that in Embodiment 1, and the reinforcement learning method described in step 4 of the present invention, the specific implementation process includes the following steps:

[0056] Step 4.1 Determine the overall structure of the reinforcement learning method: In the interval [a,b], each node in the mobile ad hoc network is regarded as an agent agent, and the dynamic changes of MANET can be regarded as a distributed multi-agent collaborative system . For each distributed agent Agent, it is assumed that its environment state set is S, the action set is A, and the reward function is The action selection strategy is π(s i ,a j ).

[0057] The present invention builds a reinforcement learning model in the mobile self-organizing network, regards the network as a multi-agent cooperative system, effectively combines the scene of the mobile self-organizing net...

Embodiment 3

[0079] The distributed reinforcement learning stable topology generation method based on adaptive boundaries is the same as that in Embodiment 1-2, and the adaptive interval boundary update formula in step 6 of the present invention is as follows:

[0080]

[0081] In the formula: a is the upper boundary of the interval; b is the lower boundary of the interval; RSSI is the received signal strength indicator value of the neighbor node; s' is the actual connection variable state of the node and the neighbor node at the next moment; For the prediction of the connection variable state between the node and the neighbor node at the next moment; in the present invention, adaptive_rate is set as the proportional coefficient of the adaptive boundary adjustment, that is, the ratio of the connection state prediction error times and the total number of predictions at the next transmission moment predicted by the current node . like When a0.1, adjust the adaptive boundary a=RSSI; if ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a distributed reinforcement learning stable topology generation method based on an adaptive boundary, and solves the problems of poor link node connection continuity and stability in routing. The method comprises the following steps of: constructing a node dynamic topology in the mobile ad hoc network; dividing an adaptive reinforcement learning interval and initializing aQ value table; processing the received signal intensity values by sections; performing reinforcement learning in the adaptive interval, updating the Q value by using an adaptive reward function, and judging the stability of the connection state; judging a direct decision interval state; self-adaptively updating the boundary of the self-adaptive interval; and generating a distributed adaptive stable connection topology. According to the method, the received signal strength value is combined with reinforcement learning, adaptive interval boundary updating is combined with adaptive reward function updating, a stable topology link in the dynamic topology change process is accurately achieved, node energy consumption is reduced, large network expenditure is avoided, the learning rate is high, and complexity is low. The method is used for mobile ad hoc network distributed topology generation.

Description

technical field [0001] The invention belongs to the field of communication technology, and relates to stable topology generation of mobile ad hoc networks, in particular to a self-adaptive boundary-based distributed reinforcement learning stable topology generation method for mobile ad hoc networks, which is used in mobile ad hoc distributed networks. Background technique [0002] As a special wireless mobile network, mobile ad hoc networks (MANET) are widely used in civilian applications due to the characteristics of no need to set up network facilities, fast deployment, free movement of network nodes and ability to communicate with each other in any way. and modern military communications. Mobile ad hoc network is a comprehensive technology combining multiple disciplines. How to build a safe, stable and reliable mobile ad hoc network is an aspect to be solved in the current communication field. The impact of mobile nodes on building a stable topology in mobile ad hoc netw...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): H04W40/24H04W52/02H04W84/18
CPCH04W40/24H04W52/0212H04W84/18Y02D30/70
Inventor 黄庆东石斌宇杜昭强
Owner XIAN UNIV OF POSTS & TELECOMM
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products