A hierarchical reinforcement learning anti-interference algorithm for joint optimization of channel selection and transmission time

A technology of transmission time and reinforcement learning, applied in the field of wireless communication, can solve the problems of not comprehensively considering the influence of anti-jamming communication performance, single anti-jamming influencing factor, and not considering the influence of various factors of anti-jamming quality.

Active Publication Date: 2019-05-10
ARMY ENG UNIV OF PLA
View PDF4 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

In the above literature, a better solution to the channel selection problem in single-user and multi-user scenarios is proposed, and the performance of the proposed algorithm is verified by simulation, but the actual communication is restricted by factors such as channel and transmission time. In the literature, only the influence of a single factor is considered
[0004] At present, in the field of anti-jamming, the influence of channel selection on anti-jamming performance is mainly studied, and the anti-jamming quality is not considered to be affected by various factors in actual communication. Impact

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A hierarchical reinforcement learning anti-interference algorithm for joint optimization of channel selection and transmission time
  • A hierarchical reinforcement learning anti-interference algorithm for joint optimization of channel selection and transmission time
  • A hierarchical reinforcement learning anti-interference algorithm for joint optimization of channel selection and transmission time

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0099] The first embodiment of the present invention is specifically described as follows. The system uses matlab software to simulate and verify the proposed joint optimization model of channel selection and transmission time and the layered reinforcement learning anti-jamming algorithm, and simultaneously analyzes the convergence of the proposed algorithm.

[0100] The wireless communication environment includes 1 channel of sweeping interference signal, 1 transmitter and 1 receiver, there are M=5 available channels, and N=5 optional transmission time lengths. The specific parameter settings related to hierarchical reinforcement learning are shown in Table 1.

[0101] Table 1 Simulation parameter settings

[0102]

[0103] Simulation result analysis:

[0104] Figure 6 Under different transmission time lengths, the channel selection algorithm based on reinforcement learning obtains optimized throughput performance curves. Through the simulation results, it can be found...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a hierarchical reinforcement learning anti-interference algorithm for joint optimization of channel selection and transmission time. The algorithm comprises a wireless communication network consisting of a transmitter, a receiver and an interference machine, wherein the interference machine generates an interference signal according to any one of a frequency sweeping mode,a comb mode and a random mode; On a data channel, a transmitter and a receiver carry out data communication in a dynamic spectrum access mode so as to withstand Communication interference to the transmitter / receiver caused by interference signals released by the jammer; And on the control channel, the transmitter and the receiver realize coordination of the dynamic spectrum of the transmitting andreceiving end through information interaction. In a dynamic interference scene, channel selection optimization based on fast reinforcement learning is carried out with a small time granularity, transmission time length optimization based on a random automatic learning machine is carried out with a large time granularity, and loop execution is carried out until the data transmission time length isconverged or reaches the maximum iteration frequency. The throughput of the wireless communication network system is improved.

Description

technical field [0001] The invention belongs to the technical field of wireless communication, in particular to a layered reinforcement learning anti-jamming algorithm for joint optimization of channel selection and transmission time. Background technique [0002] Based on cognitive radio theory, anti-jamming communication by means of dynamic spectrum access is a major research hotspot in the field of anti-jamming. However, in the actual anti-jamming communication process, the length of data transmission time has a non-negligible effect on the throughput of the anti-jamming system. When the transmission time length is much longer than the interference time length, it is easy to suffer from multiple interferences within one data transmission time, which will seriously reduce the quality of communication. Conversely, when the transmission time length is much smaller than the interference time length, it will cause the user to switch working channels frequently, consume more s...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): H04W72/08H04B17/382
Inventor 徐煜华孔利君郭秋菊徐以涛江汉
Owner ARMY ENG UNIV OF PLA
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products