Anti-interference wireless communication method based on deep reinforcement learning

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A technology of wireless communication and reinforcement learning, which is applied in the field of wireless communication, can solve problems such as difficulty in implementation, inability to select transmission power, slow running speed, etc., and achieve the goal of improving convergence speed and learning speed, improving memory utilization, and increasing learning rate Effect

Active Publication Date: 2020-05-19

GUANGZHOU UNIVERSITY

View PDF7 Cites 7 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

The disadvantages of this invention application are: 1), without knowing the specific frequency band attacked by the attacker, the communication efficiency of random frequency hopping technology cannot be guaranteed

2) The application for this invention is only applicable to communication scenarios where frequency hopping technology is used for frequency band selection, and channel adaptive selection of transmission power is not possible

The shortcoming that this scheme exists is: 1), it is difficult to regulate and optimize for the number n of extensions

The disadvantages of this solution are: 1) It is necessary to manually construct subsets of different frequency channels and transmission power combinations. Once the subset expands or the dimension increases, the amount of calculation will increase geometrically, making it difficult to apply to real-time communication applications; 2. ), the invention needs to iteratively calculate the metric value within the time range T, and the calculation process of the metric value is relatively complicated, and the running speed is very slow; 3), the invention uses a traditional reinforcement learning algorithm, which requires a large amount of memory for state value calculation Storage, when the dimension of the problem expands, the invention is difficult to implement

However, the frequency hopping method is difficult to select the optimal frequency band, and the traditional reinforcement learning method cannot quickly find the optimal strategy when the action set is large, and they can only be applied to the situation of constant attack

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment

[0027] The basic idea and principle of the present invention can be expressed as follows: the combination of reinforcement learning and deep neural network is adopted, so that the sending device can select the optimal sending power and communication frequency band in a short period of time. First, two convolutional neural networks are used, one for value function calculation and the other for action selection, which can effectively reduce the correlation between single-network action selection and value estimation; secondly, use Priority experience sampling enables the experience samples with higher priority to be sampled first. Since different samples improve learning efficiency differently, the priority method enables samples that contribute greatly to the improvement of learning efficiency to be sampled first, which further improves the The convergence speed of the utility function; again, a new forward action preservation algorithm is proposed, that is, the greater the valu...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention relates to a wireless communication technology, in particular to an anti-interference wireless communication method based on deep reinforcement learning. The method comprises the following steps: using two convolutional neural networks: one convolutional neural network calculates a value function, and the other convolutional neural network performs action selection based on a calculation result of the value function; adopting priority experience sampling in an experience playback stage, so that experience samples with higher priorities are sampled preferentially, updating parameters of the convolutional neural network based on the experience samples, and updating the priorities of all the experience samples through calculation of the updated convolutional neural network; adopting a forward action reservation strategy, designing a Gaussian-like function to judge the value of the current action, and dynamically adjusting and controlling the probability that the current action is continuously executed. According to the method, the optimal sending power and the optimal communication frequency band can be intelligently selected, the learning speed of the whole system is improved, and the optimal sending mode can be learned under the condition that a third-party attacker model is unknown.

Description

technical field [0001] The invention relates to wireless communication technology, in particular to an anti-jamming wireless communication method based on deep reinforcement learning. Background technique [0002] In wireless communication, the frequency band used by the sending device to send information to the receiving device is very vulnerable to third-party attacks, which greatly reduces communication efficiency. Among many attacks, blocking attack is the most important attack method. Jamming attackers attack multiple frequency bands at the same time with a certain power through frequency scanning and other methods. Since the frequency band under attack is unknown, the strategy of random frequency hopping to select the communication frequency band becomes very inefficient. Therefore, efficient intelligent frequency band selection technology becomes an urgent need. At the same time, most of the current sending devices use constant power to send signals, which will bec...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): H04W12/12H04W24/06H04W28/02H04W12/122

CPCH04W24/06H04W28/0221H04W28/0236H04W12/122Y02D30/70

Inventor 王员根叶培根李进王捍贫

Owner GUANGZHOU UNIVERSITY

Anti-interference wireless communication method based on deep reinforcement learning

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology