Unlock instant, AI-driven research and patent intelligence for your innovation.

A Phase Optimization Method for Intelligent Reflective Surfaces Based on Deep Reinforcement Learning

A reflective surface, reinforcement learning technology, applied in the field of communication, can solve the problems of high computational complexity of SDR algorithm, and achieve the effect of stable training process, good convergence, and low implementation complexity

Active Publication Date: 2022-05-10
SOUTHEAST UNIV
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The purpose of the present invention is to solve the high computational complexity of the SDR algorithm and the sample acquisition problem of deep learning. The present invention provides an intelligent reflective surface optimization method based on deep reinforcement learning for the downlink transmission system of the base station using a large-scale uniform linear antenna array , the proposed algorithm can train the network model online according to the samples in the experience pool, saving sample storage space and phase optimization time

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A Phase Optimization Method for Intelligent Reflective Surfaces Based on Deep Reinforcement Learning
  • A Phase Optimization Method for Intelligent Reflective Surfaces Based on Deep Reinforcement Learning
  • A Phase Optimization Method for Intelligent Reflective Surfaces Based on Deep Reinforcement Learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0039] Below in conjunction with accompanying drawing and specific embodiment the present invention is described in further detail:

[0040] Such as figure 1 As shown, the present invention discloses a method for optimizing the phase of an intelligent reflective surface based on deep reinforcement learning, which specifically includes the following steps:

[0041] Step 1. The base station in the wireless communication system is configured with a uniform linear antenna array, the antenna array includes M antenna elements, and the smart reflective surface is configured with a uniform planar reflective unit, including the vertical direction N y Row reflection unit, N per row in the horizontal direction x A reflection unit, the user configures a single receiving antenna; the base station and the reflection unit know the channel state information of the user;

[0042] The channel state information includes: base station to user channel vector Channel matrix from base station to...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a method for optimizing the phase of an intelligent reflective surface based on deep reinforcement learning, which includes the following steps: initializing the action network, evaluation network, intelligent reflective surface phase bias matrix and experience pool in the intelligent reflective surface (intelligent body); according to User channel state information to obtain the initial state of the agent; store the experience pool through the interaction between the intelligent reflective surface and the wireless communication system; randomly sample from the experience pool to train the action network and evaluation network so that the evaluation value output by the evaluation network reaches the maximum, and then Obtain the network model parameters after convergence; output the optimal phase offset matrix coefficient of the smart reflective surface that maximizes the signal-to-noise ratio received by the user under the channel state information. The invention can effectively reduce the time required for optimizing the phase offset matrix and the storage space of training samples, and has better robustness.

Description

technical field [0001] The invention relates to the field of communication technology, in particular to a phase optimization method for an intelligent reflective surface based on deep reinforcement learning. Background technique [0002] In recent years, the birth of a number of key 5G technologies has significantly improved the spectral efficiency and capacity of wireless communication systems. However, in the actual deployment process, there are still practical problems such as high energy consumption, hardware implementation complexity, and signal processing algorithm complexity. With the development of radio frequency micro-electromechanical systems and metamaterials, the application of intelligent reflecting surfaces (Intelligent Reflecting Surface, IRS) with low energy consumption and adaptable to time-varying wireless communication systems becomes possible. IRS generally consists of a large number of passive printed dipole antenna elements, and each passive antenna c...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): H04B7/06G06N3/04G06N3/08
CPCH04B7/0617G06N3/08G06N3/045H04B7/04013
Inventor 李潇冯轲铭金石
Owner SOUTHEAST UNIV