Intelligent reflection surface phase optimization method based on deep reinforcement learning

A reflective surface and reinforcement learning technology, applied in the field of communication, can solve problems such as the high computational complexity of the SDR algorithm

Active Publication Date: 2020-05-19
SOUTHEAST UNIV
View PDF5 Cites 39 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The purpose of the present invention is to solve the high computational complexity of the SDR algorithm and the sample acquisition problem of deep learning. The present invention provides an intelligent reflective surface optimization method based on deep reinforcement learning for the downlink transmission system of the base station using a large-scale uniform linear antenna array , the proposed algorithm can train the network model online according to the samples in the experience pool, saving sample storage space and phase optimization time

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Intelligent reflection surface phase optimization method based on deep reinforcement learning
  • Intelligent reflection surface phase optimization method based on deep reinforcement learning
  • Intelligent reflection surface phase optimization method based on deep reinforcement learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0039] Below in conjunction with accompanying drawing and specific embodiment the present invention is described in further detail:

[0040] Such as figure 1 As shown, the present invention discloses a method for optimizing the phase of an intelligent reflective surface based on deep reinforcement learning, which specifically includes the following steps:

[0041] Step 1. The base station in the wireless communication system is configured with a uniform linear antenna array, the antenna array includes M antenna elements, and the smart reflective surface is configured with a uniform planar reflective unit, including the vertical direction N y Row reflection unit, N per row in the horizontal direction x A reflection unit, the user configures a single receiving antenna; the base station and the reflection unit know the channel state information of the user;

[0042] The channel state information includes: base station to user channel vector Channel matrix from base station to...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an intelligent reflection surface phase optimization method based on deep reinforcement learning. The method comprises the following steps: initializing an action network, an evaluation network, an intelligent reflection surface phase offset matrix and an experience pool in an intelligent reflection surface (intelligent agent); acquiring an initial state of the agent according to the user channel state information; storing the experience pool through interaction between the intelligent reflection surface and the wireless communication system; randomly sampling from theexperience pool to train the action network and the evaluation network so as to maximize the evaluation value output by the evaluation network, and then obtaining network model parameters after convergence; and outputting the optimal phase offset matrix coefficient of the intelligent reflecting surface, which maximizes the receiving signal-to-noise ratio of the user under the channel state information. According to the method, the time required for optimizing the phase offset matrix can be effectively reduced, the storage space of the sample is trained, and the robustness is better.

Description

technical field [0001] The invention relates to the field of communication technology, in particular to a phase optimization method for an intelligent reflective surface based on deep reinforcement learning. Background technique [0002] In recent years, the birth of a number of key 5G technologies has significantly improved the spectral efficiency and capacity of wireless communication systems. However, in the actual deployment process, there are still practical problems such as high energy consumption, hardware implementation complexity, and signal processing algorithm complexity. With the development of radio frequency micro-electromechanical systems and metamaterials, the application of intelligent reflecting surfaces (Intelligent Reflecting Surface, IRS) with low energy consumption and adaptable to time-varying wireless communication systems becomes possible. IRS generally consists of a large number of passive printed dipole antenna elements, and each passive antenna c...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): H04B7/06G06N3/04G06N3/08
CPCH04B7/0617G06N3/08G06N3/045H04B7/04013
Inventor 李潇冯轲铭金石
Owner SOUTHEAST UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products