Millimeter wave communication beam training method based on deep reinforcement learning
A technology of beam training and reinforcement learning, which is applied in the field of millimeter wave wireless communication, can solve the problems of high training overhead, hardware complexity, and high power consumption, and achieve the effect of reducing overhead and hardware complexity
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0072] see Figure 1-Figure 5 , this embodiment provides a millimeter wave communication beam training method based on deep reinforcement learning, which specifically includes:
[0073] Consider a mmWave massive MIMO system for a single user with N r Antennas, N at the base station t The antennas are arranged in the form of a uniform linear array (Uniform Linear Array, ULA). According to the widely used Saleh-Valenzuela model, the downlink mmWave channel can be modeled as:
[0074]
[0075] Among them, L, α l , θ l Respectively represent the number of paths, the channel gain of the lth path, the angle of arrival of the channel and the angle of departure of the channel. Usually the path with l=1 is the LOS path, and the other paths are the NLOS paths. definition Θ l and Ψ l is the angle of arrival and angle of departure in the spatial domain, both of which obey the uniform distribution in [0, π]. d t and d r represent the distance between the array antennas at...
Embodiment 2
[0149] On the basis of Embodiment 1, this embodiment provides a millimeter-wave communication beam training device based on deep reinforcement learning, which includes:
[0150] Beam selection module, according to the action A performed at time t t The set of beam combinations used for testing at the next moment is obtained as where I denotes the total number of beamcombinations used for training, Indicates the i-th transmit and receive beam combination.
[0151] The channel sample generation module generates several time-varying channel matrices H according to the random change of the channel steering angle t , use beam scanning method to determine each channel matrix H t Corresponding optimal transmit and receive beam combination
[0152] Receive signal matrix module, use set B t+1 The beams in are tested sequentially to get each beam combination Corresponding received signal strength z t+1 , and the received signals corresponding to other untested beam combinati...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


