Reinforcement Learning of Beam Codebooks in Millimeter Wave and Terahertz MIMO Systems

被引:57
作者
Zhang, Yu [1 ]
Alrabeiah, Muhammad [1 ]
Alkhateeb, Ahmed [1 ]
机构
[1] Arizona State Univ ASU, Sch Elect Comp & Energy Engn, Tempe, AZ 85281 USA
基金
美国国家科学基金会;
关键词
Hardware; Array signal processing; Geometry; Base stations; Reinforcement learning; Radio frequency; Phase shifters; Beamforming codebook; millimeter wave (mmWave); terahertz (THz); reinforcement learning; site-specific; ARRAYS;
D O I
10.1109/TCOMM.2021.3126856
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Millimeter wave (mmWave) and terahertz MIMO systems rely on pre-defined beamforming codebooks for both initial access and data transmission. These pre-defined codebooks, however, are commonly not optimized for specific environments, user distributions, and/or possible hardware impairments. This leads to large codebook sizes with high beam training overhead which makes it hard for these systems to support highly mobile applications. To overcome these limitations, this paper develops a deep reinforcement learning framework that learns how to optimize the codebook beam patterns relying only on the receive power measurements. The developed model learns how to adapt the beam patterns based on the surrounding environment, user distribution, hardware impairments, and array geometry. Further, this approach does not require any knowledge about the channel, RF hardware, or user positions. To reduce the learning time, the proposed model designs a novel Wolpertinger-variant architecture that is capable of efficiently searching the large discrete action space. The proposed learning framework respects the RF hardware constraints such as the constant-modulus and quantized phase shifter constraints. Simulation results confirm the ability of the developed framework to learn near-optimal beam patterns for line-of-sight (LOS), non-LOS (NLOS), mixed LOS/NLOS scenarios and for arrays with hardware impairments without requiring any channel knowledge.
引用
收藏
页码:904 / 919
页数:16
相关论文
共 31 条
[1]   Contribution of the Zubair source rocks to the generation and expulsion of oil to the reservoirs of the Mesopotamian Basin, Southern Iraq [J].
Al-Khafaji, Amer Jassim ;
Sadooni, Fadhil ;
Hindi, Mohammed Hadi .
PETROLEUM SCIENCE AND TECHNOLOGY, 2019, 37 (08) :940-949
[2]   Frequency Selective Hybrid Precoding for Limited Feedback Millimeter Wave Systems [J].
Alkhateeb, Ahmed ;
Heath, Robert W., Jr. .
IEEE TRANSACTIONS ON COMMUNICATIONS, 2016, 64 (05) :1801-1818
[3]   Limited Feedback Hybrid Precoding for Multi-User Millimeter Wave Systems [J].
Alkhateeb, Ahmed ;
Leus, Geert ;
Heath, Robert W., Jr. .
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2015, 14 (11) :6481-6494
[4]   MIMO Precoding and Combining Solutions for Millimeter-Wave Systems [J].
Alkhateeb, Ahmed ;
Mo, Jianhua ;
Gonzalez-Prelcic, Nuria ;
Heath, Robert W., Jr. .
IEEE COMMUNICATIONS MAGAZINE, 2014, 52 (12) :122-131
[5]   Channel Estimation and Hybrid Precoding for Millimeter Wave Cellular Systems [J].
Alkhateeb, Ahmed ;
El Ayach, Omar ;
Leus, Geert ;
Heath, Robert W., Jr. .
IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2014, 8 (05) :831-846
[6]  
Alrabeiah M., 2020, IEEE Transactions on Communications
[7]  
Alrabeiah M, 2019, CONF REC ASILOMAR C, P1465, DOI [10.1109/IEEECONF44664.2019.9048929, 10.1109/ieeeconf44664.2019.9048929]
[8]  
Bishop C., 2006, Pattern Recognition and Machine Learning
[9]  
Dulac-Arnold Gabriel, 2015, Deep reinforcement learning in large discrete action spaces
[10]   Spatially Sparse Precoding in Millimeter Wave MIMO Systems [J].
El Ayach, Omar ;
Rajagopal, Sridhar ;
Abu-Surra, Shadi ;
Pi, Zhouyue ;
Heath, Robert W., Jr. .
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2014, 13 (03) :1499-1513