Access Point Clustering in Cell-Free Massive MIMO Using Conventional and Federated Multi-Agent Reinforcement Learning

被引:15
作者
Banerjee, Bitan [1 ]
Elliott, Robert C. [1 ]
Krzymien, Witold A. [1 ]
Medra, Mostafa [2 ]
机构
[1] Univ Alberta, Dept Elect & Comp Engn, Edmonton, AB T6G 1H9, Canada
[2] Huawei Technol Canada Co Ltd, Ottawa, ON K2K 3J1, Canada
来源
IEEE TRANSACTIONS ON MACHINE LEARNING IN COMMUNICATIONS AND NETWORKING | 2023年 / 1卷
基金
加拿大自然科学与工程研究理事会;
关键词
Massive MIMO; Clustering algorithms; Antenna arrays; Reinforcement learning; Matching pursuit algorithms; Antennas; Precoding; Access point clustering; cell-free massive MIMO; centralized critic; decentralized actors; federated reinforcement learning; multi-agent reinforcement learning; user association; COORDINATED MULTIPOINT TRANSMISSION; DISTRIBUTED ANTENNA SYSTEMS; LARGE-SCALE MIMO; C-RAN; NETWORKS; COMMUNICATION; PERFORMANCE; CHALLENGES; ALLOCATION; RECEPTION;
D O I
10.1109/TMLCN.2023.3283228
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Cell-free massive multiple-input multiple-output (MIMO) systems consist of geographically-distributed multi-antenna access points (APs) that form a virtual massive MIMO array. To make the network arbitrarily scalable in size, each user should be served by the best possible personalized user-centric cluster of nearby APs. Unfortunately, determining that cluster is a combinatorially-complex problem made even harder when the users are in motion. Therefore, in this work, we develop a multi-agent reinforcement learning (MARL) algorithm for AP selection and clustering. Each AP is an agent in the MARL algorithm and it is trained to near-optimally select for itself which users to serve. Conventional MARL algorithms require a centralized reward system to train the agents, and the agents' neural network weights tend to strongly depend on their locations during training. To counteract these problems, we also consider a federated MARL framework. Simulation results demonstrate both our conventional and federated MARL algorithms outperform existing published AP selection algorithms, and also provide performance comparable to the case of all APs serving all users. The results also show the conventional algorithm has somewhat superior performance in the environment it was trained in, but the federated algorithm transfers its learning to changed environments much better, with very little performance loss.
引用
收藏
页码:107 / 123
页数:17
相关论文
共 77 条
[1]   A Cooperation Strategy Based on Bargaining Game for Fair User-Centric Clustering in Cloud-RAN [J].
Abdelhakam, Mostafa M. ;
Elmesalawy, Mahmoud M. ;
Mahmoud, Korany R. ;
Ibrahim, Ibrahim. I. .
IEEE COMMUNICATIONS LETTERS, 2018, 22 (07) :1454-1457
[2]   Multiple Access in Cell-Free Networks: Outage Performance, Dynamic Clustering, and Deep Reinforcement Learning-Based Design [J].
Al-Eryani, Yasser ;
Akrout, Mohamed ;
Hossain, Ekram .
IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2021, 39 (04) :1028-1042
[3]   A Federated Reinforcement Learning Framework for Incumbent Technologies in Beyond 5G Networks [J].
Ali, Rashid ;
Bin Zikria, Yousaf ;
Garg, Sahil ;
Bashir, Ali Kashif ;
Obaidat, Mohammad S. ;
Kim, Hyung Seok .
IEEE NETWORK, 2021, 35 (04) :152-159
[4]   User-Centric Cell-Free Massive MIMO Networks: A Survey of Opportunities, Challenges and Solutions [J].
Ammar, Hussein A. ;
Adve, Raviraj ;
Shahbazpanahi, Shahram ;
Boudreau, Gary ;
Srinivas, Kothapalli Venkata .
IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2022, 24 (01) :611-652
[5]  
Banerjee Bitan, 2022, 2022 IEEE Globecom Workshops (GC Wkshps), P1579, DOI 10.1109/GCWkshps56602.2022.10008644
[6]   Access Point Clustering in Cell-Free Massive MIMO Using Multi-Agent Reinforcement Learning [J].
Banerjee, Bitan ;
Elliott, Robert C. ;
Krzymien, Witold A. ;
Farmanbar, Hamid .
2022 IEEE 33RD ANNUAL INTERNATIONAL SYMPOSIUM ON PERSONAL, INDOOR AND MOBILE RADIO COMMUNICATIONS (IEEE PIMRC), 2022, :1086-1092
[7]  
Biswas S, 2021, 2021 SIXTH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, SIGNAL PROCESSING AND NETWORKING (WISPNET), P158, DOI [10.1109/WISPNET51692.2021.9419450, 10.1109/WiSPNET51692.2021.9419450]
[8]   Scalable Cell-Free Massive MIMO Systems [J].
Bjornson, Emil ;
Sanguinetti, Luca .
IEEE TRANSACTIONS ON COMMUNICATIONS, 2020, 68 (07) :4247-4261
[9]   Making Cell-Free Massive MIMO Competitive With MMSE Processing and Centralized Implementation [J].
Bjornson, Emil ;
Sanguinetti, Luca .
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2020, 19 (01) :77-90
[10]   Massive MIMO networks: Spectral, energy, and hardware efficiency [J].
Björnson E. ;
Hoydis J. ;
Sanguinetti L. .
Foundations and Trends in Signal Processing, 2017, 11 (3-4) :154-655