Decentralized and Joint Resource Allocation, Beamforming, and Beamcombining for 5G Networks With Heterogeneous MARL

被引:0
作者
Al-Habashna, Ala'a [1 ,2 ]
Menard, Jon [1 ]
Wainer, Gabriel [1 ]
Boudreau, Gary [3 ]
机构
[1] Carleton Univ, Syst & Comp Engn, 1125 Colonel Dr, Ottawa, ON K1S 5B6, Canada
[2] Al Hussein Tech Univ, Sch Comp & Informat, Amman, Jordan
[3] Ericsson Canada, 349 Terry Fox Dr, Kanata, ON K2K 2V6, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Resource management; 5G mobile communication; Throughput; Interference; Array signal processing; Uplink; Training; Device-to-device communication; Computational complexity; Proposals; Deep reinforcement learning; multi-agent reinforcement learning; distributed resource allocation; beamforming; 5G; CHANNEL ESTIMATION; WIRELESS;
D O I
10.1109/ACCESS.2025.3576190
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we propose a novel Multi-Agent Reinforcement Learning (MARL) -based paradigm for distributed and joint resource allocation, beamforming (BF), and beam combining of uplink transmissions in 5G networks. The proposed paradigm employs two types of heterogenous agents that learn to perform and optimize different tasks in order to achieve the main objective of the system, as well as the objective of the individual agents. In the proposed paradigm, UEs can be multi-agents that optimize their own resource allocation and BF. In addition to these multi agents (i.e., UEs), the BS is a different type of agent that optimizes the combining of UEs' transmissions. We developed three different implementations of our proposal using three different MARL algorithms: Independent Q Learners (IQL), Multi-Agent Deep Deterministic Policy Gradient (MADDPG), and QTRAN. Various experiments were conducted to validate the usability of our proposal. Our results show that the proposed paradigm can successfully optimize the task of joint resource allocation, beamforming, and combining. Furthermore, we provide a comparative analysis of the three different implementations, highlighting noteworthy insights into the strengths and limitations of fully distributed algorithms, such as IQL, in comparison to algorithms employing the Centralized Training with Decentralized Execution (CTDE) framework, exemplified by QTRAN and MADDPG.
引用
收藏
页码:101491 / 101506
页数:16
相关论文
共 45 条
[1]  
Al-Habashna A, 2010, CONF REC ASILOMAR C, P160, DOI 10.1109/ACSSC.2010.5757489
[2]  
Al-Habashna A., 2016, P 49 ANN SIM S, P1
[3]  
Al-Habashna A., 2015, Canada Provisional Patent, Patent No. 47111
[4]  
Al-Habashna A., 2010, P IEEE GLOB TEL C GL, P1
[5]  
Al-Habashna A., 2016, P INT S PERF EV COMP, P1
[6]   QoE awareness in progressive caching and DASH-based D2D video streaming in cellular networks [J].
Al-Habashna, Ala'a ;
Wainer, Gabriel .
WIRELESS NETWORKS, 2020, 26 (03) :2051-2073
[7]   Improving Video Transmission in Cellular Networks with Cached and Segmented Video Download Algorithms [J].
Al-Habashna, Ala'a ;
Wainer, Gabriel .
MOBILE NETWORKS & APPLICATIONS, 2018, 23 (03) :543-559
[8]  
Al-Habashna A, 2017, 2017 INTERNATIONAL SYMPOSIUM ON PERFORMANCE EVALUATION OF COMPUTER AND TELECOMMUNICATION SYSTEMS (SPECTS)
[9]  
Al-Habashna A, 2011, IEEE ICC
[10]   Channel Estimation and Hybrid Precoding for Millimeter Wave Cellular Systems [J].
Alkhateeb, Ahmed ;
El Ayach, Omar ;
Leus, Geert ;
Heath, Robert W., Jr. .
IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2014, 8 (05) :831-846