Deep Reinforcement Learning-Based Sum Rate Fairness Trade-Off for Cell-Free mMIMO

被引:3
作者
Rahmani, Mostafa [1 ,2 ]
Bashar, Manijeh [2 ]
Dehghani, Mohammad Javad [1 ]
Akbari, Ali [3 ]
Xiao, Pei [2 ]
Tafazolli, Rahim [2 ]
Debbah, Merouane [4 ,5 ]
机构
[1] Shiraz Univ Technol, Dept Elect & Elect Engn, 71946-84334, Shiraz, Iran
[2] Univ Surrey, Inst Commun Syst ICS, 5GIC & 6GIC, Guildford GU2 7XH, England
[3] Univ Surrey, Ctr Vis Speech & Signal Proc CVSSP, Guildford GU2 7XH, England
[4] Technol Innovat Inst, 9639, Abu Dhabi, U Arab Emirates
[5] Univ Paris Saclay, CentraleSupelec, F-91192 Gif Sur Yvette, France
基金
英国工程与自然科学研究理事会;
关键词
Signal to noise ratio; Interference; Resource management; Optimization; Power control; Heuristic algorithms; Antennas; Cell-free massive MIMO; deep reinforcement learning; fairness; power control; sequential convex approximation; FREE MASSIVE MIMO; POWER-CONTROL; NETWORKS; ALLOCATION; OPTIMIZATION;
D O I
10.1109/TVT.2022.3230041
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The uplink of a cell-free massive multiple-input multiple-output with maximum-ratio combining (MRC) and zero-forcing (ZF) schemes are investigated. A power allocation optimization problem is considered, where two conflicting metrics, namely the sum rate and fairness, are jointly optimized. As there is no closed-form expression for the achievable rate in terms of the large scale-fading (LSF) components, the sum rate fairness trade-off optimization problem cannot be solved by using known convex optimization methods. To alleviate this problem, we propose two new approaches. For the first approach, a use-and-then-forget scheme is utilized to derive a closed-form expression for the achievable rate. Then, the fairness optimization problem is iteratively solved through the proposed sequential convex approximation (SCA) scheme. For the second approach, we exploit LSF coefficients as inputs of a twin delayed deep deterministic policy gradient (TD3), which efficiently solves the non-convex sum rate fairness trade-off optimization problem. Next, the complexity and convergence properties of the proposed schemes are analyzed. Numerical results demonstrate the superiority of the proposed approaches over conventional power control algorithms in terms of the sum rate and minimum user rate for both the ZF and MRC receivers. Moreover, the proposed TD3-based power control achieves better performance than the proposed SCA-based approach as well as the fractional power scheme.
引用
收藏
页码:6039 / 6055
页数:17
相关论文
共 38 条
  • [1] Resource Allocation in Uplink NOMA-IoT Networks: A Reinforcement-Learning Approach
    Ahsan, Waleed
    Yi, Wenqiang
    Qin, Zhijin
    Liu, Yuanwei
    Nallanathan, Arumugam
    [J]. IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2021, 20 (08) : 5083 - 5098
  • [2] Multiple Access in Cell-Free Networks: Outage Performance, Dynamic Clustering, and Deep Reinforcement Learning-Based Design
    Al-Eryani, Yasser
    Akrout, Mohamed
    Hossain, Ekram
    [J]. IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2021, 39 (04) : 1028 - 1042
  • [3] Exploiting Deep Learning in Limited-Fronthaul Cell-Free Massive MIMO Uplink
    Bashar, Manijeh
    Akbari, Ali
    Cumanan, Kanapathippillai
    Ngo, Hien Quoc
    Burr, Alister G.
    Xiao, Pei
    Debbah, Merouane
    Kittler, Josef
    [J]. IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2020, 38 (08) : 1678 - 1697
  • [4] Bashar M, 2018, CONF REC ASILOMAR C, P624, DOI 10.1109/ACSSC.2018.8645433
  • [5] A sequential parametric convex approximation method with applications to nonconvex truss topology design problems
    Beck, Amir
    Ben-Tal, Aharon
    Tetruashvili, Luba
    [J]. JOURNAL OF GLOBAL OPTIMIZATION, 2010, 47 (01) : 29 - 51
  • [6] Two Applications of Deep Learning in the Physical Layer of Communication Systems
    Bjornson, Emil
    Giselsson, Pontus
    [J]. IEEE SIGNAL PROCESSING MAGAZINE, 2020, 37 (05) : 134 - 140
  • [7] Björnson E, 2017, FOUND TRENDS SIGNAL, V11, P154, DOI 10.1561/2000000093
  • [8] Boyd S., 2004, CONVEX OPTIMIZATION, DOI 10.1017/CBO9780511804441
  • [9] Demir OT, 2023, Arxiv, DOI arXiv:2108.02541
  • [10] Fredj F, 2021, Arxiv, DOI arXiv:2006.15138