Deep Reinforcement Learning-Based Sum Rate Fairness Trade-Off for Cell-Free mMIMO

Cited by: 3

Authors:

Rahmani, Mostafa [1, 2]

Bashar, Manijeh [2]

Dehghani, Mohammad Javad [1]

Akbari, Ali [3]

Xiao, Pei [2]

Tafazolli, Rahim [2]

Debbah, Merouane [4, 5]

Affiliations:

[1] Shiraz Univ Technol, Dept Elect & Elect Engn, 71946-84334, Shiraz, Iran

[2] Univ Surrey, Inst Commun Syst ICS, 5GIC & 6GIC, Guildford GU2 7XH, England

[3] Univ Surrey, Ctr Vis Speech & Signal Proc CVSSP, Guildford GU2 7XH, England

[4] Technol Innovat Inst, 9639, Abu Dhabi, U Arab Emirates

[5] Univ Paris Saclay, CentraleSupelec, F-91192 Gif Sur Yvette, France

Source:

IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY | 2023, Vol. 72, No. 5

Funding:

UK Engineering and Physical Sciences Research Council (EPSRC)

Keywords:

Signal to noise ratio; Interference; Resource management; Optimization; Power control; Heuristic algorithms; Antennas; Cell-free massive MIMO; deep reinforcement learning; fairness; power control; sequential convex approximation; FREE MASSIVE MIMO; POWER-CONTROL; NETWORKS; ALLOCATION; OPTIMIZATION;

DOI:

10.1109/TVT.2022.3230041

Chinese Library Classification (CLC):

TM [Electrical engineering]; TN [Electronics and communication technology]

Discipline classification codes:

0808; 0809

Abstract:

The uplink of a cell-free massive multiple-input multiple-output (MIMO) system with maximum-ratio combining (MRC) and zero-forcing (ZF) schemes is investigated. A power allocation optimization problem is considered, where two conflicting metrics, namely the sum rate and fairness, are jointly optimized. As there is no closed-form expression for the achievable rate in terms of the large-scale fading (LSF) components, the sum rate fairness trade-off optimization problem cannot be solved by using known convex optimization methods. To address this problem, we propose two new approaches. In the first approach, a use-and-then-forget scheme is utilized to derive a closed-form expression for the achievable rate. Then, the fairness optimization problem is iteratively solved through the proposed sequential convex approximation (SCA) scheme. In the second approach, we exploit LSF coefficients as inputs of a twin delayed deep deterministic policy gradient (TD3) agent, which efficiently solves the non-convex sum rate fairness trade-off optimization problem. Next, the complexity and convergence properties of the proposed schemes are analyzed. Numerical results demonstrate the superiority of the proposed approaches over conventional power control algorithms in terms of the sum rate and minimum user rate for both the ZF and MRC receivers. Moreover, the proposed TD3-based power control achieves better performance than the proposed SCA-based approach as well as the fractional power scheme.
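The abstract does not state the exact form of the trade-off objective, so the following is only an illustrative sketch of how a sum rate and a fairness term might be combined into one scalar. The function names (`user_rates`, `tradeoff_objective`, `jain_index`), the `weight` parameter, the weighted-sum form, and the use of the minimum user rate and Jain's index as fairness measures are all assumptions made for illustration, not the authors' formulation.

```python
import math


def user_rates(sinrs):
    """Per-user spectral efficiency log2(1 + SINR), in bit/s/Hz."""
    return [math.log2(1.0 + s) for s in sinrs]


def tradeoff_objective(sinrs, weight=0.5):
    """Hypothetical scalarized objective (an assumption, not the paper's).

    weight = 1 rewards only the sum rate; weight = 0 rewards only the
    minimum user rate. The minimum rate is scaled by the number of users
    so that both terms have comparable magnitude.
    """
    rates = user_rates(sinrs)
    sum_rate = sum(rates)
    min_rate = min(rates)
    return weight * sum_rate + (1.0 - weight) * len(rates) * min_rate


def jain_index(rates):
    """Jain's fairness index: 1.0 when all rates are equal, 1/K at worst."""
    total = sum(rates)
    squares = sum(r * r for r in rates)
    return total * total / (len(rates) * squares)


if __name__ == "__main__":
    sinrs = [1.0, 3.0, 7.0]  # made-up SINR values for three users
    print(tradeoff_objective(sinrs, weight=0.5))
    print(jain_index(user_rates(sinrs)))
```

In a learning-based setup such as the TD3 approach mentioned above, a scalar of this kind could plausibly serve as the reward, with the power coefficients as the action; whether the paper does so is not stated in the abstract.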


Pages: 6039-6055

Page count: 17
