Multi-Agent Reinforcement Learning Based Uplink OFDMA for IEEE 802.11ax Networks

被引:2
|
作者
Han, Mingqi [1 ]
Sun, Xinghua [1 ]
Zhan, Wen [1 ]
Gao, Yayu [2 ]
Jiang, Yuan [1 ]
机构
[1] Sun Yat Sen Univ, Sch Elect & Commun Engn, Shenzhen Campus, Shenzhen 518107, Peoples R China
[2] Huazhong Univ Sci & Technol, Sch Elect Informat & Commun, Wuhan 430074, Peoples R China
基金
中国国家自然科学基金;
关键词
Heuristic algorithms; Throughput; Uplink; Computational complexity; Sun; IEEE 802.11ax Standard; Optimization; Multiple access; multi-agent reinforcement learning; multi-objective reinforcement learning; mean-field reinforcement learning; DYNAMIC MULTICHANNEL ACCESS; MINIMIZATION; INFORMATION;
D O I
10.1109/TWC.2024.3355276
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In the IEEE 802.11ax Wireless Local Area Networks (WLANs), Orthogonal Frequency Division Multiple Access (OFDMA) has been applied to enable the high-throughput WLAN amendment. However, with the growth of the number of devices, it is difficult for the Access Point (AP) to schedule uplink transmissions, which calls for an efficient access mechanism in the OFDMA uplink system. Based on Multi-Agent Proximal Policy Optimization (MAPPO), we propose a Mean-Field Multi-Agent Proximal Policy Optimization (MFMAPPO) algorithm to improve the throughput and guarantee the fairness. Motivated by the Mean-Field games (MFGs) theory, a novel global state and action design are proposed to ensure the convergence of MFMAPPO in the massive access scenario. The Multi-Critic Single-Policy (MCSP) architecture is deployed in the proposed MFMAPPO so that each agent can learn the optimal channel access strategy to improve the throughput while satisfying fairness requirement. Extensive simulation experiments are performed to show that the MFMAPPO algorithm 1) has low computational complexity that increases linearly with respect to the number of stations 2) achieves nearly optimal throughput and fairness performance in the massive access scenario, 3) can adapt to various diverse and dynamic traffic conditions without retraining, as well as the traffic condition different from training traffic.
引用
收藏
页码:8868 / 8882
页数:15
相关论文
共 50 条
  • [1] OFDMA Uplink Scheduling in IEEE 802.11ax Networks
    Bankov, Dmitry
    Didenko, Andre
    Khorov, Evgeny
    Lyakhov, Andrey
    2018 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2018,
  • [2] Performance Analysis of Uplink Multi-User OFDMA in IEEE 802.11ax
    Naik, Gaurang
    Bhattarai, Sudeep
    Park, Jung-Min
    2018 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2018,
  • [3] Joint Optimization on Uplink OFDMA and MU-MIMO for IEEE 802.11ax: Deep Hierarchical Reinforcement Learning Approach
    Noh, Hyeonho
    Lee, Harim
    Yang, Hyun Jong
    IEEE COMMUNICATIONS LETTERS, 2024, 28 (08) : 1800 - 1804
  • [4] NOMA-based Uplink OFDMA Collision Reduction in 802.11ax Networks
    Lee, Won-Jae
    Shin, Wonjae
    Ruiz-de-Azua, Joan A.
    Fernandez Capon, Lara
    Park, Hyuk
    Kim, Jae-Hyun
    12TH INTERNATIONAL CONFERENCE ON ICT CONVERGENCE (ICTC 2021): BEYOND THE PANDEMIC ERA WITH ICT CONVERGENCE INNOVATION, 2021, : 212 - 214
  • [5] An Efficient Backoff Procedure for IEEE 802.11ax Uplink OFDMA-Based Random Access
    Kosek-Szott, Katarzyna
    Domino, Krzysztof
    IEEE ACCESS, 2022, 10 : 8855 - 8863
  • [6] Uplink Resource Allocation in IEEE 802.11ax
    Bhattarai, Sudeep
    Naik, Gaurang
    Park, Jung-Min
    ICC 2019 - 2019 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2019,
  • [7] Contention Window Optimization in IEEE 802.11ax Networks with Deep Reinforcement Learning
    Wydmanski, Witold
    Szott, Szymon
    2021 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2021,
  • [8] Throughput-maximizing OFDMA Scheduler for IEEE 802.11ax Networks
    Kuran, Mehmet Sukru
    Dilmac, A.
    Topal, Omer
    Yamansavascilar, Baris
    Avallone, Stefano
    Tugcu, Tuna
    2020 IEEE 31ST ANNUAL INTERNATIONAL SYMPOSIUM ON PERSONAL, INDOOR AND MOBILE RADIO COMMUNICATIONS (IEEE PIMRC), 2020,
  • [9] Distributed Convolutional Deep Reinforcement Learning based OFDMA MAC for 802.11ax
    Kotagiri, Dheeraj
    Nihei, Koichi
    Li, Tansheng
    IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2021), 2021,
  • [10] Transmission Delay-Based Uplink Multi-User Scheduling in IEEE 802.11ax Networks
    Kim, Yonggang
    Kim, Gyungmin
    Oh, Youngwoo
    Choi, Wooyeol
    APPLIED SCIENCES-BASEL, 2021, 11 (19):