Multi-Agent Reinforcement Learning Based Uplink OFDMA for IEEE 802.11ax Networks

被引:2
|
作者
Han, Mingqi [1 ]
Sun, Xinghua [1 ]
Zhan, Wen [1 ]
Gao, Yayu [2 ]
Jiang, Yuan [1 ]
机构
[1] Sun Yat Sen Univ, Sch Elect & Commun Engn, Shenzhen Campus, Shenzhen 518107, Peoples R China
[2] Huazhong Univ Sci & Technol, Sch Elect Informat & Commun, Wuhan 430074, Peoples R China
基金
中国国家自然科学基金;
关键词
Heuristic algorithms; Throughput; Uplink; Computational complexity; Sun; IEEE 802.11ax Standard; Optimization; Multiple access; multi-agent reinforcement learning; multi-objective reinforcement learning; mean-field reinforcement learning; DYNAMIC MULTICHANNEL ACCESS; MINIMIZATION; INFORMATION;
D O I
10.1109/TWC.2024.3355276
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In the IEEE 802.11ax Wireless Local Area Networks (WLANs), Orthogonal Frequency Division Multiple Access (OFDMA) has been applied to enable the high-throughput WLAN amendment. However, with the growth of the number of devices, it is difficult for the Access Point (AP) to schedule uplink transmissions, which calls for an efficient access mechanism in the OFDMA uplink system. Based on Multi-Agent Proximal Policy Optimization (MAPPO), we propose a Mean-Field Multi-Agent Proximal Policy Optimization (MFMAPPO) algorithm to improve the throughput and guarantee the fairness. Motivated by the Mean-Field games (MFGs) theory, a novel global state and action design are proposed to ensure the convergence of MFMAPPO in the massive access scenario. The Multi-Critic Single-Policy (MCSP) architecture is deployed in the proposed MFMAPPO so that each agent can learn the optimal channel access strategy to improve the throughput while satisfying fairness requirement. Extensive simulation experiments are performed to show that the MFMAPPO algorithm 1) has low computational complexity that increases linearly with respect to the number of stations 2) achieves nearly optimal throughput and fairness performance in the massive access scenario, 3) can adapt to various diverse and dynamic traffic conditions without retraining, as well as the traffic condition different from training traffic.
引用
收藏
页码:8868 / 8882
页数:15
相关论文
共 50 条
  • [31] Deep learning based adaptive modulation and coding for uplink multi-user SIMO transmissions in IEEE 802.11ax WLANs
    Elwekeil, Mohamed
    Wang, Taotao
    Zhang, Shengli
    WIRELESS NETWORKS, 2021, 27 (08) : 5217 - 5227
  • [32] Deep learning based adaptive modulation and coding for uplink multi-user SIMO transmissions in IEEE 802.11ax WLANs
    Mohamed Elwekeil
    Taotao Wang
    Shengli Zhang
    Wireless Networks, 2021, 27 : 5217 - 5227
  • [33] Distributed Multi-Agent Deep Q-Learning for Fast Roaming in IEEE 802.11ax Wi-Fi Systems
    Wang, Ting-Hui
    Shen, Li-Hsiang
    Feng, Kai-Ten
    2024 IEEE 21ST CONSUMER COMMUNICATIONS & NETWORKING CONFERENCE, CCNC, 2024, : 433 - 438
  • [34] Performance Evaluation of IEEE 802.11ax for Residential Networks
    Sandoval, Jorge
    Cespedes, Sandra
    2021 IEEE LATIN-AMERICAN CONFERENCE ON COMMUNICATIONS (LATINCOM 2021), 2021,
  • [35] Adaptive multi-user uplink resource allocation based on access delay analysis in IEEE 802.11ax
    Min Peng
    Qiqi Yin
    Kai Zhang
    Caihong Kai
    Wireless Networks, 2023, 29 : 1223 - 1235
  • [36] Symbol Timing Synchronization for Uplink Multi-User Transmission in IEEE 802.11ax WLAN
    Son, Youngwook
    Kim, Seongwon
    Byeon, Seongho
    Choi, Sunghyun
    IEEE ACCESS, 2018, 6 : 72962 - 72977
  • [37] Improving IEEE 802.11ax UORA Performance: Comparison of Reinforcement Learning and Heuristic Approaches
    Kosek-Szott, Katarzyna
    Szott, Szymon
    Dressler, Falko
    IEEE ACCESS, 2022, 10 : 120285 - 120295
  • [38] Adaptive multi-user uplink resource allocation based on access delay analysis in IEEE 802.11ax
    Peng, Min
    Yin, Qiqi
    Zhang, Kai
    Kai, Caihong
    WIRELESS NETWORKS, 2023, 29 (03) : 1223 - 1235
  • [39] Improving IEEE 802.11ax UORA Performance: Comparison of Reinforcement Learning and Heuristic Approaches
    Kosek-Szott, Katarzyna
    Szott, Szymon
    Dressler, Falko
    IEEE Access, 2022, 10 : 120285 - 120295
  • [40] Uplink Frame Transmission with Functions of Adaptive Triggering and Resource Allocation of OFDMA in Interfering IEEE 802.11ax Wireless LANs
    Takahashi, Ryoichi
    Tanigawa, Yosuke
    Tode, Hideki
    IEICE TRANSACTIONS ON COMMUNICATIONS, 2021, E104B (06) : 664 - 674