Joint User Pairing and Beamforming Design of Multi-STAR-RISs-Aided NOMA in the Indoor Environment via Multi-Agent Reinforcement Learning

被引:0
作者
Park, Yu Min [1 ]
Tun, Yan Kyaw [2 ]
Hong, Choong Seon [1 ]
机构
[1] Kyung Hee Univ, Dept Comp Sci & Engn, Yongin 17104, South Korea
[2] Aalborg Univ, Dept Elect Syst, AC Meyers Vaenge 15, DK-2450 Aalborg, Denmark
来源
PROCEEDINGS OF 2024 IEEE/IFIP NETWORK OPERATIONS AND MANAGEMENT SYMPOSIUM, NOMS 2024 | 2024年
基金
新加坡国家研究基金会;
关键词
STAR-RIS; NOMA network; indoor environment; reinforcement learning; multi-agent proximal policy optimization; MILLIMETER-WAVE-NOMA; ALLOCATION;
D O I
10.1109/NOMS59830.2024.10575611
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
To increase the quality of the 6G / B5G network, conventional cellular networks based on terrestrial base stations are geographically and economically restricted. Meanwhile, Non-Orthogonal Multiple Access (NOMA) allows multiple users to share the same resources, which improves the spectral efficiency of the system and has the advantage of supporting a larger number of users. Additionally, by intelligently manipulating the phase and amplitude of both the reflected and transmitted signals, Simultaneously Transmitting and Reflecting RISs (STAR-RISs) can achieve improved coverage, increased spectral efficiency, and enhanced communication reliability. However, STAR-RISs must simultaneously optimize the amplitude and phase shift corresponding to reflection and transmission, which makes existing terrestrial networks more complicated and is considered a major challenge. Motivated by the above, we study the joint user pairing for NOMA and the beamforming design of Multi-STAR-RISs in an indoor environment. Then, we formulate the optimization problem with the objective of maximizing the total throughput of mobile users (MUs) by jointly optimizing the decoding order, user pairing, active beamforming, and passive beamforming. However, the formulated problem is a mixed-integer non-linear programming (MINLP). To address this challenge, we first introduce the decoding order for NOMA networks. Next, we decompose the original problem into two subproblems, namely: 1) MU pairing and 2) Beamforming optimization under the optimal decoding order. For the first subproblem, we employ correlation-based K-means clustering to solve the user pairing problem. Then, to jointly deal with beamforming vector optimizations, we propose Multi-Agent Proximal Policy Optimization (MAPPO), which can make quick decisions in the given environment owing to its low complexity. Finally, simulation results prove that our proposed MAPPO algorithm is superior to Proximal Policy Optimization (PPO) and Advanced Actor-Critic (A2C) by a maximum of 1% and 6%, respectively. Furthermore, the proposed algorithm converges 1.5 times faster than the typical PPO algorithm.
引用
收藏
页数:8
相关论文
共 16 条
[1]   Reconfigurable-Intelligent-Surface-Assisted B5G/6G Wireless Communications: Challenges, Solution, and Future Opportunities [J].
Chen, Zhen ;
Chen, Gaojie ;
Tang, Jie ;
Zhang, Shun ;
So, Daniel K. C. ;
Dobre, Octavia A. A. ;
Wong, Kai-Kit ;
Chambers, Jonathon .
IEEE COMMUNICATIONS MAGAZINE, 2023, 61 (01) :16-22
[2]   Unsupervised Machine Learning-Based User Clustering in Millimeter-Wave-NOMA Systems [J].
Cui, Jingjing ;
Ding, Zhiguo ;
Fan, Pingzhi ;
Al-Dhahir, Naofal .
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2018, 17 (11) :7425-7440
[3]  
Hartigan J. A., 1979, Applied Statistics, V28, P100, DOI 10.2307/2346830
[4]  
Higuchi Kenichi, 2013, IEEE VEHICULAR TECHN
[5]   RESOURCE ALLOCATION FOR DOWNLINK NOMA SYSTEMS: KEY TECHNIQUES AND OPEN ISSUES [J].
Islam, S. M. Riazul ;
Zeng, Ming ;
Dobre, Octavia A. ;
Kwak, Kyung-Sup .
IEEE WIRELESS COMMUNICATIONS, 2018, 25 (02) :40-47
[6]   Simultaneous Transmitting and Reflecting-Reconfigurable Intelligent Surface in 6G: Design Guidelines and Future Perspectives [J].
Khalid, Waqas ;
Kaleem, Zeeshan ;
Ullah, Rehmat ;
Van Chien, Trinh ;
Noh, Song ;
Yu, Heejung .
IEEE NETWORK, 2023, 37 (05) :173-181
[7]   STAR: SIMULTANEOUS TRANSMISSION AND REFLECTION FOR 360° COVERAGE BY INTELLIGENT SURFACES [J].
Liu, Yuanwei ;
Mu, Xidong ;
Xu, Jiaqi ;
Schober, Robert ;
Hao, Yang ;
Poor, H. Vincent ;
Hanzo, Lajos .
IEEE WIRELESS COMMUNICATIONS, 2021, 28 (06) :102-109
[8]   Trajectory Optimization and Phase-Shift Design in IRS-Assisted UAV Network for Smart Railway [J].
Park, Yu Min ;
Tun, Yan Kyaw ;
Han, Zhu ;
Hong, Choong Seon .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2022, 71 (10) :11317-11321
[9]   Joint Resources and Phase-Shift Optimization of MEC-Enabled UAV in IRS-Assisted 6G THz Networks [J].
Park, Yu Min ;
Hassan, Sheikh Salman ;
Tun, Yan Kyaw ;
Han, Zhu ;
Hong, Choong Seon .
PROCEEDINGS OF THE IEEE/IFIP NETWORK OPERATIONS AND MANAGEMENT SYMPOSIUM 2022, 2022,
[10]  
Schulman J, 2017, Arxiv, DOI arXiv:1707.06347