Decoupled Association With Rate Splitting Multiple Access in UAV-Assisted Cellular Networks Using Multi-Agent Deep Reinforcement Learning

被引:23
作者
Ji, Jiequ [1 ]
Cai, Lin [2 ]
Zhu, Kun [3 ]
Niyato, Dusit [4 ]
机构
[1] Soochow Univ, Coll Future Sci & Engn, Suzhou 215222, Peoples R China
[2] Univ Victoria, Dept Elect & Comp Engn, Victoria, BC V8W 3P6, Canada
[3] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing 210016, Peoples R China
[4] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore, Singapore
基金
中国国家自然科学基金;
关键词
Array signal processing; Uplink; Cellular networks; NOMA; Interference cancellation; Backhaul networks; Reinforcement learning; Decoupled multiple association; multi-agent deep reinforcement learning; rate splitting; UAV-assisted networks; USER ASSOCIATION; FULL-DUPLEX; OPTIMIZATION; PLACEMENT; DOWNLINK; DESIGN; UPLINK; NOMA; 5G;
D O I
10.1109/TMC.2023.3256404
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In unmanned aerial vehicles (UAVs) assisted cellular networks, user association plays an important role in interference control and spectrum efficiency. In this paper, we study the performance of uplink-downlink decoupled (UDDe) user association in a multi-UAV assisted network in which each user can associate with different UAVs or the macro base station (MBS) for uplink (UL) and downlink (DL) transmissions. Since some popular data may be requested by multiple users, grouping these users and applying multicasting can significantly improve spectral efficiency. Unlike traditional linear precoding that treats interference entirely as noise, we propose a rate-splitting multiple access (RSMA) policy that employs rate splitting at the transmitter and successive interference cancellation (SIC) at the receiver. To be specific, the transmitted signal is split into a common part and a private part, and the interference is partially decoded and partially treated as noise. In this context, we formulate a joint optimization problem of UL-DL association and beamforming for maximizing the sum-rate of users in UL and that of multicast groups in DL under the constraints of UAV backhaul capacity and power budget. Since the formulated problem is non-convex with intricate states and an individual UAV may not know the rewards of other UAVs, we convert it into a robust partially observable Markov decision process (POMDP). Then we resort to multi-agent deep reinforcement learning (MADRL) that enables each UAV to learn and optimize its policy in a distributed manner. To achieve an optimal policy, we further propose an improved clip and count-based proximal policy optimization (PPO) algorithm to train actor and critic networks. Simulation results demonstrate the superiority of the proposed decoupled association strategy with RSMA and the MADRL learning algorithm.
引用
收藏
页码:2186 / 2201
页数:16
相关论文
共 46 条
[1]   Rate Splitting Multiple Access in C-RAN: A Scalable and Robust Design [J].
Ahmad, Alaa Alameer ;
Mao, Yijie ;
Sezgin, Aydin ;
Clerckx, Bruno .
IEEE TRANSACTIONS ON COMMUNICATIONS, 2021, 69 (09) :5727-5743
[2]  
Akyildiz I.F., 2010, Elsevier Journal of Physical Communication, V3, P217, DOI DOI 10.1016/J.PHYCOM.2010.08.001
[3]  
Alameer A, 2016, INT SYM TURBO CODES, P375, DOI 10.1109/ISTC.2016.7593140
[4]  
[Anonymous], 2017, 3GPP TR 36.777
[5]   Why to Decouple the Uplink and Downlink in Cellular Networks and How To Do It [J].
Boccardi, Federico ;
Andrews, Jeffrey ;
Elshaer, Hisham ;
Dohler, Mischa ;
Parkvall, Stefan ;
Popovski, Petar ;
Singh, Sarabjot .
IEEE COMMUNICATIONS MAGAZINE, 2016, 54 (03) :110-117
[6]   Cooperative Caching and Transmission Design in Cluster-Centric Small Cell Networks [J].
Chen, Zheng ;
Lee, Jemin ;
Quek, Tony Q. S. ;
Kountouris, Marios .
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2017, 16 (05) :3401-3415
[7]   Rate-Splitting Unifying SDMA, OMA, NOMA, and Multicasting in MISO Broadcast Channel: A Simple Two-User Rate Analysis [J].
Clerckx, Bruno ;
Mao, Yijie ;
Schober, Robert ;
Poor, H. Vincent .
IEEE WIRELESS COMMUNICATIONS LETTERS, 2020, 9 (03) :349-353
[8]   Full-Duplex MIMO Relaying: Achievable Rates Under Limited Dynamic Range [J].
Day, Brian P. ;
Margetts, Adam R. ;
Bliss, Daniel W. ;
Schniter, Philip .
IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2012, 30 (08) :1541-1553
[9]   Experiment-Driven Characterization of Full-Duplex Wireless Systems [J].
Duarte, Melissa ;
Dick, Chris ;
Sabharwal, Ashutosh .
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2012, 11 (12) :4296-4307
[10]  
Duarte M, 2010, CONF REC ASILOMAR C, P1558, DOI 10.1109/ACSSC.2010.5757799