Deep Reinforcement Learning Assisted UAV Trajectory and Resource Optimization for NOMA Networks

被引:2
作者
Chen, Peixin [1 ]
Zhao, Jian [1 ]
Shen, Furao [2 ,3 ]
机构
[1] Nanjing Univ, Sch Elect Sci & Engn, Nanjing 210023, Peoples R China
[2] Nanjing Univ, State Key Lab Novel Software Technol, Nanjing 210023, Peoples R China
[3] Nanjing Univ, Sch Artificial Intelligence, Nanjing 210023, Peoples R China
来源
2022 14TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING, WCSP | 2022年
基金
美国国家科学基金会; 国家重点研发计划;
关键词
UAV communication; deep reinforcement learning; NOMA; dynamic environment; COMMUNICATION;
D O I
10.1109/WCSP55476.2022.10039197
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Unmanned aerial vehicles (UAVs) are widely used as aerial base stations (BSs) to provide wireless communication services. In this paper, we consider the UAV's trajectory and power allocation design for downlink communication rate maximization in a UAV-enabled network in disaster areas or the cell fringe. Non-orthogonal multiple access (NOMA) is used to improve the spectrum efficiency of the entire network, while all users are roaming around randomly. The formulated problem is non-convex and the considered environment is dynamic. Such a problem is difficult to be solved via conventional optimization methods. Therefore, we propose a soft actor-critic (SAC) learning scheme to tackle the pertinent problem. Simulation results show that our proposed learning framework is more stable and has a faster convergence rate compared to baseline approaches.
引用
收藏
页码:933 / 938
页数:6
相关论文
共 15 条
[1]   Optimal LAP Altitude for Maximum Coverage [J].
Al-Hourani, Akram ;
Kandeepan, Sithamparanathan ;
Lardner, Simon .
IEEE WIRELESS COMMUNICATIONS LETTERS, 2014, 3 (06) :569-572
[2]   On the Application of Quasi-Degradation to MISO-NOMA Downlink [J].
Chen, Zhiyong ;
Ding, Zhiguo ;
Dai, Xuchu ;
Karagiannidis, George K. .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2016, 64 (23) :6174-6189
[3]   A General MIMO Framework for NOMA Downlink and Uplink Transmission Based on Signal Alignment [J].
Ding, Zhiguo ;
Schober, Robert ;
Poor, H. Vincent .
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2016, 15 (06) :4438-4454
[4]   The Application of MIMO to Non-Orthogonal Multiple Access [J].
Ding, Zhiguo ;
Adachi, Fumiyuki ;
Poor, H. Vincent .
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2016, 15 (01) :537-552
[5]   Energy-Efficient UAV Control for Effective and Fair Communication Coverage: A Deep Reinforcement Learning Approach [J].
Liu, Chi Harold ;
Chen, Zheyu ;
Tang, Jian ;
Xu, Jie ;
Piao, Chengzhe .
IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2018, 36 (09) :2059-2070
[6]   Cooperative NOMA Broadcasting/Multicasting for Low-Latency and High-Reliability 5G Cellular V2X Communications [J].
Liu, Gang ;
Wang, Zhiqing ;
Hu, Jiewen ;
Ding, Zhiguo ;
Fan, Pingzhi .
IEEE INTERNET OF THINGS JOURNAL, 2019, 6 (05) :7828-7838
[7]   UAV COMMUNICATIONS BASED ON NON-ORTHOGONAL MULTIPLE ACCESS [J].
Liu, Yuanwei ;
Qin, Zhijin ;
Cai, Yunlong ;
Gao, Yue ;
Li, Geoffrey Ye ;
Nallanathan, Arumugam .
IEEE WIRELESS COMMUNICATIONS, 2019, 26 (01) :52-57
[8]   On the Performance of Opportunistic NOMA in Downlink CoMP Networks [J].
Tian, Yue ;
Nix, Andrew R. ;
Beach, Mark .
IEEE COMMUNICATIONS LETTERS, 2016, 20 (05) :998-1001
[9]   Spectrum Sharing Planning for Full-Duplex UAV Relaying Systems With Underlaid D2D Communications [J].
Wang, Haichao ;
Wang, Jinlong ;
Ding, Guoru ;
Chen, Jin ;
Li, Yuzhou ;
Han, Zhu .
IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2018, 36 (09) :1986-1999
[10]   Energy Tradeoff in Ground-to-UAV Communication via Trajectory Design [J].
Yang, Dingcheng ;
Wu, Qingqing ;
Zeng, Yong ;
Zhang, Rui .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2018, 67 (07) :6721-6726