Optimal Frequency Reuse and Power Control in Multi-UAV Wireless Networks: Hierarchical Multi-Agent Reinforcement Learning Perspective

被引:10
作者
Lee, Seungmin [1 ,2 ]
Lim, Suhyeon [1 ,2 ]
Chae, Seong Ho [3 ]
Jung, Bang Chul [4 ]
Park, Chan Yi [5 ]
Lee, Howon [1 ,2 ]
机构
[1] Hankyong Natl Univ, Sch Elect & Elect Engn, Anseong 17579, South Korea
[2] Hankyong Natl Univ, Inst IT Convergence IITC, Anseong 17579, South Korea
[3] Tech Univ Korea, Dept Elect Engn, Siheung Si 15073, South Korea
[4] Chungnam Natl Univ, Dept Elect Engn, Daejeon 34134, South Korea
[5] Agcy Def Dev, Daejeon 34186, South Korea
关键词
Frequency conversion; Computer architecture; Time-frequency analysis; Microprocessors; Wireless networks; Q-learning; Autonomous aerial vehicles; Unmanned aerial vehicle; optimal frequency reuse; transmit power control; energy efficiency; hierarchical multi-agent Q-learning; multi-UAV wireless network; COVERAGE; ACCESS;
D O I
10.1109/ACCESS.2022.3166179
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
To overcome the problems caused by the limited battery lifetime in multiple-unmanned aerial vehicle (UAV) wireless networks, we propose a hierarchical multi-agent reinforcement learning (RL) framework to maximize the energy efficiency (EE) of UAVs by finding the optimal frequency reuse factor and transmit power. The proposed algorithm consists of distributed inner-loop RL for transmit power control of the UAV terminal (UT) and centralized outer-loop RL for finding the optimal frequency reuse factor. Specifically, the proposed algorithm adjusts these two factors jointly to effectively mitigate intercell interference and reduce undesired transmit power consumption in multi-UAV wireless networks. We show that, for this reason, the proposed algorithm outperforms conventional algorithms, such as a random action algorithm with a fixed frequency reuse factor and a hierarchical multi-agent Q-learning algorithm with binary transmit power controls. Furthermore, even in the environment where UTs are continuously moving based on the mixed mobility model, we show that the proposed algorithm can find the best reward when compared to conventional algorithms.
引用
收藏
页码:39555 / 39565
页数:11
相关论文
共 19 条
[1]   Optimal LAP Altitude for Maximum Coverage [J].
Al-Hourani, Akram ;
Kandeepan, Sithamparanathan ;
Lardner, Simon .
IEEE WIRELESS COMMUNICATIONS LETTERS, 2014, 3 (06) :569-572
[2]  
[Anonymous], 2020, NEXT HYPERCONNECTED, P1
[3]  
[Anonymous], 2012, PROPAGATION DATA PRE, P1
[4]  
Flagship G., 2019, CISC VIS NETW IND GL, V1, P1
[5]   Interference Management for 4G Cellular Standards [J].
Himayat, Nageen ;
Talwar, Shilpa ;
Rao, Anil ;
Soni, Robert .
IEEE COMMUNICATIONS MAGAZINE, 2010, 48 (08) :86-92
[6]  
Hossain M., 2020, P IEEE VTC SPRING MA, P1
[7]   Multiagent Q-Learning-Based Multi-UAV Wireless Networks for Maximizing Energy Efficiency: Deployment and Power Control Strategy Design [J].
Lee, Seungmin ;
Yu, Heejung ;
Lee, Howon .
IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (09) :6434-6442
[8]   RE-ORA: Residual Energy-Aware Online Random Access for Improving the Lifetime of Slotted ALOHA-Based Swarming Drone Networks [J].
Lim, Suhyeon ;
Chae, Seong Ho ;
Lee, Howon .
IEEE ACCESS, 2021, 9 :45504-45511
[9]   Energy-Efficient UAV Control for Effective and Fair Communication Coverage: A Deep Reinforcement Learning Approach [J].
Liu, Chi Harold ;
Chen, Zheyu ;
Tang, Jian ;
Xu, Jie ;
Piao, Chengzhe .
IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2018, 36 (09) :2059-2070
[10]   Dynamic Multichannel Sensing in Cognitive Radio: Hierarchical Reinforcement Learning [J].
Liu, Shuai ;
Wu, Jiayun ;
He, Jing .
IEEE ACCESS, 2021, 9 :25473-25481