Trajectory Design and Resource Allocation for Multi-UAV Networks: Deep Reinforcement Learning Approaches

被引:62
作者
Chang, Zheng [1 ,2 ]
Deng, Hengwei [1 ]
You, Li [3 ,4 ]
Min, Geyong [5 ]
Garg, Sahil [6 ]
Kaddoum, Georges [7 ]
机构
[1] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Chengdu 611731, Peoples R China
[2] Univ Jyvaskyla, Fac Informat Technol, Jyvaskyla, Keski Suomi, Finland
[3] Southeast Univ, Natl Mobile Commun Res Lab, Nanjing 210096, Peoples R China
[4] Purple Mt Labs, Nanjing 211100, Peoples R China
[5] Univ Exeter, Dept Comp Sci, Exeter EX4 4QF, England
[6] Acole Technol Super, Resilient Machine Learning Inst ReMI, Montreal, PQ H3C 1K3, Canada
[7] Ecole Technol Super, Elect Engn Dept, Montreal, PQ H3C 1K3, Canada
来源
IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING | 2023年 / 10卷 / 05期
基金
中国国家自然科学基金;
关键词
Resource management; Trajectory; Autonomous aerial vehicles; Communication systems; Reinforcement learning; Wireless networks; Throughput; Trajectory design; resource allocation; multi-agent reinforcement learning; deep learning; UAV; drone; COMMUNICATION; OPTIMIZATION; ALGORITHM; INTERNET; ALTITUDE;
D O I
10.1109/TNSE.2022.3171600
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
The future mobile communication system is expected to provide ubiquitous connectivity and unprecedented services over billions of devices. The unmanned aerial vehicle (UAV), which is prominent in its flexibility and low cost, emerges as a significant network entity to realize such ambitious targets. In this work, novel machine learning-based trajectory design and resource allocation schemes are presented for a multi-UAV communications system. In the considered system, the UAVs act as aerial Base Stations (BSs) and provide ubiquitous coverage. In particular, with the objective to maximize the system utility over all served users, a joint user association, power allocation and trajectory design problem is presented. To solve the problem caused by high dimensionality in state space, we first propose a machine learning-based strategic resource allocation algorithm which comprises of reinforcement learning and deep learning to design the optimal policy of all the UAVs. Then, we also present a multi-agent deep reinforcement learning scheme for distributed implementation without knowing a priori knowledge of the dynamic nature of networks. Extensive simulation studies are conducted and illustrated to evaluate the advantages of the proposed scheme.
引用
收藏
页码:2940 / 2951
页数:12
相关论文
共 34 条
[1]   Optimal LAP Altitude for Maximum Coverage [J].
Al-Hourani, Akram ;
Kandeepan, Sithamparanathan ;
Lardner, Simon .
IEEE WIRELESS COMMUNICATIONS LETTERS, 2014, 3 (06) :569-572
[2]   An Autonomous Learning-Based Algorithm for Joint Channel and Power Level Selection by D2D Pairs in Heterogeneous Cellular Networks [J].
Asheralieva, Alia ;
Miyanaga, Yoshikazu .
IEEE TRANSACTIONS ON COMMUNICATIONS, 2016, 64 (09) :3996-4012
[3]   Efficient 3-D Placement of an Aerial Base Station in Next Generation Cellular Networks [J].
Bor-Yaliniz, R. Irem ;
El-Keyi, Amr ;
Yanikomeroglu, Haiti .
2016 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2016,
[4]   Dual-UAV-Enabled Secure Communications: Joint Trajectory Design and User Scheduling [J].
Cai, Yunlong ;
Cui, Fangyu ;
Shi, Qingjiang ;
Zhao, Minjian ;
Li, Geoffrey Ye .
IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2018, 36 (09) :1972-1985
[5]  
Chang Z., 2020, P IEEE INT C COMM WO, P1
[6]   Multi-Agent Deep Reinforcement Learning for Large-Scale Traffic Signal Control [J].
Chu, Tianshu ;
Wang, Jie ;
Codeca, Lara ;
Li, Zhaojian .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2020, 21 (03) :1086-1095
[7]   Learning to Coordinate with Deep Reinforcement Learning in Doubles Pong Game [J].
Diallo, Elhadji Amadou Oury ;
Sugiyama, Ayumi ;
Sugawara, Toshiharu .
2017 16TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2017, :14-19
[8]  
Foerster JN, 2016, ADV NEUR IN, V29
[9]  
Gupta Jayesh K., 2017, Autonomous Agents and Multiagent Systems, AAMAS 2017: Workshops, Best Papers. Revised Selected Papers: LNAI 10642, P66, DOI 10.1007/978-3-319-71682-4_5
[10]  
Hausknecht M., 2015, P ASS ADV ART INT FA, P1