Deep Reinforcement Learning for Joint Trajectory Planning, Transmission Scheduling, and Access Control in UAV-Assisted Wireless Sensor Networks

被引:9
作者
Luo, Xiaoling [1 ,2 ]
Chen, Che [3 ,4 ]
Zeng, Chunnian [1 ]
Li, Chengtao [2 ]
Xu, Jing [5 ]
Gong, Shimin [4 ]
机构
[1] Wuhan Univ Technol, Sch Informat Engn, Wuhan 430070, Peoples R China
[2] China Three Gorges Corp, Wuhan 430010, Peoples R China
[3] Minnan Normal Univ, Sch Comp Sci, Zhangzhou 363000, Peoples R China
[4] Sun Yat Sen Univ, Sch Intelligent Syst Engn, Shenzhen 518107, Peoples R China
[5] Huazhong Univ Sci & Technol, Sch Elect Informat & Commun, Wuhan 430074, Peoples R China
关键词
UAV; multi-agent deep reinforcement learning; trajectory planning; access control; RESOURCE-ALLOCATION; DESIGN; COMMUNICATION; OPTIMIZATION; COVERAGE; INTERNET;
D O I
10.3390/s23104691
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Unmanned aerial vehicles (UAVs) can be used to relay sensing information and computational workloads from ground users (GUs) to a remote base station (RBS) for further processing. In this paper, we employ multiple UAVs to assist with the collection of sensing information in a terrestrial wireless sensor network. All of the information collected by the UAVs can be forwarded to the RBS. We aim to improve the energy efficiency for sensing-data collection and transmission by optimizing UAV trajectory, scheduling, and access-control strategies. Considering a time-slotted frame structure, UAV flight, sensing, and information-forwarding sub-slots are confined to each time slot. This motivates the trade-off study between UAV access-control and trajectory planning. More sensing data in one time slot will take up more UAV buffer space and require a longer transmission time for information forwarding. We solve this problem by a multi-agent deep reinforcement learning approach that takes into consideration a dynamic network environment with uncertain information about the GU spatial distribution and traffic demands. We further devise a hierarchical learning framework with reduced action and state spaces to improve the learning efficiency by exploiting the distributed structure of the UAV-assisted wireless sensor network. Simulation results show that UAV trajectory planning with access control can significantly improve UAV energy efficiency. The hierarchical learning method is more stable in learning and can also achieve higher sensing performance.
引用
收藏
页数:22
相关论文
共 37 条
[1]   Average Peak Age-of-Information Minimization in UAV-Assisted IoT Networks [J].
Abd-Elmagid, Mohamed A. ;
Dhillon, Harpreet S. .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2019, 68 (02) :2003-2008
[2]   Localization and Clustering Based on Swarm Intelligence in UAV Networks for Emergency Communications [J].
Arafat, Muhammad Yeasir ;
Moh, Sangman .
IEEE INTERNET OF THINGS JOURNAL, 2019, 6 (05) :8958-8976
[3]   UAV Deployment Exercise for Mapping Purposes: Evaluation of Emergency Response Applications [J].
Boccardo, Piero ;
Chiabrando, Filiberto ;
Dutto, Furio ;
Tonolo, Fabio Giulio ;
Lingua, Andrea .
SENSORS, 2015, 15 (07) :15717-15737
[4]   An Amateur Drone Surveillance System Based on the Cognitive Internet of Things [J].
Ding, Guoru ;
Wu, Qihui ;
Zhang, Linyuan ;
Lin, Yun ;
Tsiftsis, Theodoros A. ;
Yao, Yu-Dong .
IEEE COMMUNICATIONS MAGAZINE, 2018, 56 (01) :29-35
[5]  
Du Y.-N., 2018, Proceedings of the IEEE International Conference on Communications (ICC), P1, DOI 10.1109/GLOCOM.2018.8647789
[6]   Bayesian Optimization Enhanced Deep Reinforcement Learning for Trajectory Planning and Network Formation in Multi-UAV Networks [J].
Gong, Shimin ;
Wang, Meng ;
Gu, Bo ;
Zhang, Wenjie ;
Dinh Thai Hoang ;
Niyato, Dusit .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2023, 72 (08) :10933-10948
[7]   Hierarchical Deep Reinforcement Learning for Age-of-Information Minimization in IRS-Aided and Wireless-Powered Wireless Networks [J].
Gong, Shimin ;
Cui, Leiyang ;
Gu, Bo ;
Lyu, Bin ;
Dinh Thai Hoang ;
Niyato, Dusit .
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2023, 22 (11) :8114-8127
[8]   Exploiting Backscatter-Aided Relay Communications With Hybrid Access Model in Device-to-Device Networks [J].
Gong, Shimin ;
Gao, Lin ;
Xu, Jing ;
Guo, Yuanxiong ;
Dinh Thai Hoang ;
Niyato, Dusit .
IEEE TRANSACTIONS ON COGNITIVE COMMUNICATIONS AND NETWORKING, 2019, 5 (04) :835-848
[9]   Survey of Important Issues in UAV Communication Networks [J].
Gupta, Lav ;
Jain, Raj ;
Vaszkun, Gabor .
IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2016, 18 (02) :1123-1152
[10]   UAV-Aided Backscatter Communications: Performance Analysis and Trajectory Optimization [J].
Han, Rui ;
Bai, Lin ;
Wen, Yongqing ;
Liu, Jianwei ;
Choi, Jinho ;
Zhang, Wei .
IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2021, 39 (10) :3129-3143