Deep Reinforcement Learning for Real-Time Trajectory Planning in UAV Networks

被引:0
作者
Li, Kai [1 ]
Ni, Wei [2 ]
Tovar, Eduardo [1 ]
Guizani, Mohsen [3 ]
机构
[1] CISTER Res Ctr, Porto, Portugal
[2] CSIRO, Sydney, NSW, Australia
[3] Qatar Univ, Doha, Qatar
来源
2020 16TH INTERNATIONAL WIRELESS COMMUNICATIONS & MOBILE COMPUTING CONFERENCE, IWCMC | 2020年
关键词
Wireless sensor networks; Unmanned aerial vehicles; Trajectory planning; Wireless power transfer; Deep reinforcement learning;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In Unmanned Aerial Vehicle (UAV)-enabled wireless powered sensor networks, a UAV can be employed to charge the ground sensors remotely via Wireless Power Transfer (WPT) and collect the sensory data. This paper focuses on trajectory planning of the UAV for aerial data collection and WPT to minimize buffer overflow at the ground sensors and unsuccessful transmission due to lossy airborne channels. Consider network states of battery levels and buffer lengths of the ground sensors, channel conditions, and location of the UAV. A flight trajectory planning optimization is formulated as a Partial Observable Markov Decision Process (POMDP), where the UAV has partial observation of the network states. In practice, the UAV-enabled sensor network contains a large number of network states and actions in POMDP while the up-to-date knowledge of the network states is not available at the UAV. To address these issues, we propose an onboard deep reinforcement learning algorithm to optimize the real-time trajectory planning of the UAV given outdated knowledge on the network states.
引用
收藏
页码:958 / 963
页数:6
相关论文
共 24 条
[1]   Adaptive modulation over Nakagami fading channels [J].
Alouini, MS ;
Goldsmith, AJ .
WIRELESS PERSONAL COMMUNICATIONS, 2000, 13 (1-2) :119-143
[2]  
Alvear OA, 2017, INT WIREL COMMUN, P2115, DOI 10.1109/IWCMC.2017.7986610
[3]  
Emami Y., 2020, VEHICULAR TECHNOLOGY
[4]  
Fotouhi A, 2017, I S WORLD WIREL MOBI
[5]  
Ghazzai H, 2018, IEEE GLOB COMM CONF
[6]  
Gradshteyn I S., 2014, Table of Integrals, Series and Products
[7]   UAV-Assisted Relaying and Edge Computing: Scheduling and Trajectory Optimization [J].
Hu, Xiaoyan ;
Wong, Kai-Kit ;
Yang, Kun ;
Zheng, Zhongbin .
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2019, 18 (10) :4738-4752
[8]   Optimal 1D Trajectory Design for UAV-Enabled Multiuser Wireless Power Transfer [J].
Hu, Yulin ;
Yuan, Xiaopeng ;
Xu, Jie ;
Schmeink, Anke .
IEEE TRANSACTIONS ON COMMUNICATIONS, 2019, 67 (08) :5674-5688
[9]   UAV Communications for 5G and Beyond: Recent Advances and Future Trends [J].
Li, Bin ;
Fei, Zesong ;
Zhang, Yan .
IEEE INTERNET OF THINGS JOURNAL, 2019, 6 (02) :2241-2263
[10]  
Li K, 2019, IEEE Transactions on Vehicular Technology, P1