Onboard Deep Deterministic Policy Gradients for Online Flight Resource Allocation of UAVs

被引:7
作者
Li, Kai [1 ]
Emami, Yousef [1 ]
Ni, Wei [2 ]
Tovar, Eduardo [1 ]
Han, Zhu [3 ]
机构
[1] Real-Time and Embedded Computing Systems Research Centre (CISTER), Porto
[2] Data61 Commonwealth Scientific and Industrial Research Organization (CSIRO), Sydney, 2122, NSW
[3] Electrical and Computer Engineering Department, University of Houston, Houston, 77004, TX
来源
IEEE Networking Letters | 2020年 / 2卷 / 03期
关键词
data collection; deep reinforcement learning; flight control; Unmanned aerial vehicles;
D O I
10.1109/LNET.2020.3002341
中图分类号
学科分类号
摘要
In Unmanned Aerial Vehicle (UAV) enabled data collection, scheduling data transmissions of the ground nodes while controlling flight of the UAV, e.g., heading and velocity, is critical to reduce the data packet loss resulting from buffer overflows and channel fading. In this letter, a new online flight resource allocation scheme based on deep deterministic policy gradients (DDPG-FRAS) is studied to jointly optimize the flight control of the UAV and data collection scheduling along the trajectory in real time, thereby asymptotically minimizing the packet loss of the ground sensor networks. Numerical results confirm that the proposed DDPG-FRAS can gradually converge, while enlarging the buffer size can reduce the packet loss by 47.9%. © 2019 IEEE.
引用
收藏
页码:106 / 110
页数:4
相关论文
共 19 条
[11]  
Li K., Ni W., Wei B., Tovar E., Onboard double Q-learning for airborne data capture in wireless powered IoT networks, Ieee Netw. Lett., 2, 2, pp. 71-75, (2020)
[12]  
Li K., Ni W., Tovar E., Jamalipour A., On-board deep Q-network for UAV-assisted online power transfer and data collection, Ieee Trans. Veh. Technol., 68, 12, pp. 12215-12226, (2019)
[13]  
Fadlullah Z.M., Takaishi D., Nishiyama H., Kato N., Miura R., A dynamic trajectory control algorithm for improving the communication throughput and delay in UAV-aided networks, Ieee Netw., 30, 1, pp. 100-105, (2016)
[14]  
Han Z., Swindlehurst A.L., Liu K.R., Smart deployment/movement of unmanned air vehicle to improve connectivity in MANET, Proc. Ieee Wireless Commun. Netw. Conf. (WCNC), Las Vegas, NV, Usa, pp. 252-257, (2006)
[15]  
Li K., Ni W., Wang X., Liu R.P., Kanhere S.S., Jha S., Energy-efficient cooperative relaying for unmanned aerial vehicles, Ieee Trans. Mobile Comput., 15, 6, pp. 1377-1386, (2016)
[16]  
Al-Hourani A., Kandeepan S., Lardner S., Optimal LAP altitude for maximum coverage, Ieee Wireless Commun. Lett., 3, 6, pp. 569-572, (2014)
[17]  
Emami Y., Li K., Tovar E., Buffer-aware scheduling for UAV relay networks with energy fairness, Proc. Ieee Veh. Technol. Conf. (VTC Spring), pp. 1-6, (2020)
[18]  
Silver D., Lever G., Heess N., Degris T., Wierstra D., Riedmiller M., Deterministic policy gradient algorithms, Proc. 31st Int. Conf. Mach. Learn. (ICML), pp. 387-395, (2014)
[19]  
Ketkar N., Introduction to PyTorch, Deep Learning with Python, pp. 195-208, (2017)