A Reinforcement Learning Algorithm for Data Collection in UAV-aided IoT Networks with Uncertain Time Windows

被引:3
作者
Cicek, Cihan Tugrul [1 ]
机构
[1] Atilim Univ, Dept Ind Engn, Ankara, Turkey
来源
2021 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS WORKSHOPS (ICC WORKSHOPS) | 2021年
关键词
UAV; internet-of-things; reinforcement learning; battery swapping; time windows; uncertainty;
D O I
10.1109/ICCWorkshops50388.2021.9473768
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Unmanned aerial vehicles (UAVs) have been considered as an efficient solution to collect data from ground sensor nodes in Internet-of-Things (IoT) networks due to their several advantages such as flexibility, quick deployment and maneuverability. Studies on this subject have been mainly focused on problems where limited UAV battery is introduced as a tight constraint that shortens the mission time in the models, which significantly undervalues the UAV potential. Moreover, the sensors in the network are typically assumed to have deterministic working times during which the data is uploaded. In this study, we revisit the UAV trajectory planning problem with a different approach and revise the battery constraint by allowing UAVs to swap their batteries at fixed stations and continue their data collection task, hence, the planning horizon can be extended. In particular, we develop a discrete time Markov process (DTMP) in which the UAV trajectory and battery swapping times are jointly determined to minimize the total data loss in the network, where the sensors have uncertain time windows for uploading. Due to the so-called curse-of-dimensionality, we propose a reinforcement learning (RL) algorithm in which the UAV is trained as an agent to explore the network. The computational study shows that our proposed algorithm outperforms two benchmark approaches and achieves significant reduction in data loss.
引用
收藏
页数:6
相关论文
共 18 条
[1]  
Bertsekas D. P., 2011, Dynamic programming and optimal control, VII
[2]   AirNet: Energy-Aware Deployment and Scheduling of Aerial Networks [J].
Bozkaya, Elif ;
Foerster, Klaus-Tycho ;
Schmid, Stefan ;
Canberk, Berk .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2020, 69 (10) :12252-12263
[3]   When UAV Swarm Meets Edge-Cloud Computing: The QoS Perspective [J].
Chen, Wuhui ;
Liu, Baichuan ;
Huang, Huawei ;
Guo, Song ;
Meng, Zibin .
IEEE NETWORK, 2019, 33 (02) :36-43
[4]  
Cicek C. T., 2019, 2019 1 INT C UNM, P1
[5]  
Cicek C. T., 2021, INFORMS J COMPUT
[6]  
Drone Industry Insights, 2020, DRON MARK REP 2020 2
[7]   Flight Time Minimization of UAV for Data Collection Over Wireless Sensor Networks [J].
Gong, Jie ;
Chang, Tsung-Hui ;
Shen, Chao ;
Chen, Xiang .
IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2018, 36 (09) :1942-1954
[8]   UAV Trajectory Planning for Data Collection from Time-Constrained IoT Devices [J].
Samir, Moataz ;
Sharafeddine, Sanaa ;
Assi, Chadi M. ;
Tri Minh Nguyen ;
Ghrayeb, Ali .
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2020, 19 (01) :34-46
[9]   Priority-Based Data Gathering Framework in UAV-Assisted Wireless Sensor Networks [J].
Say, Sotheara ;
Inata, Hikari ;
Liu, Jiang ;
Shimamoto, Shigeru .
IEEE SENSORS JOURNAL, 2016, 16 (14) :5785-5794
[10]   Unmanned Aerial Vehicles (UAVs): A Survey on Civil Applications and Key Research Challenges [J].
Shakhatreh, Hazim ;
Sawalmeh, Ahmad H. ;
Al-Fuqaha, Ala ;
Dou, Zuochao ;
Almaita, Eyad ;
Khalil, Issa ;
Othman, Noor Shamsiah ;
Khreishah, Abdallah ;
Guizani, Mohsen .
IEEE ACCESS, 2019, 7 :48572-48634