Deep Reinforcement Learning Based Energy Efficient Multi-UAV Data Collection for IoT Networks

被引:20
|
作者
Khodaparast, Seyed Saeed [1 ]
Lu, Xiao [1 ]
Wang, Ping [1 ]
Uyen Trang Nguyen [1 ]
机构
[1] York Univ, Dept Elect Engn & Comp Sci, Toronto, ON M3J 1P3, Canada
来源
IEEE OPEN JOURNAL OF VEHICULAR TECHNOLOGY | 2021年 / 2卷
基金
加拿大自然科学与工程研究理事会;
关键词
Sensors; Data collection; Energy consumption; Trajectory; Navigation; Unmanned aerial vehicles; Task analysis; unmanned aerial vehicle (UAV); Internet of Things (IoT); deep reinforcement learning (DRL); energy consumption; AUTONOMOUS NAVIGATION;
D O I
10.1109/OJVT.2021.3085421
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Unmanned aerial vehicles (UAVs) are regarded as an emerging technology, which can be effectively utilized to perform the data collection tasks in the Internet of Things (IoT) networks. However, both the UAVs and the sensors in these networks are energy-limited devices, which necessitates an energy-efficient data collection procedure to ensure the network lifetime. In this paper, we propose a multi-UAV-assisted network, where the UAVs fly to the ground sensors and control the sensor's transmit power during the data collection time. Our goal is to minimize the total energy consumption of the UAVs and the sensors, which is needed to accomplish the data collection mission. We formulate this problem into three sub-problems of single UAV navigation, sensor power control as well as multi-UAV scheduling and model each part as a finite-horizon Markov Decision Process (MDP). We deploy deep reinforcement learning (DRL)-based frameworks to solve each part. Specifically, we use deep deterministic policy gradient (DDPG) method to generate the best trajectory for the UAVs in an obstacle-constraint environment, given its starting position and the target sensor. We also deploy DDPG to control the sensor's transmit power during data collection. To schedule activity plans for each UAV to visit the sensors, we propose a multi-agent deep Q-learning (DQL) approach by taking the total energy consumption of the UAVs on each path into account. Our simulations show that the UAVs can find a safe and optimal path for each of their trips. Continuous power control of the sensors achieves better performance over the fixed power approaches in terms of the total energy consumption during data collection. In addition, compared to the two commonly used baselines, our scheduling framework achieves better and near-optimal results.
引用
收藏
页码:249 / 260
页数:12
相关论文
共 50 条
  • [1] Energy Efficient UAV-Assisted IoT Data Collection: A Graph-Based Deep Reinforcement Learning Approach
    Wu, Qianqian
    Liu, Qiang
    Zhu, Wenliang
    Wu, Zefan
    IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2024, 21 (06): : 6082 - 6094
  • [2] Timely Data Collection for UAV-Based IoT Networks: A Deep Reinforcement Learning Approach
    Hu, Yingmeng
    Liu, Yan
    Kaushik, Aryan
    Masouros, Christos
    Thompson, John S.
    IEEE SENSORS JOURNAL, 2023, 23 (11) : 12295 - 12308
  • [3] Multi-UAV Reinforcement Learning for Data Collection in Cellular MIMO Networks
    Diaz-Vilor, Carles
    Abdelhady, Amr M.
    Eltawil, Ahmed M.
    Jafarkhani, Hamid
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2024, 23 (10) : 15462 - 15476
  • [4] Deep Reinforcement Learning Approach for Joint Trajectory Design in Multi-UAV IoT Networks
    Xu, Shu
    Zhan, Xiangyu
    Li, Chunguo
    Wang, Dongming
    Yang, Luxi
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2022, 71 (03) : 3389 - 3394
  • [5] Multi-UAV Path Planning for Wireless Data Harvesting With Deep Reinforcement Learning
    Bayerlein, Harald
    Theile, Mirco
    Caccamo, Marco
    Gesbert, David
    IEEE OPEN JOURNAL OF THE COMMUNICATIONS SOCIETY, 2021, 2 : 1171 - 1187
  • [6] Multitask Transfer Deep Reinforcement Learning for Timely Data Collection in Rechargeable-UAV-Aided IoT Networks
    Yi, Mengjie
    Wang, Xijun
    Liu, Juan
    Zhang, Yan
    Hou, Ronghui
    IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (23) : 20545 - 20559
  • [7] Energy-Efficient Multidimensional Trajectory of UAV-Aided IoT Networks With Reinforcement Learning
    Silvirianti
    Shin, Soo Young
    IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (19): : 19214 - 19226
  • [8] Deep Reinforcement Learning for Energy-Efficient Data Dissemination Through UAV Networks
    Ali, Abubakar S.
    Al-Habob, Ahmed A.
    Naser, Shimaa
    Bariah, Lina
    Dobre, Octavia A.
    Muhaidat, Sami
    IEEE OPEN JOURNAL OF THE COMMUNICATIONS SOCIETY, 2024, 5 : 5567 - 5583
  • [9] Distributed Energy-Efficient Multi-UAV Navigation for Long-Term Communication Coverage by Deep Reinforcement Learning
    Liu, Chi Harold
    Ma, Xiaoxin
    Gao, Xudong
    Tang, Jian
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2020, 19 (06) : 1274 - 1285
  • [10] Efficient Data Collection Scheme for Multi-Modal Underwater Sensor Networks Based on Deep Reinforcement Learning
    Song, Shanshan
    Liu, Jun
    Guo, Jiani
    Lin, Bin
    Ye, Qiang
    Cui, Junhong
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2023, 72 (05) : 6558 - 6570