Deep Reinforcement Learning Based Energy Efficient Multi-UAV Data Collection for IoT Networks

被引:22
作者
Khodaparast, Seyed Saeed [1 ]
Lu, Xiao [1 ]
Wang, Ping [1 ]
Uyen Trang Nguyen [1 ]
机构
[1] York Univ, Dept Elect Engn & Comp Sci, Toronto, ON M3J 1P3, Canada
来源
IEEE OPEN JOURNAL OF VEHICULAR TECHNOLOGY | 2021年 / 2卷
基金
加拿大自然科学与工程研究理事会;
关键词
Sensors; Data collection; Energy consumption; Trajectory; Navigation; Unmanned aerial vehicles; Task analysis; unmanned aerial vehicle (UAV); Internet of Things (IoT); deep reinforcement learning (DRL); energy consumption; AUTONOMOUS NAVIGATION;
D O I
10.1109/OJVT.2021.3085421
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Unmanned aerial vehicles (UAVs) are regarded as an emerging technology, which can be effectively utilized to perform the data collection tasks in the Internet of Things (IoT) networks. However, both the UAVs and the sensors in these networks are energy-limited devices, which necessitates an energy-efficient data collection procedure to ensure the network lifetime. In this paper, we propose a multi-UAV-assisted network, where the UAVs fly to the ground sensors and control the sensor's transmit power during the data collection time. Our goal is to minimize the total energy consumption of the UAVs and the sensors, which is needed to accomplish the data collection mission. We formulate this problem into three sub-problems of single UAV navigation, sensor power control as well as multi-UAV scheduling and model each part as a finite-horizon Markov Decision Process (MDP). We deploy deep reinforcement learning (DRL)-based frameworks to solve each part. Specifically, we use deep deterministic policy gradient (DDPG) method to generate the best trajectory for the UAVs in an obstacle-constraint environment, given its starting position and the target sensor. We also deploy DDPG to control the sensor's transmit power during data collection. To schedule activity plans for each UAV to visit the sensors, we propose a multi-agent deep Q-learning (DQL) approach by taking the total energy consumption of the UAVs on each path into account. Our simulations show that the UAVs can find a safe and optimal path for each of their trips. Continuous power control of the sensors achieves better performance over the fixed power approaches in terms of the total energy consumption during data collection. In addition, compared to the two commonly used baselines, our scheduling framework achieves better and near-optimal results.
引用
收藏
页码:249 / 260
页数:12
相关论文
共 50 条
  • [21] AoI-Energy-Aware UAV-Assisted Data Collection for IoT Networks: A Deep Reinforcement Learning Method
    Sun, Mengying
    Xu, Xiaodong
    Qin, Xiaoqi
    Zhang, Ping
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (24) : 17275 - 17289
  • [22] Energy-Efficient Data Collection for IoT Networks via Cooperative Multi-Hop UAV Networks
    Kim, Taewoon
    Qiao, Daji
    [J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2020, 69 (11) : 13796 - 13811
  • [23] Joint Speed Control and Energy Replenishment Optimization for UAV-Assisted IoT Data Collection With Deep Reinforcement Transfer Learning
    Chu, Nam H.
    Hoang, Dinh Thai
    Nguyen, Diep N.
    Huynh, Nguyen Van
    Dutkiewicz, Eryk
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (07) : 5778 - 5793
  • [24] Deep Reinforcement Learning for Energy-Efficient Federated Learning in UAV-Enabled Wireless Powered Networks
    Quang Vinh Do
    Quoc-Viet Pham
    Hwang, Won-Joo
    [J]. IEEE COMMUNICATIONS LETTERS, 2022, 26 (01) : 99 - 103
  • [25] Collision Detection and Avoidance for Multi-UAV based on Deep Reinforcement Learning
    Wang, Guanzheng
    Liu, Zhihong
    Xiao, Kun
    Xu, Yinbo
    Yang, Lingjie
    Wang, Xiangke
    [J]. 2021 PROCEEDINGS OF THE 40TH CHINESE CONTROL CONFERENCE (CCC), 2021, : 7783 - 7789
  • [26] An MA-HPPO Approach for Multi-UAV Data Collection
    Bai, Zixuan
    Shi, Jia
    Li, Zan
    Li, Meng
    Liao, Xiaomin
    [J]. IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2024, 23 (12) : 17974 - 17986
  • [27] UAV Data Collection With Deep Reinforcement Learning for Grant-Free IoT
    Zhong, Jiale
    Hu, Yingdong
    Li, Ye
    Xu, Yicheng
    Gao, Ruifeng
    Wang, Jue
    [J]. 2024 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE, WCNC 2024, 2024,
  • [28] Deep Reinforcement Learning Based Resource Allocation in Multi-UAV-Aided MEC Networks
    Chen, Jingxuan
    Cao, Xianbin
    Yang, Peng
    Xiao, Meng
    Ren, Siqiao
    Zhao, Zhongliang
    Wu, Dapeng Oliver
    [J]. IEEE TRANSACTIONS ON COMMUNICATIONS, 2023, 71 (01) : 296 - 309
  • [29] Trajectory Design for UAV-Based Internet of Things Data Collection: A Deep Reinforcement Learning Approach
    Wang, Yang
    Gao, Zhen
    Zhang, Jun
    Cao, Xianbin
    Zheng, Dezhi
    Gao, Yue
    Ng, Derrick Wing Kwan
    Di Renzo, Marco
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (05): : 3899 - 3912
  • [30] Energy Efficient Task Cooperation for Multi-UAV Networks: A Coalition Formation Game Approach
    Luan, Heyu
    Xu, Yitao
    Liu, Dianxiong
    Du, Zhiyong
    Qian, Huiming
    Liu, Xiaodu
    Tong, Xiaobing
    [J]. IEEE ACCESS, 2020, 8 : 149372 - 149384