Trajectory Design for UAV-Based Internet of Things Data Collection: A Deep Reinforcement Learning Approach

Cited by: 75
Authors
Wang, Yang [1 ]
Gao, Zhen [1 ]
Zhang, Jun [1 ]
Cao, Xianbin [2 ]
Zheng, Dezhi [3 ]
Gao, Yue [4 ]
Ng, Derrick Wing Kwan [5 ]
Di Renzo, Marco [6 ]
Affiliations
[1] Beijing Inst Technol, Sch Informat & Elect, Beijing 100081, Peoples R China
[2] Beihang Univ, Sch Elect & Informat Engn, Beijing 100191, Peoples R China
[3] Beihang Univ, Sch Instrumentat & Optoelect Engn, Innovat Inst Frontier Sci & Technol, Beijing 100191, Peoples R China
[4] Univ Surrey, Dept Elect & Elect Engn, Surrey GU2 7XH, England
[5] Univ New South Wales, Sch Elect Engn & Telecommun, Sydney, NSW 2025, Australia
[6] Univ Paris Saclay, Lab Signaux & Syst, Cent Supelec, CNRS, F-91192 Gif Sur Yvette, France
Source
IEEE INTERNET OF THINGS JOURNAL | 2022 / Vol. 9 / Issue 05
Funding
Beijing Natural Science Foundation; Australian Research Council; National Natural Science Foundation of China;
Keywords
Trajectory; Data collection; Sensors; Optimization; Three-dimensional displays; Minimization; Resource management; deep reinforcement learning (DRL); Internet of Things (IoT); trajectory design; unmanned aerial vehicle (UAV) communications; ENERGY-EFFICIENT; RESOURCE-ALLOCATION; COMMUNICATION; OPTIMIZATION;
DOI
10.1109/JIOT.2021.3102185
Chinese Library Classification
TP [Automation technology, computer technology];
Discipline classification code
0812 ;
Abstract
In this article, we investigate an unmanned aerial vehicle (UAV)-assisted Internet of Things (IoT) system in a sophisticated 3-D environment, where the UAV's trajectory is optimized to efficiently collect data from multiple IoT ground nodes. Unlike existing approaches, which focus only on a simplified 2-D scenario and assume the availability of perfect channel state information (CSI), this article considers a practical 3-D urban environment with imperfect CSI, where the UAV's trajectory is designed to minimize the data collection completion time subject to practical throughput and flight movement constraints. Specifically, inspired by state-of-the-art deep reinforcement learning approaches, we leverage the twin-delayed deep deterministic policy gradient (TD3) to design the UAV's trajectory, and we present a TD3-based trajectory design for completion time minimization (TD3-TDCTM) algorithm. In particular, we introduce an additional piece of information, the merged pheromone, which represents the state of the UAV and the environment and serves as a reference for the reward, facilitating the algorithm design. By taking the service statuses of the IoT nodes, the UAV's position, and the merged pheromone as input, the proposed algorithm can continuously and adaptively learn how to adjust the UAV's movement strategy. By interacting with the external environment in the corresponding Markov decision process, the proposed algorithm can achieve a near-optimal navigation strategy. Our simulation results show the superiority of the proposed TD3-TDCTM algorithm over three conventional nonlearning-based baseline methods.
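For readers unfamiliar with TD3, the following is a minimal sketch of the generic TD3 bootstrap target (target policy smoothing plus clipped double-Q), not the authors' TD3-TDCTM implementation. The abstract does not give the exact state layout, reward, or hyperparameters, so the function name `td3_target`, the toy target networks, and every numeric value below are assumptions chosen for illustration only.

```python
import random


def td3_target(reward, done, next_state, actor_target, critic1_target,
               critic2_target, gamma=0.99, noise_std=0.2, noise_clip=0.5,
               action_low=-1.0, action_high=1.0, rng=random):
    """Return the generic TD3 bootstrap target for a single transition.

    All default hyperparameters are illustrative assumptions, not values
    taken from the paper.
    """
    # Target policy smoothing: perturb each component of the target action
    # with clipped Gaussian noise, then clip to the valid action range.
    smoothed = []
    for a in actor_target(next_state):
        noise = max(-noise_clip, min(noise_clip, rng.gauss(0.0, noise_std)))
        smoothed.append(max(action_low, min(action_high, a + noise)))
    # Clipped double-Q: use the smaller of the two target critic estimates
    # to reduce overestimation of the action value.
    q_min = min(critic1_target(next_state, smoothed),
                critic2_target(next_state, smoothed))
    # No bootstrapping from terminal states.
    return reward + gamma * (1.0 - float(done)) * q_min


# Hypothetical toy stand-ins for the target networks. The state here is
# assumed to be a flat list: node service statuses, UAV position (x, y, z),
# and the merged pheromone.
def toy_actor(state):
    return [0.5, -0.5]


def toy_critic_a(state, action):
    return 2.0


def toy_critic_b(state, action):
    return 1.0


example_state = [1, 0, 0, 10.0, 20.0, 50.0, 3.0]
example_target = td3_target(-1.0, False, example_state, toy_actor,
                            toy_critic_a, toy_critic_b,
                            gamma=0.9, noise_std=0.0)
```

With the smoothing noise disabled, the example target reduces to the reward plus the discounted smaller critic estimate. In a full agent the actor and critics would be neural networks updated from a replay buffer, with the actor updated less frequently than the critics.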
Pages: 3899 - 3912
Page count: 14
Related papers
50 in total
  • [31] A Deep Reinforcement Learning Approach to Energy-harvesting UAV-aided Data Collection
    Zhang, Ning
    Liu, Juan
    Xie, Lingfu
    Tong, Peng
    2020 12TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP), 2020, : 93 - 98
  • [32] UAV Trajectory Planning Based on Deep Q-Network for Internet of Things
    Zhang Jianhang
    Kang Kai
    Qian Hua
    Yang Miao
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2022, 44 (11) : 3850 - 3857
  • [33] Learning-Based Aerial Charging Scheduling for UAV-Based Data Collection
    Yang, Jia
    Zhu, Kun
    Zhu, Xiaojun
    Wang, Junhua
    WIRELESS ALGORITHMS, SYSTEMS, AND APPLICATIONS, WASA 2021, PT II, 2021, 12938 : 600 - 611
  • [34] Deep Reinforcement Learning-Based Distributed 3D UAV Trajectory Design
    He, Huasen
    Yuan, Wenke
    Chen, Shuangwu
    Jiang, Xiaofeng
    Yang, Feng
    Yang, Jian
    IEEE TRANSACTIONS ON COMMUNICATIONS, 2024, 72 (06) : 3736 - 3751
  • [35] Deep Reinforcement Learning Based Trajectory Design and Resource Allocation for UAV-Assisted Communications
    Zhang, Chiya
    Li, Zhukun
    He, Chunlong
    Wang, Kezhi
    Pan, Cunhua
    IEEE COMMUNICATIONS LETTERS, 2023, 27 (09) : 2398 - 2402
  • [36] Energy Efficient UAV-Assisted IoT Data Collection: A Graph-Based Deep Reinforcement Learning Approach
    Wu, Qianqian
    Liu, Qiang
    Zhu, Wenliang
    Wu, Zefan
    IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2024, 21 (06) : 6082 - 6094
  • [37] Modeling of a UAV-based Data Collection System
    Arvanitaki, Antonia
    Pappas, Nikolaus
    2017 IEEE 22ND INTERNATIONAL WORKSHOP ON COMPUTER AIDED MODELING AND DESIGN OF COMMUNICATION LINKS AND NETWORKS (CAMAD), 2017,
  • [38] UAV-based LoRaWAN flying gateway for the internet of flying things
    Moheddine, Aya
    Patrone, Fabio
    Marchese, Mario
    INTERNATIONAL JOURNAL OF COMMUNICATION SYSTEMS, 2023, 36 (05)
  • [39] Deep Reinforcement Learning for UAV Trajectory Design Considering Mobile Ground Users
    Lee, Wonseok
    Jeon, Young
    Kim, Taejoon
    Kim, Young-Il
    SENSORS, 2021, 21 (24)
  • [40] Cooperative Data Collection for UAV-Assisted Maritime IoT Based on Deep Reinforcement Learning
    Fu, Xiuwen
    Huang, Xiong
    Pan, Qiongshan
    Pace, Pasquale
    Aloi, Gianluca
    Fortino, Giancarlo
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2024, 73 (07) : 10333 - 10349