UAV Path Planning for Wireless Data Harvesting: A Deep Reinforcement Learning Approach

Cited by: 44
Authors
Bayerlein, Harald [1]
Theile, Mirco [2]
Caccamo, Marco [2]
Gesbert, David [1]
Affiliations
[1] EURECOM, Commun Syst Dept, Sophia Antipolis, France
[2] Tech Univ Munich, TUM Dept Mech Engn, Munich, Germany
Source
2020 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM) | 2020
Funding
European Research Council;
DOI
10.1109/GLOBECOM42002.2020.9322234
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Autonomous deployment of unmanned aerial vehicles (UAVs) supporting next-generation communication networks requires efficient trajectory planning methods. We propose a new end-to-end reinforcement learning (RL) approach to UAV-enabled data collection from Internet of Things (IoT) devices in an urban environment. An autonomous drone is tasked with gathering data from distributed sensor nodes subject to limited flying time and obstacle avoidance. While previous approaches, learning and non-learning based, must perform expensive recomputations or relearn a behavior when important scenario parameters such as the number of sensors, sensor positions, or maximum flying time change, we train a double deep Q-network (DDQN) with combined experience replay to learn a UAV control policy that generalizes over changing scenario parameters. By exploiting a multi-layer map of the environment fed through convolutional network layers to the agent, we show that our proposed network architecture enables the agent to make movement decisions for a variety of scenario parameters that balance the data collection goal with flight time efficiency and safety constraints. Considerable advantages in learning efficiency from using a map centered on the UAV's position over a non-centered map are also illustrated.
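The map centering mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `center_map`, the padding value, and the choice of a (2R-1) x (2C-1) output size are assumptions made for illustration. The idea shown is only that the grid map is shifted inside a larger padded array so that the UAV's cell always sits at the center, whatever its absolute position.

```python
import numpy as np


def center_map(grid, uav_pos, pad_value=0):
    """Return a padded copy of `grid` with the UAV cell at the center.

    Hypothetical helper for illustration only.
    grid:     array of shape (R, C) or (R, C, layers), e.g. a multi-layer map
    uav_pos:  (row, col) of the UAV inside `grid`
    The output has shape (2R-1, 2C-1[, layers]) so that every possible
    UAV position fits without cropping any part of the original map.
    """
    rows, cols = grid.shape[:2]
    r, c = uav_pos
    out_shape = (2 * rows - 1, 2 * cols - 1) + grid.shape[2:]
    out = np.full(out_shape, pad_value, dtype=grid.dtype)

    # Offset chosen so that grid[r, c] lands on the center cell (rows-1, cols-1).
    top = rows - 1 - r
    left = cols - 1 - c
    out[top:top + rows, left:left + cols] = grid
    return out


# Example: a 3x3 single-layer map with the UAV in the top-right corner.
example_grid = np.arange(9).reshape(3, 3)
example_centered = center_map(example_grid, (0, 2))
```

In this sketch the agent's own position needs no separate encoding, since it is always the center cell; the cost is a roughly fourfold larger input array.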
Pages: 6