UAV Path Planning for Wireless Data Harvesting: A Deep Reinforcement Learning Approach

Cited by: 44

Authors:

Bayerlein, Harald [1]

Theile, Mirco [2]

Caccamo, Marco [2]

Gesbert, David [1]

Affiliations:

[1] EURECOM, Commun Syst Dept, Sophia Antipolis, France

[2] Tech Univ Munich, TUM Dept Mech Engn, Munich, Germany

Source:

2020 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM) | 2020

Funding:

European Research Council;

Keywords:

DOI:

10.1109/GLOBECOM42002.2020.9322234

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification code:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Autonomous deployment of unmanned aerial vehicles (UAVs) supporting next-generation communication networks requires efficient trajectory planning methods. We propose a new end-to-end reinforcement learning (RL) approach to UAV-enabled data collection from Internet of Things (IoT) devices in an urban environment. An autonomous drone is tasked with gathering data from distributed sensor nodes subject to limited flying time and obstacle avoidance. While previous approaches, learning and non-learning based, must perform expensive recomputations or relearn a behavior when important scenario parameters such as the number of sensors, sensor positions, or maximum flying time, change, we train a double deep Q-network (DDQN) with combined experience replay to learn a UAV control policy that generalizes over changing scenario parameters. By exploiting a multi-layer map of the environment fed through convolutional network layers to the agent, we show that our proposed network architecture enables the agent to make movement decisions for a variety of scenario parameters that balance the data collection goal with flight time efficiency and safety constraints. Considerable advantages in learning efficiency from using a map centered on the UAV's position over a non-centered map are also illustrated.
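
Two ideas named in the abstract, a map centered on the UAV's position and the double deep Q-network (DDQN) learning target, can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names `center_map` and `double_dqn_target`, the single-layer list-of-lists grid, the doubled output size, and the padding value are assumptions made for illustration only. The target follows the standard Double DQN rule, in which the online network selects the next action and the target network evaluates it.

```python
from typing import List, Sequence

Grid = List[List[int]]


def center_map(grid: Grid, row: int, col: int, pad_value: int = 0) -> Grid:
    """Shift `grid` so that cell (row, col) sits in the middle of the output.

    The output has size (2H-1) x (2W-1), which is large enough to hold the
    whole grid for any agent position. Cells outside the original grid are
    filled with `pad_value` (an assumed padding choice).
    """
    height, width = len(grid), len(grid[0])
    centered = [[pad_value] * (2 * width - 1) for _ in range(2 * height - 1)]
    # Offset that moves (row, col) onto the centre cell (height-1, width-1).
    d_row, d_col = height - 1 - row, width - 1 - col
    for r in range(height):
        for c in range(width):
            centered[r + d_row][c + d_col] = grid[r][c]
    return centered


def double_dqn_target(
    reward: float,
    gamma: float,
    done: bool,
    q_online_next: Sequence[float],
    q_target_next: Sequence[float],
) -> float:
    """Standard Double DQN target for a single transition.

    `q_online_next` and `q_target_next` hold the Q-values of the next state
    under the online and target networks, one entry per action.
    """
    if done:
        return reward
    # The online network picks the action, the target network scores it.
    best_action = max(range(len(q_online_next)), key=lambda a: q_online_next[a])
    return reward + gamma * q_target_next[best_action]


if __name__ == "__main__":
    toy_map = [[1, 2], [3, 4]]
    print(center_map(toy_map, row=0, col=1))
    print(double_dqn_target(1.0, 0.5, False, [0.1, 0.9], [4.0, 2.0]))
```

With a centered map the agent's own cell always appears at the same index of the input, which is one plausible reading of why the abstract reports better learning efficiency for the centered variant.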


Pages: 6

50 records in total

[41] Efficient Deep Reinforcement Learning for Optimal Path Planning
Ren, Jing
Huang, Xishi
Huang, Raymond N.
ELECTRONICS, 2022, 11 (21)
[42] Robot path planning based on deep reinforcement learning
Long, Yinxin
He, Huajin
2020 IEEE CONFERENCE ON TELECOMMUNICATIONS, OPTICS AND COMPUTER SCIENCE (TOCS), 2020, : 151 - 154
[43] Robot Path Planning Based on Deep Reinforcement Learning
Zhang, Rui
Jiang, Yuhao
Wu, Fenghua
2022 34TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2022, : 1697 - 1701
[44] Towards Real-Time Path Planning through Deep Reinforcement Learning for a UAV in Dynamic Environments
Yan, Chao
Xiang, Xiaojia
Wang, Chang
JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2020, 98 (02) : 297 - 309
[45] iADA*-RL: Anytime Graph-Based Path Planning with Deep Reinforcement Learning for an Autonomous UAV
Maw, Aye Aye
Tyan, Maxim
Nguyen, Tuan Anh
Lee, Jae-Woo
APPLIED SCIENCES-BASEL, 2021, 11 (09)
[46] Deep Reinforcement Learning for Trajectory Path Planning and Distributed Inference in Resource-Constrained UAV Swarms
Dhuheir, Marwan
Baccour, Emna
Erbad, Aiman
Al-Obaidi, Sinan Sabeeh
Hamdi, Mounir
IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (09) : 8185 - 8201
[47] Towards Real-Time Path Planning through Deep Reinforcement Learning for a UAV in Dynamic Environments
Yan, Chao
Xiang, Xiaojia
Wang, Chang
JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2020, 98 : 297 - 309
[48] Deep Reinforcement Learning Assisted UAV Path Planning Relying on Cumulative Reward Mode and Region Segmentation
Wang, Zhipeng
Ng, Soon Xin
El-Hajjar, Mohammed
IEEE OPEN JOURNAL OF VEHICULAR TECHNOLOGY, 2024, 5 : 737 - 751
[49] Adaptive Informative Path Planning Using Deep Reinforcement Learning for UAV-based Active Sensing
Rueckin, Julius
Jin, Liren
Popovic, Marija
2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2022), 2022, : 4473 - 4479
[50] UAV Control for Wireless Service Provisioning in Critical Demand Areas: A Deep Reinforcement Learning Approach
Ho, Tai Manh
Nguyen, Kim-Khoa
Cheriet, Mohamed
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2021, 70 (07) : 7138 - 7152
