Autonomous UAV Trajectory for Localizing Ground Objects: A Reinforcement Learning Approach

Cited by: 81

Authors:

Ebrahimi, Dariush [1]

Sharafeddine, Sanaa [2]

Ho, Pin-Han [3]

Assi, Chadi [4]

Affiliations:

[1] Lakehead Univ, Dept Comp Sci, Thunder Bay, ON P7B 5E1, Canada

[2] Lebanese Amer Univ, Dept Comp Sci & Math, Beirut 1102, Lebanon

[3] Univ Waterloo, Dept Elect & Comp Engn, Waterloo, ON N2L 3G1, Canada

[4] Concordia Univ, Fac Engn & Comp Sci, Montreal, PQ H4B 1R6, Canada

Source:

IEEE TRANSACTIONS ON MOBILE COMPUTING | 2021 / Vol. 20 / No. 4

Keywords:

Trajectory; Energy consumption; Drones; Shadow mapping; Global Positioning System; Reinforcement learning; Localization; Q-Learning; unmanned aerial vehicles (UAVs); trajectory planning; received signal strength (RSS); COMMUNICATION; ALTITUDE

DOI:

10.1109/TMC.2020.2966989

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology]

Discipline classification code:

0812

Abstract:

Disaster management, search and rescue missions, and health monitoring are examples of critical applications that require high-precision object localization, sometimes under tight time constraints. In the absence of the global positioning system (GPS), the radio received signal strength indicator (RSSI) can be used for localization because of its simplicity and cost-effectiveness. However, RSSI has low accuracy, so unmanned aerial vehicles (UAVs), or drones, may be used to improve localization accuracy thanks to their agility and higher probability of line-of-sight (LoS). In this context, we propose a novel framework based on reinforcement learning (RL) that enables a UAV (agent) to autonomously find a trajectory that improves the localization accuracy of multiple objects with the shortest time and path length, fewer signal-strength measurements (waypoints), and/or lower UAV energy consumption. In particular, we first guide the agent along an initial scan trajectory over the whole region to 1) determine the number of nodes and estimate their initial locations, and 2) train the agent online during operation. Then, the agent forms its trajectory by using RL to choose the next waypoints so as to minimize the average location error over all objects. Our framework includes detailed UAV-to-ground channel characteristics with an empirical path loss and log-normal shadowing model, as well as an elaborate energy consumption model. We investigate and compare the localization precision of our approach with existing methods from the literature by varying the UAV's trajectory length, energy, number of waypoints, and time. Furthermore, we study the impact of the UAV's velocity, altitude, hovering time, communication range, maximum number of RSSI measurements, and number of objects. The results show the superiority of our method over the state of the art and demonstrate its fast reduction of the localization error.
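
The abstract mentions RSSI-based ranging under a path loss and log-normal shadowing model. As a rough illustration only, the sketch below uses the generic textbook log-distance model, not the paper's empirical channel model. The function names and all parameter values (reference power, reference distance, path loss exponent, shadowing standard deviation) are assumptions chosen for the example.

```python
import math
import random


def rssi_dbm(distance_m, p0_dbm=-40.0, d0_m=1.0, path_loss_exp=2.5,
             shadow_sigma_db=0.0, rng=None):
    """Received power (dBm) at distance_m under a log-distance model.

    All default parameter values are illustrative assumptions.
    Shadowing is zero-mean Gaussian in dB (log-normal in linear scale).
    """
    if distance_m <= 0:
        raise ValueError("distance_m must be positive")
    shadowing_db = 0.0
    if shadow_sigma_db > 0:
        shadowing_db = (rng or random).gauss(0.0, shadow_sigma_db)
    path_loss_db = 10.0 * path_loss_exp * math.log10(distance_m / d0_m)
    return p0_dbm - path_loss_db + shadowing_db


def distance_from_rssi(rssi, p0_dbm=-40.0, d0_m=1.0, path_loss_exp=2.5):
    """Invert the noise-free model to estimate the UAV-to-object distance."""
    return d0_m * 10.0 ** ((p0_dbm - rssi) / (10.0 * path_loss_exp))


def ground_range(slant_m, altitude_m):
    """Horizontal distance implied by a slant distance and the UAV altitude."""
    return math.sqrt(max(slant_m ** 2 - altitude_m ** 2, 0.0))


if __name__ == "__main__":
    rng = random.Random(0)
    true_slant_m = 100.0
    # One noisy measurement per hovering point; averaging several readings
    # reduces the effect of shadowing on the distance estimate.
    readings = [rssi_dbm(true_slant_m, shadow_sigma_db=4.0, rng=rng)
                for _ in range(20)]
    mean_rssi = sum(readings) / len(readings)
    est_slant_m = distance_from_rssi(mean_rssi)
    print(f"estimated slant distance: {est_slant_m:.1f} m")
    print(f"estimated ground range at 30 m altitude: "
          f"{ground_range(est_slant_m, 30.0):.1f} m")
```

Distance estimates from several waypoints could then be combined (for example by multilateration) to estimate each object's position. How the waypoints are chosen is the part the paper addresses with RL, and it is not shown here.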

Pages: 1312-1324

Page count: 13
