Autonomous UAV Trajectory for Localizing Ground Objects: A Reinforcement Learning Approach

被引:81
|
作者
Ebrahimi, Dariush [1 ]
Sharafeddine, Sanaa [2 ]
Ho, Pin-Han [3 ]
Assi, Chadi [4 ]
机构
[1] Lakehead Univ, Dept Comp Sci, Thunder Bay, ON P7B 5E1, Canada
[2] Lebanese Amer Univ, Dept Comp Sci & Math, Beirut 1102, Lebanon
[3] Univ Waterloo, Dept Elect & Comp Engn, Waterloo, ON N2L 3G1, Canada
[4] Concordia Univ, Fac Engn & Comp Sci, Montreal, PQ H4B 1R6, Canada
关键词
Trajectory; Energy consumption; Drones; Shadow mapping; Global Positioning System; Reinforcement learning; Localization; reinforcement learning; Q-Learning; unmanned aerial vehicles (UAVs); drones; trajectory planning; received signal strength (RSS); LOCALIZATION; COMMUNICATION; ALTITUDE;
D O I
10.1109/TMC.2020.2966989
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Disaster management, search and rescue missions, and health monitoring are examples of critical applications that require object localization with high precision and sometimes in a timely manner. In the absence of the global positioning system (GPS), the radio received signal strength index (RSSI) can be used for localization purposes due to its simplicity and cost-effectiveness. However, due to the low accuracy of RSSI, unmanned aerial vehicles (UAVs) or drones may be used as an efficient solution for improved localization accuracy due to their agility and higher probability of line-of-sight (LoS). Hence, in this context, we propose a novel framework based on reinforcement learning (RL) to enable a UAV (agent) to autonomously find its trajectory that results in improving the localization accuracy of multiple objects in shortest time and path length, fewer signal-strength measurements (waypoints), and/or lower UAV energy consumption. In particular, we first control the agent through initial scan trajectory on the whole region to 1) know the number of nodes and estimate their initial locations, and 2) train the agent online during operation. Then, the agent forms its trajectory by using RL to choose the next waypoints in order to minimize the average location errors of all objects. Our framework includes detailed UAV to ground channel characteristics with an empirical path loss and log-normal shadowing model, and also with an elaborate energy consumption model. We investigate and compare the localization precision of our approach with existing methods from the literature by varying the UAV's trajectory length, energy, number of waypoints, and time. Furthermore, we study the impact of the UAV's velocity, altitude, hovering time, communication range, number of maximum RSSI measurements, and number of objects. The results show the superiority of our method over the state-of-art and demonstrates its fast reduction of the localization error.
引用
收藏
页码:1312 / 1324
页数:13
相关论文
共 50 条
  • [1] Federated Reinforcement Learning UAV Trajectory Design for Fast Localization of Ground Users
    Shahbazi, Arzhang
    Donevski, Igor
    Nielsen, Jimmy Jessen
    Di Renzo, Marco
    2022 30TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2022), 2022, : 663 - 666
  • [2] Deep Reinforcement Learning for UAV Trajectory Design Considering Mobile Ground Users
    Lee, Wonseok
    Jeon, Young
    Kim, Taejoon
    Kim, Young-Il
    SENSORS, 2021, 21 (24)
  • [3] A Deep Reinforcement Learning Approach for Federated Learning Optimization with UAV Trajectory Planning
    Zhang, Chunyu
    Liu, Yiming
    Zhang, Zhi
    2023 IEEE 34TH ANNUAL INTERNATIONAL SYMPOSIUM ON PERSONAL, INDOOR AND MOBILE RADIO COMMUNICATIONS, PIMRC, 2023,
  • [4] UAV air combat autonomous trajectory planning method based on robust adversarial reinforcement learning
    Wang, Lixin
    Zheng, Sizhuang
    Tai, Shang
    Liu, Hailiang
    Yue, Ting
    AEROSPACE SCIENCE AND TECHNOLOGY, 2024, 153
  • [5] Hybrid reinforcement learning for autonomous UAV control
    Yoo J.H.
    Journal of Institute of Control, Robotics and Systems, 2019, 25 (06) : 546 - 550
  • [6] AoI optimal UAV trajectory planning: A Deep Recurrent Reinforcement Learning Approach
    Wu, Mengjie
    Chi, Huijia
    Gan, Shuying
    Wang, Xijun
    Xu, Chao
    2021 IEEE 32ND ANNUAL INTERNATIONAL SYMPOSIUM ON PERSONAL, INDOOR AND MOBILE RADIO COMMUNICATIONS (PIMRC), 2021,
  • [7] Trajectory Design and Generalization for UAV Enabled Networks:A Deep Reinforcement Learning Approach
    Li, Xuan
    Wang, Qiang
    Liu, Jie
    Zhang, Wenqi
    2020 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2020,
  • [8] Autonomous UAV Navigation: A DDPG-based Deep Reinforcement Learning Approach
    Bouhamed, Omar
    Ghazzai, Hakim
    Besbes, Hichem
    Massoud, Yehia
    2020 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2020,
  • [9] Trajectory Design in UAV-Aided Mobile Crowdsensing: A Deep Reinforcement Learning Approach
    Tao, Xi
    Hafid, Abdelhakim Senhaji
    IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2021), 2021,
  • [10] Trajectory Design for UAV-Based Inspection System: A Deep Reinforcement Learning Approach
    Zhang, Wei
    Yang, Dingcheng
    Wu, Fahui
    Xiao, Lin
    2023 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS WORKSHOPS, ICC WORKSHOPS, 2023, : 1654 - 1659