Reinforcement learning framework for UAV-based target localization applications

被引:14
作者
Shurrab, Mohammed [1 ]
Mizouni, Rabeb [1 ]
Singh, Shakti [1 ]
Otrok, Hadi [1 ]
机构
[1] Khalifa Univ, Elect Engn & Comp Sci Dept, Abu Dhabi 127788, U Arab Emirates
关键词
Target localization; Unmanned aerial vehicle (UAV); Reinforcement learning (RL); Deep Q-network (DQN); Data-driven; Deep reinforcement learning (DRL); Smart environmental monitoring (SEM); INTERNET; SYSTEM; THINGS; IOT;
D O I
10.1016/j.iot.2023.100867
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Smart environmental monitoring has gained prominence, where target localization is of utmost importance. Employing UAVs for localization tasks is appealing owing to their low-cost, light-weight, and high maneuverability. However, UAVs lack the autonomy of decision-making if met with uncertain situations. Therefore, reinforcement learning (RL) can introduce intelligence to UAVs, where they learn to act based on the presented situation. Existing works focus on UAV trajectory optimization, navigation, and target tracking. These methods are application-specific and cannot be adapted to localization tasks since they require prior knowledge of the target. Moreover, the current RL-based autonomous target localization systems are lacking since-1) they must keep track of all visited locations and their corresponding readings, 2) they require retraining when encountering new environments, and 3) they are not scalable since the agent's movement is limited to slow speeds and for specific environments. Therefore, this work proposes a data-driven UAV target localization system based on Q-learning, which employs tabular methods to learn the optimal policy. Deep Q-network (DQN) is introduced to enhance the RL model and alleviate the curse of dimensionality. The proposed models enable smart decision-making, where the sensory information gathered by the UAV is exploited to produce the best action. Moreover, the UAV movement is modeled based on motion physics, where the actions correspond to linear velocities and heading angles. The proposed approach is compared with different benchmarks, where the results indicate that a more efficient, scalable, and adaptable localization is achieved, irrespective of the environment or source characteristics, without retraining.
引用
收藏
页数:16
相关论文
共 50 条
  • [31] Energy-Efficient Multidimensional Trajectory of UAV-Aided IoT Networks With Reinforcement Learning
    Silvirianti
    Shin, Soo Young
    IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (19): : 19214 - 19226
  • [32] Review of Navigation Methods for UAV-Based Parcel Delivery
    Dissanayaka, Didula
    Wanasinghe, Thumeera R.
    De Silva, Oscar
    Jayasiri, Awantha
    Mann, George K., I
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024, 21 (01) : 1068 - 1082
  • [33] Multiagent Deep Reinforcement Learning With Demonstration Cloning for Target Localization
    Alagha, Ahmed
    Mizouni, Rabeb
    Bentahar, Jamal
    Otrok, Hadi
    Singh, Shakti
    IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (15) : 13556 - 13570
  • [34] Target localization based on cross-view matching between UAV and satellite
    Ren, Kan
    Ding, Lei
    Wan, Minjie
    Gu, Guohua
    Chen, Qian
    CHINESE JOURNAL OF AERONAUTICS, 2022, 35 (09) : 333 - 341
  • [35] AoI-Aware Deep Reinforcement Learning Based UAV Path Planning for Defence Applications
    Kumari, Shilpi
    Sodhi, Eshaan
    Gupta, Dev
    Pratap, Ajay
    2024 IEEE SPACE, AEROSPACE AND DEFENCE CONFERENCE, SPACE 2024, 2024, : 230 - 234
  • [36] UAV-Based Terrain Modeling under Vegetation in the Chinese Loess Plateau: A Deep Learning and Terrain Correction Ensemble Framework
    Na, Jiaming
    Xue, Kaikai
    Xiong, Liyang
    Tang, Guoan
    Ding, Hu
    Strobl, Josef
    Pfeifer, Norbert
    REMOTE SENSING, 2020, 12 (20) : 1 - 18
  • [37] Fly, Wake-up, Find: UAV-based Energy-efficient Localization for Distributed Sensor Nodes
    Niculescu, Vlad
    Palossi, Daniele
    Magno, Michele
    Benini, Luca
    SUSTAINABLE COMPUTING-INFORMATICS & SYSTEMS, 2022, 34
  • [38] UAV Enhanced Target-Barrier Coverage Algorithm for Wireless Sensor Networks Based on Reinforcement Learning
    Li, Li
    Chen, Hongbin
    SENSORS, 2022, 22 (17)
  • [39] Toward Autonomous Multi-UAV Wireless Network: A Survey of Reinforcement Learning-Based Approaches
    Bai, Yu
    Zhao, Hui
    Zhang, Xin
    Chang, Zheng
    Jantti, Riku
    Yang, Kun
    IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2023, 25 (04): : 3038 - 3067
  • [40] UAV-Based Air Pollutant Source Localization Using Combined Metaheuristic and Probabilistic Methods
    Yungaicela-Naula, Noe
    Garza-Castanon, Luis E.
    Zhang, Youmin
    Minchala-Avila, Luis, I
    APPLIED SCIENCES-BASEL, 2019, 9 (18):