Reinforcement learning framework for UAV-based target localization applications

Cited by: 14
Authors
Shurrab, Mohammed [1]
Mizouni, Rabeb [1]
Singh, Shakti [1]
Otrok, Hadi [1]
Affiliations
[1] Khalifa Univ, Elect Engn & Comp Sci Dept, Abu Dhabi 127788, U Arab Emirates
Keywords
Target localization; Unmanned aerial vehicle (UAV); Reinforcement learning (RL); Deep Q-network (DQN); Data-driven; Deep reinforcement learning (DRL); Smart environmental monitoring (SEM); INTERNET; SYSTEM; THINGS; IOT;
DOI
10.1016/j.iot.2023.100867
Chinese Library Classification
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Smart environmental monitoring has gained prominence, and target localization is of utmost importance within it. Employing UAVs for localization tasks is appealing owing to their low cost, light weight, and high maneuverability. However, UAVs lack decision-making autonomy when faced with uncertain situations. Reinforcement learning (RL) can introduce intelligence to UAVs, allowing them to learn to act based on the presented situation. Existing works focus on UAV trajectory optimization, navigation, and target tracking. These methods are application-specific and cannot be adapted to localization tasks since they require prior knowledge of the target. Moreover, the current RL-based autonomous target localization systems are lacking because 1) they must keep track of all visited locations and their corresponding readings, 2) they require retraining when encountering new environments, and 3) they are not scalable, since the agent's movement is limited to slow speeds and to specific environments. Therefore, this work proposes a data-driven UAV target localization system based on Q-learning, which employs tabular methods to learn the optimal policy. A deep Q-network (DQN) is introduced to enhance the RL model and alleviate the curse of dimensionality. The proposed models enable smart decision-making, where the sensory information gathered by the UAV is exploited to produce the best action. Moreover, the UAV movement is modeled based on motion physics, where the actions correspond to linear velocities and heading angles. The proposed approach is compared with different benchmarks, and the results indicate that more efficient, scalable, and adaptable localization is achieved, irrespective of the environment or source characteristics, without retraining.
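The tabular Q-learning and velocity/heading action model mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the action set, the state encoding (here an arbitrary hashable label standing in for a discretized sensor reading), the time step `dt`, and the hyperparameters `alpha`, `gamma`, and `epsilon` are all assumptions chosen for illustration.

```python
import math
import random
from collections import defaultdict

# Hypothetical discrete action set: (linear velocity in m/s, heading angle in rad).
ACTIONS = [(v, math.radians(deg)) for v in (1.0, 5.0) for deg in (0, 90, 180, 270)]


def move(x, y, velocity, heading, dt=1.0):
    """Advance a planar position by one step of a simple kinematic model."""
    return (x + velocity * math.cos(heading) * dt,
            y + velocity * math.sin(heading) * dt)


def q_update(q, state, action, reward, next_state, alpha=0.1, gamma=0.9):
    """Apply one standard tabular Q-learning update in place; return the new value."""
    best_next = max(q[(next_state, a)] for a in range(len(ACTIONS)))
    td_target = reward + gamma * best_next
    q[(state, action)] += alpha * (td_target - q[(state, action)])
    return q[(state, action)]


def choose_action(q, state, epsilon=0.1, rng=random):
    """Epsilon-greedy selection over action indices."""
    if rng.random() < epsilon:
        return rng.randrange(len(ACTIONS))
    return max(range(len(ACTIONS)), key=lambda a: q[(state, a)])


# Q-table keyed by (state, action index); unseen entries default to 0.0.
q_table = defaultdict(float)
```

A DQN, as the abstract notes, would replace `q_table` with a neural network that maps the state to one Q-value per action, which avoids enumerating every state explicitly.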
Pages: 16
Related papers
50 items in total
  • [21] Hybrid Machine Learning and Reinforcement Learning Framework for Adaptive UAV Obstacle Avoidance
    Skarka, Wojciech
    Ashfaq, Rukhseena
    AEROSPACE, 2024, 11 (11)
  • [22] Reinforcement Learning-Based Security/Safety UAV System for Intrusion Detection Under Dynamic and Uncertain Target Movement
    Masadeh, Ala'eddin
    Alhafnawi, Mohannad
    Salameh, Haythem A. Bany
    Musa, Ahmed
    Jararweh, Yaser
    IEEE TRANSACTIONS ON ENGINEERING MANAGEMENT, 2024, 71 : 12498 - 12508
  • [23] UAV-Based Automatic Detection, Localization, and Cleaning of Bird Excrement on Solar Panels
    Huang, Yo-Ping
    Kshetrimayum, Satchidanand
    Sandnes, Frode Eika
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2025, 55 (03) : 1657 - 1670
  • [24] DroneSegNet: Robust Aerial Semantic Segmentation for UAV-Based IoT Applications
    Chakravarthy, Anirudh S.
    Sinha, Soumendu
    Narang, Pratik
    Mandal, Murari
    Chamola, Vinay
    Yu, F. Richard
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2022, 71 (04) : 4277 - 4286
  • [25] UAV-based automated 3D modeling framework using deep learning for building energy modeling
    Yoon, Jonghyeon
    Kim, Yeeun
    Lee, Sanghyo
    Shin, Minjae
    SUSTAINABLE CITIES AND SOCIETY, 2024, 101
  • [26] Data acquisition and analysis methods in UAV-based applications for Precision Agriculture
    Tsouros, Dimosthenis C.
    Triantafyllou, Anna
    Bibi, Stamatia
    Sarigannidis, Panagiotis G.
    2019 15TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING IN SENSOR SYSTEMS (DCOSS), 2019 : 377 - 384
  • [27] Consumer Personalized Gesture Recognition in UAV-Based Industry 5.0 Applications
    Paikrao, Pavan
    Routray, Sidheswar
    Mukherjee, Amrit
    Khan, Ahmad Raza
    Vohnout, Rudolf
    IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2023, 69 (04) : 842 - 849
  • [28] Integrated UAV-Based Real-Time Mapping for Security Applications
    Hein, Daniel
    Kraft, Thomas
    Brauchle, Joerg
    Berger, Ralf
    ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2019, 8 (05)
  • [29] A Methodological Framework for Free and Open-Source UAV-Based Archaeological Research
    Reese, Kelsey M.
    Field, Sean
    ADVANCES IN ARCHAEOLOGICAL PRACTICE, 2021, 9 (04) : 394 - 401
  • [30] Coordinated Multi-Agent Deep Reinforcement Learning for Energy-Aware UAV-Based Big-Data Platforms
    Jung, Soyi
    Yun, Won Joon
    Kim, Joongheon
    Kim, Jae-Hyun
    ELECTRONICS, 2021, 10 (05) : 1 - 15