Deep learning assisted real-time object recognition and depth estimation for enhancing emergency response in adaptive environment

被引:1
|
作者
Faseeh, Muhammad [1 ,2 ]
Bibi, Misbah [2 ,3 ]
Khan, Murad Ali [2 ,3 ]
Kim, Do-Hyeun [2 ,3 ]
机构
[1] Jeju Natl Univ, Dept Elect Engn, Jeju 63243, South Korea
[2] Jeju Natl Univ, Big Data Res Ctr, Dept Comp Engn, Jeju 63243, South Korea
[3] Jeju Natl Univ, Dept Comp Engn, Jeju 63243, South Korea
关键词
Object detection; Depth estimation; Real-time systems; Deep learning; Temporal information; LSTM; Yolo;
D O I
10.1016/j.rineng.2024.103482
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Accurate long-range object recognition is essential in autonomous navigation and military surveillance applications. While recent advancements have improved real-time recognition, existing models, especially those focused on monocular depth estimation, face accuracy challenges due to supervised Deep Learning (DL) limitations. This study presents a robust, real-time military object recognition system that leverages temporal sequences and attention mechanisms for enhanced depth estimation. Using RGB frames along depth maps from the KITTI and synthetics dataset, along with a fine-tuned YOLOv11 model, our system achieves a Root Mean Squared Error (RMSE) of 1.24 meters, and RMSE (log) of 0.18 in-depth estimation, with object detection adequate up to 250 meters.The model maintains high precision (96.4%), recall (93.67%),and F1 score (93.33%) across various ranges, confirming YOLOv11's accuracy with an average inference time of 13 ms for short-range and 17 ms for long-range detection. These results highlight the system's potential for deployment in real-time military and adaptive response scenarios, outperforming existing models in both accuracy and computational efficiency.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Depth-assisted rectification for real-time object detection and pose estimation
    Silva do Monte Lima, Joao Paulo
    Magalhaes Simoes, Francisco Paulo
    Uchiyama, Hideaki
    Teichrieb, Veronica
    Marchand, Eric
    MACHINE VISION AND APPLICATIONS, 2016, 27 (02) : 193 - 219
  • [2] Depth-assisted rectification for real-time object detection and pose estimation
    João Paulo Silva do Monte Lima
    Francisco Paulo Magalhães Simões
    Hideaki Uchiyama
    Veronica Teichrieb
    Eric Marchand
    Machine Vision and Applications, 2016, 27 : 193 - 219
  • [3] Real-Time Deep Learning-Based Object Recognition in Augmented Reality
    Egipko, V
    Zhdanova, M.
    Gapon, N.
    Voronin, V.
    Semenishchev, E.
    REAL-TIME PROCESSING OF IMAGE, DEPTH, AND VIDEO INFORMATION 2024, 2024, 13000
  • [4] Real-Time Object Detection and Recognition on Low-Compute Humanoid Robots using Deep Learning
    Chatterjee, Sayantan
    Zunjani, Faheem H.
    Nandi, Gora C.
    2020 6TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND ROBOTICS (ICCAR), 2020, : 202 - 208
  • [5] Platooning control of drones with real-time deep learning object detection
    Dai, Xin
    Nagahara, Masaaki
    ADVANCED ROBOTICS, 2023, 37 (03) : 220 - 225
  • [6] Real-time multiple object tracking using deep learning methods
    Meimetis, Dimitrios
    Daramouskas, Ioannis
    Perikos, Isidoros
    Hatzilygeroudis, Ioannis
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (01) : 89 - 118
  • [7] Deep Learning Based, Real-Time Object Detection for Autonomous Driving
    Akyol, Gamze
    Kantarci, Alperen
    Celik, Ali Eren
    Ak, Abdullah Cihan
    2020 28TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2020,
  • [8] Real-Time Deep Learning-Based Object Detection Framework
    Tarimo, William
    Sabra, Moustafa M.
    Hendre, Shonan
    2020 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2020, : 1829 - 1836
  • [9] Real-time multiple object tracking using deep learning methods
    Dimitrios Meimetis
    Ioannis Daramouskas
    Isidoros Perikos
    Ioannis Hatzilygeroudis
    Neural Computing and Applications, 2023, 35 : 89 - 118
  • [10] Exploration of deep learning architectures for real-time yoga pose recognition
    Saurav, Sumeet
    Gidde, Prashant
    Singh, Sanjay
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (34) : 81621 - 81663