Teacher-Student Model Using Grounding DINO and You Only Look Once for Multi-Sensor-Based Object Detection

Authors
Son, Jinhwan [1]
Jung, Heechul [1]
Affiliation
[1] Kyungpook Natl Univ, Dept Artificial Intelligence, Daegu 41566, South Korea
Source
APPLIED SCIENCES-BASEL | 2024 / Volume 14 / Issue 06
Keywords
deep learning; computer vision; object detection; auto-labeling
DOI
10.3390/app14062232
Chinese Library Classification
O6 [Chemistry];
Discipline classification code
0703;
Abstract
Object detection, the identification and classification of objects within images, is a central research topic in computer vision and artificial intelligence. Deep learning detectors such as YOLO (You Only Look Once), Faster R-CNN, and SSD (Single Shot Detector) have demonstrated high performance on this task. This study uses the YOLOv8 model for real-time object detection in environments that require fast inference, specifically CCTV and automotive dashcam scenarios. Experiments were conducted on the 'Multi-Image Identical Situation and Object Identification Data' provided by AI Hub, which consists of three types of multi-image data captured in identical situations with CCTV, dashcams, and smartphones. Despite the utility of YOLO, its performance on the AI Hub dataset leaves room for improvement. Grounding DINO, a zero-shot object detector with high mAP, is therefore employed. Grounding DINO makes efficient auto-labeling possible, but its inference is slower than YOLO's, which makes it unsuitable for real-time object detection. This study conducts object detection experiments with the publicly available labels and also uses Grounding DINO as a teacher model for auto-labeling. The generated labels are used to train YOLO as a student model, and the resulting performance is compared and analyzed. The experimental results show that training on auto-generated labels does not degrade detection performance, and that combining auto-generated and manual labels significantly improves it. In addition, an analysis of the data from the different devices (CCTV, dashcams, and smartphones) reveals how device type affects recognition accuracy on each device. Through Grounding DINO, this study demonstrates that auto-labeling can improve both efficiency and performance in object detection and has practical applicability.
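The auto-labeling step described in the abstract can be illustrated with a minimal sketch. It assumes the teacher model has already produced detections as pixel-coordinate boxes with a class name and a confidence score, and it writes them in the normalized `class x_center y_center width height` text format that YOLO training commonly expects. The function name `to_yolo_labels`, the tuple layout, and the 0.35 score threshold are hypothetical choices for illustration; none of them are taken from the paper.

```python
from typing import Dict, List, Tuple

# Hypothetical teacher output: (class_name, score, x1, y1, x2, y2), boxes in pixels.
Detection = Tuple[str, float, float, float, float, float]


def to_yolo_labels(
    detections: List[Detection],
    image_width: int,
    image_height: int,
    class_ids: Dict[str, int],
    min_score: float = 0.35,  # assumed threshold, not a value from the paper
) -> List[str]:
    """Convert teacher detections into YOLO-format label lines."""
    lines = []
    for name, score, x1, y1, x2, y2 in detections:
        # Skip low-confidence boxes and classes the student is not trained on.
        if score < min_score or name not in class_ids:
            continue
        # Order the corners and clamp the box to the image bounds.
        left = max(0.0, min(x1, x2))
        right = min(float(image_width), max(x1, x2))
        top = max(0.0, min(y1, y2))
        bottom = min(float(image_height), max(y1, y2))
        if right <= left or bottom <= top:
            continue  # box is empty after clamping
        # Normalize center and size to the [0, 1] range.
        cx = (left + right) / 2 / image_width
        cy = (top + bottom) / 2 / image_height
        w = (right - left) / image_width
        h = (bottom - top) / image_height
        lines.append(f"{class_ids[name]} {cx:.6f} {cy:.6f} {w:.6f} {h:.6f}")
    return lines


if __name__ == "__main__":
    example = [("car", 0.9, 100, 50, 300, 150), ("person", 0.2, 0, 0, 10, 10)]
    for line in to_yolo_labels(example, 400, 200, {"car": 0, "person": 1}):
        print(line)
```

In this sketch the low-confidence "person" box is dropped, so only the "car" line is produced. Each image's lines would then be saved to a text file alongside the image before training the student model.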
Pages: 13