Teacher-Student Model Using Grounding DINO and You Only Look Once for Multi-Sensor-Based Object Detection

被引:0
|
作者
Son, Jinhwan [1 ]
Jung, Heechul [1 ]
机构
[1] Kyungpook Natl Univ, Dept Artificial Intelligence, Daegu 41566, South Korea
来源
APPLIED SCIENCES-BASEL | 2024年 / 14卷 / 06期
关键词
deep learning; computer vision; object detection; auto-labeling;
D O I
10.3390/app14062232
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Object detection is a crucial research topic in the fields of computer vision and artificial intelligence, involving the identification and classification of objects within images. Recent advancements in deep learning technologies, such as YOLO (You Only Look Once), Faster-R-CNN, and SSDs (Single Shot Detectors), have demonstrated high performance in object detection. This study utilizes the YOLOv8 model for real-time object detection in environments requiring fast inference speeds, specifically in CCTV and automotive dashcam scenarios. Experiments were conducted using the 'Multi-Image Identical Situation and Object Identification Data' provided by AI Hub, consisting of multi-image datasets captured in identical situations using CCTV, dashcams, and smartphones. Object detection experiments were performed on three types of multi-image datasets captured in identical situations. Despite the utility of YOLO, there is a need for performance improvement in the AI Hub dataset. Grounding DINO, a zero-shot object detector with a high mAP performance, is employed. While efficient auto-labeling is possible with Grounding DINO, its processing speed is slower than YOLO, making it unsuitable for real-time object detection scenarios. This study conducts object detection experiments using publicly available labels and utilizes Grounding DINO as a teacher model for auto-labeling. The generated labels are then used to train YOLO as a student model, and performance is compared and analyzed. Experimental results demonstrate that using auto-generated labels for object detection does not lead to degradation in performance. The combination of auto-labeling and manual labeling significantly enhances performance. Additionally, an analysis of datasets containing data from various devices, including CCTV, dashcams, and smartphones, reveals the impact of different device types on the recognition accuracy for distinct devices. Through Grounding DINO, this study proves the efficacy of auto-labeling technology in contributing to efficiency and performance enhancement in the field of object detection, presenting practical applicability.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] Two-stage method based on the you only look once framework and image segmentation for crack detection in concrete structures
    Mayank Mishra
    Vipul Jain
    Saurabh Kumar Singh
    Damodar Maity
    Architecture, Structures and Construction, 2023, 3 (4): : 429 - 446
  • [22] A Detailed Comparative Analysis of You Only Look Once-Based Architectures for the Detection of Personal Protective Equipment on Construction Sites
    Elesawy, Abdelrahman
    Abdelkader, Eslam Mohammed
    Osman, Hesham
    ENG, 2024, 5 (01): : 347 - 366
  • [23] A Semi-Supervised Object Detection Algorithm Based on Teacher-Student Models with Strong-Weak Heads
    Cai, Xiaowei
    Luo, Fuyi
    Qi, Wei
    Liu, Hong
    ELECTRONICS, 2022, 11 (23)
  • [24] Heavy Equipment Detection on Construction Sites Using You Only Look Once (YOLO-Version 10) with Transformer Architectures
    Eum, Ikchul
    Kim, Jaejun
    Wang, Seunghyeon
    Kim, Juhyung
    APPLIED SCIENCES-BASEL, 2025, 15 (05):
  • [25] Modified You Only Look Once Network Model for Enhanced Traffic Scene Detection Performance for Small Targets
    Shi, Lei
    Ren, Shuai
    Fan, Xing
    Wang, Ke
    Lin, Shan
    Liu, Zhanwen
    IET IMAGE PROCESSING, 2025, 19 (01)
  • [26] A lightweight model based on you only look once for pomegranate before fruit thinning in complex environment
    Du, Yurong
    Han, Youpan
    Su, Yaoheng
    Wang, Jiuxin
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 137
  • [27] YOLBO: You Only Look Back Once-A Low Latency Object Tracker Based on YOLO and Optical Flow
    Kaputa, Daniel S.
    Landy, Brian P.
    IEEE ACCESS, 2021, 9 : 82497 - 82507
  • [28] You Only Look at Interested Cells: Real-Time Object Detection Based on Cell-Wise Segmentation
    Su, Kai
    Wang, Huitao
    Chowdhury, Intisar Md
    Zhao, Qiangfu
    Tomioka, Yoichi
    2020 11TH INTERNATIONAL CONFERENCE ON AWARENESS SCIENCE AND TECHNOLOGY (ICAST), 2020,
  • [29] Identification of Human Sperm based on Morphology Using the You Only Look Once Version 4 Algorithm
    Aristoteles
    Syarif, Admi
    Sutyarso
    Lumbanraja, Favorisen R.
    Hidayatullah, Arbi
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (07) : 424 - 431
  • [30] Defect Detection in Printed Circuit Boards Using You-Only-Look-Once Convolutional Neural Networks
    Adibhatla, Venkat Anil
    Chih, Huan-Chuang
    Hsu, Chi-Chang
    Cheng, Joseph
    Abbod, Maysam F.
    Shieh, Jiann-Shing
    ELECTRONICS, 2020, 9 (09)