Teacher-Student Model Using Grounding DINO and You Only Look Once for Multi-Sensor-Based Object Detection

被引:0
|
作者
Son, Jinhwan [1 ]
Jung, Heechul [1 ]
机构
[1] Kyungpook Natl Univ, Dept Artificial Intelligence, Daegu 41566, South Korea
来源
APPLIED SCIENCES-BASEL | 2024年 / 14卷 / 06期
关键词
deep learning; computer vision; object detection; auto-labeling;
D O I
10.3390/app14062232
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Object detection is a crucial research topic in the fields of computer vision and artificial intelligence, involving the identification and classification of objects within images. Recent advancements in deep learning technologies, such as YOLO (You Only Look Once), Faster-R-CNN, and SSDs (Single Shot Detectors), have demonstrated high performance in object detection. This study utilizes the YOLOv8 model for real-time object detection in environments requiring fast inference speeds, specifically in CCTV and automotive dashcam scenarios. Experiments were conducted using the 'Multi-Image Identical Situation and Object Identification Data' provided by AI Hub, consisting of multi-image datasets captured in identical situations using CCTV, dashcams, and smartphones. Object detection experiments were performed on three types of multi-image datasets captured in identical situations. Despite the utility of YOLO, there is a need for performance improvement in the AI Hub dataset. Grounding DINO, a zero-shot object detector with a high mAP performance, is employed. While efficient auto-labeling is possible with Grounding DINO, its processing speed is slower than YOLO, making it unsuitable for real-time object detection scenarios. This study conducts object detection experiments using publicly available labels and utilizes Grounding DINO as a teacher model for auto-labeling. The generated labels are then used to train YOLO as a student model, and performance is compared and analyzed. Experimental results demonstrate that using auto-generated labels for object detection does not lead to degradation in performance. The combination of auto-labeling and manual labeling significantly enhances performance. Additionally, an analysis of datasets containing data from various devices, including CCTV, dashcams, and smartphones, reveals the impact of different device types on the recognition accuracy for distinct devices. Through Grounding DINO, this study proves the efficacy of auto-labeling technology in contributing to efficiency and performance enhancement in the field of object detection, presenting practical applicability.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] Anomaly Detection for Medical Images Using Teacher-Student Model With Skip Connections and Multiscale Anomaly Consistency
    Liu, Mingxuan
    Jiao, Yunrui
    Lu, Jingqiao
    Chen, Hong
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73 : 1 - 15
  • [32] Deep Learning Model Based on You Only Look Once Algorithm for Detection and Visualization of Fracture Areas in Three-Dimensional Skeletal Images
    Jeon, Young-Dae
    Kang, Min-Jun
    Kuh, Sung-Uk
    Cha, Ha-Yeong
    Kim, Moo-Sub
    You, Ju-Yeon
    Kim, Hyeon-Joo
    Shin, Seung-Han
    Chung, Yang-Guk
    Yoon, Do-Kun
    DIAGNOSTICS, 2024, 14 (01)
  • [33] Detection of Aortic Dissection and Intramural Hematoma in Non-Contrast Chest Computed Tomography Using a You Only Look Once-Based Deep Learning Model
    Kim, Yu-Seop
    Kim, Jae Guk
    Choi, Hyun Young
    Lee, Dain
    Kong, Jin-Woo
    Kang, Gu Hyun
    Jang, Yong Soo
    Kim, Wonhee
    Lee, Yoonje
    Kim, Jihoon
    Shin, Dong Geum
    Park, Jae Keun
    Lee, Gayoung
    Kim, Bitnarae
    JOURNAL OF CLINICAL MEDICINE, 2024, 13 (22)
  • [34] RDD-YOLO: Road Damage Detection Algorithm Based on Improved You Only Look Once Version 8
    Li, Yue
    Yin, Chang
    Lei, Yutian
    Zhang, Jiale
    Yan, Yiting
    APPLIED SCIENCES-BASEL, 2024, 14 (08):
  • [35] Insulator defect detection based on improved you-only-look-once v4 in complex scenarios
    Wang, Suzhen
    Xu, Hao
    Shao, Mingwei
    Zhao, Lin
    JOURNAL OF ELECTRONIC IMAGING, 2023, 32 (02)
  • [36] Internal Thread Defect Generation Algorithm and Detection System Based on Generative Adversarial Networks and You Only Look Once
    Jiang, Zhihao
    Dou, Xiaohan
    Liu, Xiaolong
    Xue, Chengqi
    Wang, Anqi
    Zhang, Gengpei
    SENSORS, 2024, 24 (17)
  • [37] Applying deep learning to defect detection in printed circuit boards via a newest model of you-only-look-once
    Adibhatla, Venkat Anil
    Chih, Huan-Chuang
    Hsu, Chi-Chang
    Cheng, Joseph
    Abbod, Maysam F.
    Shieh, Jiann-Shing
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2021, 18 (04) : 4411 - 4428
  • [38] An in situ surface defect detection method based on improved you only look once algorithm for wire and arc additive manufacturing
    Wu, Jun
    Huang, Cheng
    Li, Zili
    Li, Runsheng
    Wang, Guilan
    Zhang, Haiou
    RAPID PROTOTYPING JOURNAL, 2023, 29 (05) : 910 - 920
  • [39] A high-precision and efficient method for badminton action detection in sports using You Only Look Once with Hourglass Network
    Yang, Wenwen
    Jiang, Minlan
    Fang, Xiaosheng
    Shi, Xiaowei
    Guo, Yizheng
    Al-qaness, Mohammed A. A.
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 137
  • [40] Real-time tunnel lining crack detection based on an improved You Only Look Once version X algorithm
    Zhou, Zhong
    Yan, Longbin
    Zhang, Junjie
    Yang, Hao
    GEORISK-ASSESSMENT AND MANAGEMENT OF RISK FOR ENGINEERED SYSTEMS AND GEOHAZARDS, 2023, 17 (01) : 181 - 195