Contrastive-based YOLOv7 for personal protective equipment detection

被引:9
作者
Samma, Hussein [1 ]
Al-Azani, Sadam [1 ]
Luqman, Hamzah [1 ,2 ]
Alfarraj, Motaz [1 ,2 ,3 ]
机构
[1] King Fahd Univ Petr & Minerals, SDAIA KFUPM Joint Res Ctr Artificial Intelligence, Dhahran, Saudi Arabia
[2] King Fahd Univ Petr & Minerals, Informat & Comp Sci Dept, Dhahran, Saudi Arabia
[3] King Fahd Univ Petr & Minerals, Elect Engn Dept, Dhahran, Saudi Arabia
关键词
Contrastive learning; YOLO; Object detection; CHV dataset; PPE;
D O I
10.1007/s00521-023-09212-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
You only look once (YOLO) is a state-of-the-art object detection model which has a novel architecture that balances model complexity with the inference time. Among YOLO versions, YOLOv7 has a lightweight backbone network called E-ELAN that allows it to learn more efficiently without affecting the gradient path. However, YOLOv7 models face classification difficulties when dealing with classes that have a similar shape and texture like personal protective equipment (PPE). In other words, the Glass versus NoGlass PPE objects almost appear similar when the image is captured at a distance. To mitigate this issue and further improve the classification performance of YOLOv7, a modified version called the contrastive-based model is introduced in this work. The basic concept is that a contrast loss branch function has been added, which assists the YOLOv7 model in differentiating and pushing instances from different classes in the embedding space. To validate the effectiveness of the implemented contrastive-based YOLO, it has been evaluated on two different datasets which are CHV and our own indoor collected dataset named JRCAI. The dataset contains 12 different types of PPE classes. Notably, we have annotated both datasets for the studied 12 PPE objects. The experimental results showed that the proposed model outperforms the standard YOLOv7 model by 2% in mAP@0.5 measure. Furthermore, the proposed model outperformed other YOLO variants as well as cutting-edge object detection models such as YOLOv8, Faster-RCNN, and DAB-DETR.
引用
收藏
页码:2445 / 2457
页数:13
相关论文
共 42 条
[1]  
Chen T, 2020, PR MACH LEARN RES, V119
[2]   A lightweight face-assisted object detection model for welding helmet use [J].
Chen, Weiming ;
Li, Changfan ;
Guo, Hailin .
EXPERT SYSTEMS WITH APPLICATIONS, 2023, 221
[3]   Exploring Simple Siamese Representation Learning [J].
Chen, Xinlei ;
He, Kaiming .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :15745-15753
[4]   Real-time detection algorithm of helmet and reflective vest based on improved YOLOv5 [J].
Chen, Zhihua ;
Zhang, Fan ;
Liu, Hongbo ;
Wang, Longxuan ;
Zhang, Qian ;
Guo, Liulu .
JOURNAL OF REAL-TIME IMAGE PROCESSING, 2023, 20 (01)
[5]   YOLOWeeds: A novel benchmark of YOLO object detectors for multi-class weed detection in cotton production systems [J].
Dang, Fengying ;
Chen, Dong ;
Lu, Yuzhen ;
Li, Zhaojian .
COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2023, 205
[6]  
Grill J., 2020, ADV NEURAL INFORM PR, V33, P21271
[7]   Self-supervised audiovisual representation learning for remote sensing data [J].
Heidler, Konrad ;
Mou, Lichao ;
Hu, Di ;
Jin, Pu ;
Li, Guangyao ;
Gan, Chuang ;
Wen, Ji-Rong ;
Zhu, Xiao Xiang .
INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2023, 116
[8]   Personal Protection Equipment detection system for embedded devices based on DNN and Fuzzy Logic [J].
Iannizzotto, Giancarlo ;
Lo Bello, Lucia ;
Patti, Gaetano .
EXPERT SYSTEMS WITH APPLICATIONS, 2021, 184
[9]  
Jocher G., 2023, YOLO ULTRALYTICS
[10]   Momentum Contrast for Unsupervised Visual Representation Learning [J].
He, Kaiming ;
Fan, Haoqi ;
Wu, Yuxin ;
Xie, Saining ;
Girshick, Ross .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :9726-9735