A cascaded deep-learning-based model for face mask detection

Cited by: 6
Authors
Kumar, Akhil [1]
Affiliations
[1] Himachal Pradesh Univ, Dept Comp Sci, Shimla, India
Keywords
Face mask detection; ResNet34; YOLO; SPP layer; Deep learning; Recognition
DOI
10.1108/DTA-02-2022-0076
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Purpose - This work presents a deep learning model for face mask detection in surveillance environments such as automatic teller machines (ATMs) and banks, to identify persons wearing face masks. In such environments, complete visibility of the face area is a guideline, and criminals and law offenders commit crimes while hiding their faces behind a face mask. The proposed face mask detector can be integrated with surveillance cameras in autonomous surveillance environments as a tool to identify and catch law offenders and criminals.

Design/methodology/approach - The proposed face mask detector is developed by integrating the ResNet34 (residual network) feature extractor on top of three You Only Look Once (YOLO) detection layers, together with a spatial pyramid pooling (SPP) layer to extract a rich and dense feature map. Furthermore, at training time, data augmentation operations such as Mosaic and MixUp are applied to the feature extraction network so that it is trained with images of varying complexities. The detector is trained and tested on a custom face mask detection dataset consisting of 52,635 images. For validation, its performance is compared with that of YOLO v1, v2, tiny YOLO v1, v2, v3 and v4 and other benchmark work in the literature, using precision, recall, F1 score, mean average precision (mAP) for the overall dataset and average precision (AP) for each class of the dataset.

Findings - Compared with the tested baseline variants of YOLO, the proposed face mask detector achieved 4.75-9.75 per cent higher detection accuracy in terms of mAP, 5-31 per cent higher AP for detection of faces with masks and, specifically, 2-30 per cent higher AP for detection of face masks on the face region. Furthermore, the use of the ResNet34 feature extractor and the SPP layer reduced both the training time and the detection time. The proposed model performs detection on an image in 0.45 s, which is 0.15-0.2 s less than the other tested YOLO variants.

Research limitations/implications - The proposed face mask detector can be used as a tool to detect persons with face masks who are a potential threat to automatic surveillance environments such as ATMs, banks and airport security checks. A further research implication is that the model can be trained and tested for other object detection problems such as cancer detection in images, fish species detection and vehicle detection.

Practical implications - The proposed face mask detector can be integrated with automatic surveillance systems to detect persons with face masks who are potential threats to ATMs, banks, etc. In the present times of COVID-19, it can also be used to detect whether people in public areas are following the COVID-appropriate behavior of wearing a face mask.

Originality/value - The novelty of this work lies in the use of the ResNet34 feature extractor with YOLO detection layers, which makes the proposed model a compact and powerful convolutional neural-network-based face mask detector. Furthermore, the SPP layer is applied to the ResNet34 feature extractor to enable it to extract a rich and dense feature map. The other novelty is the implementation of Mosaic and MixUp data augmentation in the training network, which provided the feature extractor with 3x images of varying complexities and orientations and further aided in achieving higher detection accuracy. The proposed model is novel in terms of extracting rich features, performing augmentation at training time and achieving high detection accuracy while maintaining the detection speed.
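
The abstract reports results in terms of precision, recall, F1 score, per-class AP and mAP. As a reading aid, the following minimal Python sketch shows how these quantities relate to one another under their standard definitions. It is not the author's evaluation code: the function names, the class names and all numbers are hypothetical and chosen only for illustration, and the per-class AP values are assumed to have been computed elsewhere.

```python
from typing import Dict


def precision_recall_f1(tp: int, fp: int, fn: int) -> Dict[str, float]:
    """Compute precision, recall and F1 from detection counts.

    tp: detections matched to a ground-truth box (true positives)
    fp: detections with no matching ground-truth box (false positives)
    fn: ground-truth boxes that were missed (false negatives)
    """
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    denom = precision + recall
    # F1 is the harmonic mean of precision and recall.
    f1 = 2 * precision * recall / denom if denom else 0.0
    return {"precision": precision, "recall": recall, "f1": f1}


def mean_average_precision(ap_per_class: Dict[str, float]) -> float:
    """mAP is the unweighted mean of the per-class AP values."""
    if not ap_per_class:
        raise ValueError("need at least one class")
    return sum(ap_per_class.values()) / len(ap_per_class)


# Hypothetical counts and AP values, for illustration only (not from the paper).
scores = precision_recall_f1(tp=80, fp=20, fn=20)
example_map = mean_average_precision({"face_with_mask": 0.9, "mask_region": 0.7})
print(scores, example_map)
```

Because mAP is an average over classes, a gain in AP on a single class moves mAP by only a fraction of that gain, which is one reason the abstract's per-class AP ranges are wider than its mAP range.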
Pages: 84-107
Number of pages: 24
Related papers
50 in total
  • [1] A cascaded deep-learning-based model for face mask detection
    Kumar, Akhil
    DATA TECHNOLOGIES AND APPLICATIONS, 2022, : 1 - 24
  • [2] A Deep Learning Model for Face Mask Detection
    Abd El-Aziz, A. A.
    Azim, Nesrine A.
    Mahmood, Mahmood A.
    Alshammari, Hamoud
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2021, 21 (10): : 101 - 106
  • [3] An Enhanced Deep Learning Model for Automatic Face Mask Detection
    Ilyas, Qazi Mudassar
    Ahmad, Muneer
    INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2022, 31 (01) : 241 - 254
  • [4] Deep learning for face mask detection: a survey
    Sharma, Aanchal
    Gautam, Rahul
    Singh, Jaspal
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (22) : 34321 - 34361
  • [5] Deep learning for face mask detection: a survey
    Aanchal Sharma
    Rahul Gautam
    Jaspal Singh
    Multimedia Tools and Applications, 2023, 82 : 34321 - 34361
  • [6] Automatic Face Mask Detection Using Deep Learning
    Anderson, Stephanie
    Veeravenkatappa, Suma
    Pola, Priyanka
    Pouriyeh, Seyedamin
    Han, Meng
    26TH IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATIONS (IEEE ISCC 2021), 2021,
  • [7] An Efficient and Effective Deep Learning-Based Model for Real-Time Face Mask Detection
    Habib, Shabana
    Alsanea, Majed
    Aloraini, Mohammed
    Al-Rawashdeh, Hazim Saleh
    Islam, Muhammad
    Khan, Sheroz
    SENSORS, 2022, 22 (07)
  • [8] Deep Learning Based Face Mask Detection and Crowd Counting
    Amin, Prithvi N.
    Moghe, Sayali S.
    Prabhakar, Sparsh N.
    Nehete, Charusheela M.
    2021 6TH INTERNATIONAL CONFERENCE FOR CONVERGENCE IN TECHNOLOGY (I2CT), 2021,
  • [9] Deep-learning-based face detection using iterative bounding-box regression
    Luo, Dazhi
    Wen, Guihua
    Li, Danyang
    Hu, Yang
    Huan, Eryang
    MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (19) : 24663 - 24680
  • [10] FACE MASK DETECTION USING DEEP LEARNING
    Kodali, Ravi Kishore
    Dhanekula, Rekha
    2021 INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND INFORMATICS (ICCCI), 2021,