Development and validation of a deep learning-based algorithm for drowsiness detection in facial photographs

被引:7
作者
Husain, Syed Sameed [1 ]
Mir, Junaid [2 ]
Anwar, Syed Muhammad [3 ]
Rafique, Waqas [4 ]
Ullah, Muhammad Obaid [2 ]
机构
[1] Univ Surrey, Ctr Vis, Speech, Signal Proc, Guildford, Surrey, England
[2] Univ Engn & Technol Taxila, Dept Elect Engn, Taxila 47050, Pakistan
[3] Univ Engn & Technol Taxila, Dept Comp Engn, Taxila 47050, Pakistan
[4] Univ Oxford, Dept Engn Sci, Oxford, England
关键词
Drowsiness detection; Fatigue detection; Deep convolutional neural network; Parametric aggregation; CNN; FATIGUE; NETWORK; EEG;
D O I
10.1007/s11042-022-12433-x
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Drowsiness is a feeling of sleepiness before the sleep onset and has severe implications from a safety perspective for the individuals involved in industrial activities, mining, and driving. The state-of-the-art computer vision (CV) based drowsiness detection methods generally utilize multiple deep convolutional neural networks (DCNN) without investigating deep feature aggregation techniques for the drowsiness detection task. More importantly, the reported results are mostly based on acted drowsy data, making the utilization of models trained on such data highly arguable for detecting drowsiness in real-life situations. Towards ameliorating this, we first present a comprehensive real drowsy data curated from 50 subjects, where subjects are labeled as fresh or drowsy. Further, four DCNN models: Xception, ResNet101, InceptionV4, and ResNext101, are trained on our dataset using transfer learning to select a baseline model for our drowsiness detection method. Moreover, an experimental study is performed using five different pooling methods: global max, global average, generalized mean, region of interest, and Weibull activation, to compute a robust and discriminative global descriptor. Our results reveal that the parametric Weibull activation pooling is the best suited for aggregating deep convolutional features. Additionally, a low complexity model based on the MobileNetV2 is proposed for a deployable drowsiness detection solution in mobile devices. The detection accuracy of 93.80% and 90.50% is achieved using our proposed Weibull-based ResNext101 and MobileNetV2 models, respectively. Moreover, our results show that the proposed non-invasive method outperforms the polysomnography signals-based invasive drowsiness detection approach.
引用
收藏
页码:20425 / 20441
页数:17
相关论文
共 61 条
  • [1] Abtahi Shabnam, 2014, P 5 ACM MULT SYST C, P24, DOI DOI 10.1145/2557642.2563678
  • [2] Akin M, 2008, NEURAL COMPUT APPL, V17, P227, DOI 10.1007/S00521-007-0117-7
  • [3] Akrout B, 2013, MULTIMEDIA UBIQUITOU, P43, DOI 10.1007/978-94-007-6738-6_6
  • [4] Driver Drowsiness Detection Based on Steering Wheel Data Applying Adaptive Neuro-Fuzzy Feature Selection
    Arefnezhad, Sadegh
    Samiee, Sajjad
    Eichberger, Arno
    Nahvi, Ali
    [J]. SENSORS, 2019, 19 (04)
  • [5] AutoFER: PCA and PSO based automatic facial emotion recognition
    Arora, Malika
    Kumar, Munish
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (02) : 3039 - 3049
  • [6] Azizpour Hossein, 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), P36, DOI 10.1109/CVPRW.2015.7301270
  • [7] 2D object recognition: a comparative analysis of SIFT, SURF and ORB feature descriptors
    Bansal, Monika
    Kumar, Munish
    Kumar, Manish
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (12) : 18839 - 18857
  • [8] Byrnes A, 2018, IEEE INT C INTELL TR, P2092, DOI 10.1109/ITSC.2018.8569293
  • [9] Celona L., 2018, IEEE I C CONS ELECT, P1
  • [10] Driver Drowsiness Estimation Based on Factorized Bilinear Feature Fusion and a Long-Short-Term Recurrent Convolutional Network
    Chen, Shuang
    Wang, Zengcai
    Chen, Wenxin
    [J]. INFORMATION, 2021, 12 (01) : 1 - 15