Two-Stream Spatial-Temporal Graph Convolutional Networks for Driver Drowsiness Detection

Cited by: 34
Authors
Bai, Jing [1 ,2 ]
Yu, Wentao [1 ,2 ]
Xiao, Zhu [3 ]
Havyarimana, Vincent [3 ,4 ]
Regan, Amelia C. [5 ]
Jiang, Hongbo [3 ]
Jiao, Licheng [1 ,2 ]
Affiliations
[1] Xidian Univ, Sch Artificial Intelligence, Xian 710071, Peoples R China
[2] Xidian Univ, Key Lab Intelligent Percept & Image Understanding, Minist Educ, Xian 710071, Peoples R China
[3] Hunan Univ, Coll Comp Sci & Elect Engn, Changsha 410082, Peoples R China
[4] Ecole Normale Super, Dept Appl Sci, Bujumbura 6983, Burundi
[5] Univ Calif Irvine, Dept Comp Sci & Inst Transportat Studies, Irvine, CA 92697 USA
Funding
National Natural Science Foundation of China
Keywords
Feature extraction; Faces; Mouth; Brain modeling; Vehicles; Videos; Support vector machines; Driver drowsiness detection; facial landmark detection; graph convolution networks (GCNs); TIME FATIGUE DETECTION; SYSTEM; ALERTNESS; STATE; EEG;
DOI
10.1109/TCYB.2021.3110813
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Convolutional neural networks (CNNs) have achieved remarkable performance in driver drowsiness detection based on the extraction of deep features of drivers' faces. However, the performance of driver drowsiness detection methods decreases sharply when complications, such as illumination changes in the cab, occlusions and shadows on the driver's face, and variations in the driver's head pose, occur. In addition, current driver drowsiness detection methods are not capable of distinguishing between driver states, such as talking versus yawning or blinking versus closing eyes. Therefore, technical challenges remain in driver drowsiness detection. In this article, we propose a novel and robust two-stream spatial-temporal graph convolutional network (2s-STGCN) for driver drowsiness detection to solve the above-mentioned challenges. To take advantage of the spatial and temporal features of the input data, we use a facial landmark detection method to extract the driver's facial landmarks from real-time videos and then obtain the driver drowsiness detection result by 2s-STGCN. Unlike existing methods, our proposed method uses videos rather than consecutive video frames as processing units. This is the first effort to exploit these processing units in the field of driver drowsiness detection. Moreover, the two-stream framework not only models both the spatial and temporal features but also models both the first-order and second-order information simultaneously, thereby notably improving driver drowsiness detection. Extensive experiments have been performed on the yawn detection dataset (YawDD) and the National TsingHua University drowsy driver detection (NTHU-DDD) dataset. The experimental results validate the feasibility of the proposed method. This method achieves an average accuracy of 93.4% on the YawDD dataset and an average accuracy of 92.7% on the evaluation set of the NTHU-DDD dataset.
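The abstract does not define "first-order" and "second-order" information. The sketch below assumes the convention common in two-stream skeleton GCNs: first-order data are the landmark coordinates themselves, and second-order data are difference vectors between connected landmarks. The edge list, the toy coordinates, the score values, and the sum-based late fusion are all hypothetical illustrations, not details taken from the paper.

```python
from typing import List, Sequence, Tuple

Point = Tuple[float, float]
Frame = List[Point]      # one (x, y) pair per facial landmark
Edge = Tuple[int, int]   # hypothetical landmark connection (source, target)


def bone_stream(frames: Sequence[Frame], edges: Sequence[Edge]) -> List[List[Point]]:
    """Assumed second-order stream: per-frame difference vectors along each edge."""
    return [
        [(frame[t][0] - frame[s][0], frame[t][1] - frame[s][1]) for s, t in edges]
        for frame in frames
    ]


def fuse_scores(joint_scores: Sequence[float], bone_scores: Sequence[float]) -> int:
    """Assumed late fusion: sum per-class scores and return the arg-max class index."""
    total = [j + b for j, b in zip(joint_scores, bone_scores)]
    return max(range(len(total)), key=total.__getitem__)


# Toy clip: two frames, three landmarks (coordinates are made up).
clip = [
    [(0.0, 0.0), (1.0, 0.0), (1.0, 2.0)],
    [(0.0, 0.0), (1.0, 0.5), (1.0, 3.0)],
]
edges = [(0, 1), (1, 2)]

joints = clip                      # assumed first-order stream: raw coordinates
bones = bone_stream(clip, edges)   # assumed second-order stream
predicted = fuse_scores([0.25, 0.75], [0.5, 0.5])  # made-up per-stream class scores
```

In a full model each stream would pass through its own spatial-temporal graph convolutional network before fusion; here the per-stream scores are simply given so the data flow stays visible.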
Pages: 13821 - 13833
Page count: 13
Related papers
50 in total
  • [1] Two-Stream Spatial–Temporal Transformer Networks for Driver Drowsiness Detection
    Jiang, Qianyi
    Xu, Huahu
    Cheng, Chen
    Journal of Computers (Taiwan), 2023, 34 (05) : 103 - 115
  • [2] Multi-scale spatial-temporal attention graph convolutional networks for driver fatigue detection
    Fa, Shuxiang
    Yang, Xiaohui
    Han, Shiyuan
    Feng, Zhiquan
    Chen, Yuehui
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2023, 93
  • [3] STA-GCN: two-stream graph convolutional network with spatial-temporal attention for hand gesture recognition
    Zhang, Wei
    Lin, Zeyi
    Cheng, Jian
    Ma, Cuixia
    Deng, Xiaoming
    Wang, Hongan
    VISUAL COMPUTER, 2020, 36 (10-12) : 2433 - 2444
  • [4] MSTN: Multistage Spatial-Temporal Network for Driver Drowsiness Detection
    Shih, Tun-Huai
    Hsu, Chiou-Ting
    COMPUTER VISION - ACCV 2016 WORKSHOPS, PT III, 2017, 10118 : 146 - 153
  • [5] Two-stream spatial-temporal neural networks for pose-based action recognition
    Wang, Zixuan
    Zhu, Aichun
    Hu, Fangqiang
    Wu, Qianyu
    Li, Yifeng
    JOURNAL OF ELECTRONIC IMAGING, 2020, 29 (04)
  • [6] Spatial-Temporal Attention Two-Stream Convolution Neural Network for Smoke Region Detection
    Ding, Zhipeng
    Zhao, Yaqin
    Li, Ao
    Zheng, Zhaoxiang
    FIRE-SWITZERLAND, 2021, 4 (04)
  • [7] Two-Stream Convolutional Networks for Hyperspectral Target Detection
    Zhu, Dehui
    Du, Bo
    Zhang, Liangpei
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2021, 59 (08) : 6907 - 6921
  • [8] Spatial-temporal graph convolutional networks for anomaly detection in multivariate time series
    Wang, Jing
    He, Miaomiao
    Ding, Jianli
    Li, Yonghua
    Xi'an Dianzi Keji Daxue Xuebao/Journal of Xidian University, 51 (03) : 170 - 181
  • [9] Depth Video-based Two-stream Convolutional Neural Networks for Driver Fatigue Detection
    Ma, Xiaoxi
    Chau, Lap-Pui
    Yap, Kim-Hui
    PROCEEDINGS OF THE 2017 INTERNATIONAL CONFERENCE ON ORANGE TECHNOLOGIES (ICOT), 2017 : 155 - 158
  • [10] Two-stream graph convolutional neural network fusion for weakly supervised temporal action detection
    Mengyao Zhao
    Zhengping Hu
    Shufang Li
    Shuai Bi
    Zhe Sun
    Signal, Image and Video Processing, 2022, 16 : 947 - 954