Two-Stream Spatial-Temporal Graph Convolutional Networks for Driver Drowsiness Detection

Cited by: 34
Authors
Bai, Jing [1, 2]
Yu, Wentao [1, 2]
Xiao, Zhu [3]
Havyarimana, Vincent [3, 4]
Regan, Amelia C. [5]
Jiang, Hongbo [3]
Jiao, Licheng [1, 2]
Affiliations
[1] Xidian Univ, Sch Artificial Intelligence, Xian 710071, Peoples R China
[2] Xidian Univ, Key Lab Intelligent Percept & Image Understanding, Minist Educ, Xian 710071, Peoples R China
[3] Hunan Univ, Coll Comp Sci & Elect Engn, Changsha 410082, Peoples R China
[4] Ecole Normale Super, Dept Appl Sci, Bujumbura 6983, Burundi
[5] Univ Calif Irvine, Dept Comp Sci & Inst Transportat Studies, Irvine, CA 92697 USA
Funding
National Natural Science Foundation of China
Keywords
Feature extraction; Faces; Mouth; Brain modeling; Vehicles; Videos; Support vector machines; Driver drowsiness detection; facial landmark detection; graph convolution networks (GCNs); TIME FATIGUE DETECTION; SYSTEM; ALERTNESS; STATE; EEG;
DOI
10.1109/TCYB.2021.3110813
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Convolutional neural networks (CNNs) have achieved remarkable performance in driver drowsiness detection based on the extraction of deep features of drivers' faces. However, the performance of driver drowsiness detection methods decreases sharply when complications, such as illumination changes in the cab, occlusions and shadows on the driver's face, and variations in the driver's head pose, occur. In addition, current driver drowsiness detection methods are not capable of distinguishing between driver states, such as talking versus yawning or blinking versus closing eyes. Therefore, technical challenges remain in driver drowsiness detection. In this article, we propose a novel and robust two-stream spatial-temporal graph convolutional network (2s-STGCN) for driver drowsiness detection to solve the above-mentioned challenges. To take advantage of the spatial and temporal features of the input data, we use a facial landmark detection method to extract the driver's facial landmarks from real-time videos and then obtain the driver drowsiness detection result by 2s-STGCN. Unlike existing methods, our proposed method uses videos rather than consecutive video frames as processing units. This is the first effort to exploit these processing units in the field of driver drowsiness detection. Moreover, the two-stream framework not only models both the spatial and temporal features but also models both the first-order and second-order information simultaneously, thereby notably improving driver drowsiness detection. Extensive experiments have been performed on the yawn detection dataset (YawDD) and the National TsingHua University drowsy driver detection (NTHU-DDD) dataset. The experimental results validate the feasibility of the proposed method. This method achieves an average accuracy of 93.4% on the YawDD dataset and an average accuracy of 92.7% on the evaluation set of the NTHU-DDD dataset.
Pages: 13821-13833
Page count: 13
Related papers
50 in total
  • [41] Source detection on networks using spatial temporal graph convolutional networks
    Sha, Hao
    Al Hasan, Mohammad
    Mohler, George
    2021 IEEE 8TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (DSAA), 2021,
  • [42] Two-Stream Adaptive Graph Convolutional Networks for Skeleton-Based Action Recognition
    Shi, Lei
    Zhang, Yifan
    Cheng, Jian
    Lu, Hanqing
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 12018 - 12027
  • [43] Fake visual content detection using two-stream convolutional neural networks
    Bilal Yousaf
    Muhammad Usama
    Waqas Sultani
    Arif Mahmood
    Junaid Qadir
    Neural Computing and Applications, 2022, 34 : 7991 - 8004
  • [44] A multi-aware graph convolutional network for driver drowsiness detection
    Lin, Liang
    Wang, Song
    Yang, Jucheng
    Wei, Feng
    KNOWLEDGE-BASED SYSTEMS, 2024, 305
  • [45] Crowd abnormal detection using two-stream Fully Convolutional Neural Networks
    Wei, Hongtao
    Xiao, Yao
    Li, Ruifang
    Liu, Xinhua
    2018 10TH INTERNATIONAL CONFERENCE ON MEASURING TECHNOLOGY AND MECHATRONICS AUTOMATION (ICMTMA), 2018, : 332 - 336
  • [46] GraphSleepNet: Adaptive Spatial-Temporal Graph Convolutional Networks for Sleep Stage Classification
    Jia, Ziyu
    Lin, Youfang
    Wang, Jing
    Zhou, Ronghao
    Ning, Xiaojun
    He, Yuanlai
    Zhao, Yaoshuai
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 1324 - 1330
  • [47] Video Action Recognition by Combining Spatial-Temporal Cues with Graph Convolutional Networks
    Li, Tao
    Xiong, Wenjun
    Zhang, Zheng
    Pei, Lishen
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2023,
  • [49] Adaptive Spatial-Temporal Fusion Graph Convolutional Networks for Traffic Flow Forecasting
    Li, Senwen
    Ge, Liang
    Lin, Yongquan
    Zeng, Bo
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [50] Attention-Based Two-Stream Convolutional Networks for Face Spoofing Detection
    Chen, Haonan
    Hu, Guosheng
    Lei, Zhen
    Chen, Yaowu
    Robertson, Neil M.
    Li, Stan Z.
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2020, 15 : 578 - 593