Two-Stream Spatial-Temporal Graph Convolutional Networks for Driver Drowsiness Detection

被引:34
|
作者
Bai, Jing [1 ,2 ]
Yu, Wentao [1 ,2 ]
Xiao, Zhu [3 ]
Havyarimana, Vincent [3 ,4 ]
Regan, Amelia C. [5 ]
Jiang, Hongbo [3 ]
Jiao, Licheng [1 ,2 ]
机构
[1] Xidian Univ, Sch Artificial Intelligence, Xian 710071, Peoples R China
[2] Xidian Univ, Key Lab Intelligent Percept & Image Understanding, Minist Educ, Xian 710071, Peoples R China
[3] Hunan Univ, Coll Comp Sci & Elect Engn, Changsha 410082, Peoples R China
[4] Ecole Normale Super, Dept Appl Sci, Bujumbura 6983, Burundi
[5] Univ Calif Irvine, Dept Comp Sci & Inst Transportat Studies, Irvine, CA 92697 USA
基金
中国国家自然科学基金;
关键词
Feature extraction; Faces; Mouth; Brain modeling; Vehicles; Videos; Support vector machines; Driver drowsiness detection; facial landmark detection; graph convolution networks (GCNs); TIME FATIGUE DETECTION; SYSTEM; ALERTNESS; STATE; EEG;
D O I
10.1109/TCYB.2021.3110813
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Convolutional neural networks (CNNs) have achieved remarkable performance in driver drowsiness detection based on the extraction of deep features of drivers' faces. However, the performance of driver drowsiness detection methods decreases sharply when complications, such as illumination changes in the cab, occlusions and shadows on the driver's face, and variations in the driver's head pose, occur. In addition, current driver drowsiness detection methods are not capable of distinguishing between driver states, such as talking versus yawning or blinking versus closing eyes. Therefore, technical challenges remain in driver drowsiness detection. In this article, we propose a novel and robust two-stream spatial-temporal graph convolutional network (2s-STGCN) for driver drowsiness detection to solve the above-mentioned challenges. To take advantage of the spatial and temporal features of the input data, we use a facial landmark detection method to extract the driver's facial landmarks from real-time videos and then obtain the driver drowsiness detection result by 2s-STGCN. Unlike existing methods, our proposed method uses videos rather than consecutive video frames as processing units. This is the first effort to exploit these processing units in the field of driver drowsiness detection. Moreover, the two-stream framework not only models both the spatial and temporal features but also models both the first-order and second-order information simultaneously, thereby notably improving driver drowsiness detection. Extensive experiments have been performed on the yawn detection dataset (YawDD) and the National TsingHua University drowsy driver detection (NTHU-DDD) dataset. The experimental results validate the feasibility of the proposed method. This method achieves an average accuracy of 93.4% on the YawDD dataset and an average accuracy of 92.7% on the evaluation set of the NTHU-DDD dataset.
引用
收藏
页码:13821 / 13833
页数:13
相关论文
共 50 条
  • [31] Driver Drowsiness Detection Using Convolutional Neural Networks
    Kepesiova, Zuzana
    Ciganek, Jan
    Kozak, Stefan
    PROCEEDINGS OF THE 2020 30TH INTERNATIONAL CONFERENCE CYBERNETICS & INFORMATICS (K&I '20), 2020,
  • [32] Topological Elastic Graph Convolutional Networks for Spatial-Temporal Sequence Forecasting
    Wang, Yiwen
    Xu, Meiling
    Tang, Lixin
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024,
  • [33] PGCN: Progressive Graph Convolutional Networks for Spatial-Temporal Traffic Forecasting
    Shin, Yuyol
    Yoon, Yoonjin
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (07) : 7633 - 7644
  • [34] Dynamic Spatial-Temporal Graph Convolutional Neural Networks for Traffic Forecasting
    Diao, Zulong
    Wang, Xin
    Zhang, Dafang
    Liu, Yingru
    Xie, Kun
    He, Shaoyao
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 890 - 897
  • [35] Spatial-Temporal Attention Mechanism and Graph Convolutional Networks for Destination Prediction
    Li, Cong
    Zhang, Huyin
    Wang, Zengkai
    Wu, Yonghao
    Yang, Fei
    FRONTIERS IN NEUROROBOTICS, 2022, 16
  • [36] Orthogonal Spatial-Temporal Graph Convolutional Networks for Traffic Flow Forecasting
    Fei, Yanhong
    Hu, Ming
    Wei, Xian
    Chen, Mingsong
    2022 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2022, : 71 - 76
  • [37] Forecasting traffic flow with spatial-temporal convolutional graph attention networks
    Zhang, Xiyue
    Xu, Yong
    Shao, Yizhen
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (18): : 15457 - 15479
  • [38] Context based spatial-temporal graph convolutional networks for traffic prediction
    Jia, Chaolong
    Zhang, Wenjing
    He, Yumei
    Wang, Rong
    Li, Jinchao
    Xiao, Yunpeng
    KNOWLEDGE-BASED SYSTEMS, 2025, 310
  • [39] Shot Boundary Detection with Spatial-Temporal Convolutional Neural Networks
    Wu, Lifang
    Zhang, Shuai
    Jian, Meng
    Zhao, Zhijia
    Wang, Dong
    PATTERN RECOGNITION AND COMPUTER VISION, PT II, 2018, 11257 : 479 - 491
  • [40] Lightweight Two-Stream Convolutional Face Detection
    Triantafyllidou, Danai
    Nousi, Paraskevi
    Tefas, Anastasios
    2017 25TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2017, : 1190 - 1194