Driver Drowsiness Detection Using Condition-Adaptive Representation Learning Framework

被引:67
作者
Yu, Jongmin [1 ]
Park, Sangwoo [1 ]
Lee, Sangwook [2 ]
Jeon, Moongu [1 ]
机构
[1] Gwangju Inst Sci & Technol, Sch Elect Engn & Comp Sci, Gwangju 61005, South Korea
[2] Mokwon Univ, Dept Informat Commun Engn, Daejeon 31549, South Korea
基金
新加坡国家研究基金会;
关键词
Feature extraction; Visualization; Vehicle crash testing; Sensors; Adaptation models; Automobiles; Representation learning; adaptive learning; convolutional neural network; driver drowsiness detection; CLASSIFICATION; SLEEPINESS; FATIGUE;
D O I
10.1109/TITS.2018.2883823
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
We propose a condition-adaptive representation learning framework for driver drowsiness detection based on a 3D-deep convolutional neural network. The proposed framework consists of four models: spatio-temporal representation learning, scene condition understanding, feature fusion, and drowsiness detection. Spatio-temporal representation learning extracts features that can describe motions and appearances in video simultaneously. Scene condition understanding classifies the scene conditions related to various conditions about the drivers and driving situations, such as statuses of wearing glasses, illumination condition of driving, and motion of facial elements, such as head, eye, and mouth. Feature fusion generates a condition-adaptive representation using two features extracted from the above models. The drowsiness detection model recognizes driver drowsiness status using the condition-adaptive representation. The condition-adaptive representation learning framework can extract more discriminative features focusing on each scene condition than the general representation so that the drowsiness detection method can provide more accurate results for the various driving situations. The proposed framework is evaluated with the NTHU drowsy driver detection video dataset. The experimental results show that our framework outperforms the existing drowsiness detection methods based on visual analysis.
引用
收藏
页码:4206 / 4218
页数:13
相关论文
共 58 条
[1]  
Abtahi S., 2014, P 5 ACM MULT SYST C, P24
[2]   Histograms of oriented gradients for human detection [J].
Dalal, N ;
Triggs, B .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :886-893
[3]  
Dinges D. F., 1998, FHWAMCRT98006 US DEP
[4]  
Donahue J, 2015, PROC CVPR IEEE, P2625, DOI 10.1109/CVPR.2015.7298878
[5]   Learning Spatiotemporal Features with 3D Convolutional Networks [J].
Du Tran ;
Bourdev, Lubomir ;
Fergus, Rob ;
Torresani, Lorenzo ;
Paluri, Manohar .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :4489-4497
[6]  
Du Y, 2015, PROC CVPR IEEE, P1110, DOI 10.1109/CVPR.2015.7298714
[7]  
Dwivedi K, 2014, IEEE INT ADV COMPUT, P995, DOI 10.1109/IAdCC.2014.6779459
[8]   Scalable Object Detection using Deep Neural Networks [J].
Erhan, Dumitru ;
Szegedy, Christian ;
Toshev, Alexander ;
Anguelov, Dragomir .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :2155-2162
[9]   Model-Based Analysis and Classification of Driver Distraction Under Secondary Tasks [J].
Ersal, Tulga ;
Fuller, Helen J. A. ;
Tsimhoni, Omer ;
Stein, Jeffrey L. ;
Fathy, Hosam K. .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2010, 11 (03) :692-701
[10]   Multi-view Face Detection Using Deep Convolutional Neural Networks [J].
Farfade, Sachin Sudhakar ;
Saberian, Mohammad ;
Li, Li-Jia .
ICMR'15: PROCEEDINGS OF THE 2015 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2015, :643-650