Representation Learning, Scene Understanding, and Feature Fusion for Drowsiness Detection

Cited by: 11
Authors
Yu, Jongmin [1]
Park, Sangwoo [1]
Lee, Sangwook [2]
Jeon, Moongu [1]
Affiliations
[1] GIST, Dept Elect Engn & Comp Sci, Gwangju, South Korea
[2] Mokwon Univ, Dept Informat Commun Engn, Daejeon, South Korea
Source
COMPUTER VISION - ACCV 2016 WORKSHOPS, PT III | 2017 / Vol. 10118
Keywords
DRIVER; CLASSIFICATION; FATIGUE;
DOI
10.1007/978-3-319-54526-4_13
Chinese Library Classification
TP39 [Computer applications];
Discipline classification code
081203; 0835;
Abstract
We propose a novel drowsiness detection method based on a 3D-Deep Convolutional Neural Network (3D-DCNN). We design a learning architecture for drowsiness detection that consists of three building blocks: representation learning, scene understanding, and feature fusion. In this framework, the model generates a spatio-temporal representation from multiple consecutive frames and analyzes the scene conditions, which are defined as head, eye, and mouth movements. The output of the scene condition understanding model serves as auxiliary information for drowsiness detection. The method then generates fusion features from the spatio-temporal representation and the scene condition classification results. We show that using the fusion features boosts drowsiness detection performance. The proposed method is demonstrated on the NTHU Drowsy Driver Detection (NTHU-DDD) video dataset.
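The fusion step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes that fusion is a plain concatenation of the spatio-temporal representation with the class probabilities of each scene condition, and the function names, vector sizes, and class counts below are hypothetical.

```python
import math
from typing import Dict, List


def softmax(scores: List[float]) -> List[float]:
    """Convert raw classifier scores into probabilities."""
    peak = max(scores)  # subtract the maximum for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def fuse_features(representation: List[float],
                  scene_scores: Dict[str, List[float]]) -> List[float]:
    """Concatenate a clip representation with scene-condition probabilities.

    Assumption: fusion is simple concatenation. The abstract does not
    specify the fusion operator, so this is only one plausible reading.
    """
    fused = list(representation)
    for name in sorted(scene_scores):  # fixed order: eye, head, mouth
        fused.extend(softmax(scene_scores[name]))
    return fused


# Hypothetical values: a 4-dimensional representation of one clip and
# raw scores from three scene-condition classifiers.
representation = [0.2, -1.3, 0.7, 0.05]
scene_scores = {
    "head": [2.0, 0.1, -1.0],
    "eye": [0.3, 1.5],
    "mouth": [0.0, 0.0, 0.0],
}
fused = fuse_features(representation, scene_scores)
print(len(fused))
```

In this reading, the fused vector would be passed to a final drowsiness classifier, so that the auxiliary head, eye, and mouth predictions sit alongside the learned representation.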
Pages: 165-177
Page count: 13