Drivers' Visual Distraction Detection Using Facial Landmarks and Head Pose

被引:7
作者
Zhang, Shile [1 ]
Abdel-Aty, Mohamed [1 ]
机构
[1] Univ Cent Florida, Dept Civil Environm & Construct Engn, Orlando, FL 32816 USA
关键词
Drivers' visual distraction; Naturalistic Driving Study (NDS); head pose; convolutional neural network (CNN); CRASH;
D O I
10.1177/03611981221087234
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Drivers' distraction has been widely studied in the field of naturalistic driving studies. However, it is difficult to use traditional variables, such as speed, acceleration, and yaw rate to detect drivers' distraction in real time. Emerging technologies have obtained features from human faces, such as eye gaze, to detect drivers' visual distraction. However, eye gaze is hard to detect in naturalistic driving situations, because of low-resolution cameras, drivers wearing sunglasses, and so forth. Instead, head pose is easier to detect, and has correlation with eye gaze direction. In this study, city-wide videos are collected using onboard cameras from over 289 drivers representing 423 events. Head pose (pitch, yaw, and roll rates) are derived and fed into a convolutional neural network to detect drivers' distraction. The experiment results show that the proposed model can achieve recall value of 0.938 and area under the receiver operating characteristic curve value of 0.931, with variables from five time slices (1.25 s) used as input. The study proves that head pose can be used to detect drivers' distraction. The study offers insights for detecting drivers' distraction and can be used for the development of advanced driver assistance systems.
引用
收藏
页码:491 / 501
页数:11
相关论文
共 46 条
[21]   Head Pose Classification by using Body-Conducted Sound [J].
Kamoshida, Ryo ;
Takemura, Kentaro .
ADJUNCT PUBLICATION OF THE 31ST ANNUAL ACM SYMPOSIUM ON USER INTERFACE SOFTWARE AND TECHNOLOGY (UIST'18 ADJUNCT), 2018, :39-41
[22]   Determination of head pose and facial expression from a single perspective view by successive scaled orthographic approximations [J].
Chang, CC ;
Tsai, WH .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2002, 46 (03) :179-199
[23]   Determination of Head Pose and Facial Expression from a Single Perspective View by Successive Scaled Orthographic Approximations [J].
Chin-Chun Chang ;
Wen-Hsiang Tsai .
International Journal of Computer Vision, 2002, 46 :179-199
[24]   TRFH: towards real-time face detection and head pose estimation [J].
Chen, Shicun ;
Zhang, Yong ;
Yin, Baocai ;
Wang, Boyue .
PATTERN ANALYSIS AND APPLICATIONS, 2021, 24 (04) :1745-1755
[25]   TRFH: towards real-time face detection and head pose estimation [J].
Shicun Chen ;
Yong Zhang ;
Baocai Yin ;
Boyue Wang .
Pattern Analysis and Applications, 2021, 24 :1745-1755
[26]   Automated detection of cephalometric landmarks using deep neural patchworks [J].
Weingart, Julia Vera ;
Schlager, Stefan ;
Metzger, Marc Christian ;
Brandenburg, Leonard Simon ;
Hein, Anna ;
Schmelzeisen, Rainer ;
Bamberg, Fabian ;
Kim, Suam ;
Kellner, Elias ;
Reisert, Marco ;
Russe, Maximilian Frederik .
DENTOMAXILLOFACIAL RADIOLOGY, 2023, 52 (06)
[27]   Probabilistic Temporal Head Pose Estimation Using a Hierarchical Graphical Model [J].
Demirkus, Meltem ;
Precup, Doina ;
Clark, James J. ;
Arbel, Tal .
COMPUTER VISION - ECCV 2014, PT I, 2014, 8689 :328-344
[28]   A Deep Learning Method for Automatic Visual Attention Detection in Older Drivers [J].
Chikhaoui, Belkacem ;
Ruer, Perrine ;
Vallieres, Evelyne F. .
HOW AI IMPACTS URBAN LIVING AND PUBLIC HEALTH, ICOST 2019, 2019, 11862 :49-59
[29]   Multimodal Depression Detection: Fusion Analysis of Paralinguistic, Head Pose and Eye Gaze Behaviors [J].
Alghowinem, Sharifa ;
Goecke, Roland ;
Wagner, Michael ;
Epps, Julien ;
Hyett, Matthew ;
Parker, Gordon ;
Breakspear, Michael .
IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2018, 9 (04) :478-490
[30]   Discriminative Robust Head-Pose and Gaze Estimation Using Kernel-DMCCA Features Fusion [J].
Rabba, Salah ;
Kyan, Matthew ;
Gao, Lei ;
Quddus, Azhar ;
Zandi, Ali Shahidi ;
Guan, Ling .
INTERNATIONAL JOURNAL OF SEMANTIC COMPUTING, 2020, 14 (01) :107-135