Holistic Driver Monitoring: A Multi-Task Approach for In-Cabin Driver Attention Evaluation through Multi-Camera Data

被引:0
作者
Palo, Patilapaban [1 ]
Nayak, Satyajit [1 ]
Modhugu, Durga Nagendra Raghava Kumar [1 ]
Gupta, Kwanit [1 ]
Uttarkabat, Satarupa [1 ]
机构
[1] Valeo India Private Ltd, Chennai, Tamil Nadu, India
来源
2024 35TH IEEE INTELLIGENT VEHICLES SYMPOSIUM, IEEE IV 2024 | 2024年
关键词
RECOGNITION; NETWORKS;
D O I
10.1109/IV55156.2024.10588402
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In the domain of road safety, evaluating and enhancing driver attention is crucial for reducing the frequency of road accidents. This research paper proposes a novel methodology for assessing driver attention by leveraging multi-camera data. Our approach considers a dual-camera setup capturing the face and body parts from the driver's side, focusing on four key tasks: distraction detection, gaze direction analysis, fatigue detection, and hands-on-wheel monitoring. We utilize facial landmarks for fatigue detection, specifically targeting the mouth and eye regions. SqueezeNet, a lightweight convolutional neural network, is employed to discern signs of driver fatigue. Gaze direction analysis uses the same network but focuses solely on eye landmark features. Distraction activities are identified by extracting optical flow features and using them as input to the robust I3D network. Further, hands-on-wheel detection is achieved by extracting hand landmarks, followed by using a 3D CNN model. We propose a driver attention score to consolidate these tasks into a unified measure of driver attention. This score is a holistic representation of the driver's attentiveness, combining insights from distraction detection, gaze direction analysis, fatigue detection, and hands-on-wheel monitoring. Our methodology is validated on a driver monitoring dataset (DMD), where training and application demonstrate the effectiveness of the proposed approach in assessing and quantifying driver attention.
引用
收藏
页码:1361 / 1366
页数:6
相关论文
共 47 条
[21]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[22]   DETERMINING OPTICAL-FLOW [J].
HORN, BKP ;
SCHUNCK, BG .
ARTIFICIAL INTELLIGENCE, 1981, 17 (1-3) :185-203
[23]   Car that Knows Before You Do: Anticipating Maneuvers via Learning Temporal Driving Models [J].
Jain, Ashesh ;
Koppula, Hema S. ;
Raghavan, Bharad ;
Soh, Shane ;
Saxena, Ashutosh .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :3182-3190
[24]   A novel public dataset for multimodal multiview and multispectral driver distraction analysis: 3MDAD [J].
Jegham, Imen ;
Ben Khalifa, Anouar ;
Alouani, Ihsen ;
Mahjoub, Mohamed Ali .
SIGNAL PROCESSING-IMAGE COMMUNICATION, 2020, 88
[25]   MDAD: A Multimodal and Multiview in-Vehicle Driver Action Dataset [J].
Jegham, Imen ;
Ben Khalifa, Anouar ;
Alouani, Ihsen ;
Mahjoub, Mohamed Ali .
COMPUTER ANALYSIS OF IMAGES AND PATTERNS, CAIP 2019, PT I, 2019, 11678 :518-529
[26]  
Kaggle, 2020, STAT FARM DISTR DRIV
[27]  
Keutzer K., 2016, arXiv
[28]  
King DE, 2009, J MACH LEARN RES, V10, P1755
[29]  
Klauer S.G., 2006, 810594 DOT HS NAT HI
[30]   ImageNet Classification with Deep Convolutional Neural Networks [J].
Krizhevsky, Alex ;
Sutskever, Ilya ;
Hinton, Geoffrey E. .
COMMUNICATIONS OF THE ACM, 2017, 60 (06) :84-90