Improving real-time driver distraction detection via constrained attention mechanism

被引:7
作者
Gao, Hang [1 ,2 ]
Liu, Yi [1 ,2 ,3 ]
机构
[1] Tsinghua Univ, Inst Publ Safety Res, Dept Engn Phys, Beijing 100084, Peoples R China
[2] Tsinghua Univ, Sch Safety Sci, Beijing 100084, Peoples R China
[3] Beijing Key Lab City Integrated Emergency Response, Beijing 100084, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
Driver distraction detection; Class activation maps; Attention; Constrained attention mechanism; RECOGNITION; NETWORK;
D O I
10.1016/j.engappai.2023.107408
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Real-time driving distraction detection has garnered significant attention due to its potential to build various driving safety protections such as distraction warnings and driver assistance systems. Recent studies have focused on the development of neural networks for vision-based detection, though achieving a balance between performance and efficiency has proven challenging. In this paper, we propose a novel constrained attention (CA) mechanism for real-time driver distraction detection, which aims to achieve better performance meanwhile ensuring the computation efficiency. Specifically, we conduct some case studies by generating class activation maps to check the model attention, and three potential factors affecting performance are mined, which are ambiguous attention signal, excessive attention region, and similar attention between different classes. Two regularization terms are designed to optimize the three obstacles. Firstly, a concentrative regularization is introduced to limit the size of the attention region, meanwhile, pixels in the region have clear attention degrees. Secondly, an orthogonal regularization is proposed to optimize the attention of different classes to be discriminative. To further inspiring the model, we design an intersample constraint, which optimizes the attention of images with the same ground truth to be similar. Experiments are conducted on two driver distraction detection datasets, and the experimental results showed that our CA mechanism can bring significant performance improvement. More importantly, there will be no additional computational burden when the trained model is deployed in actual scenarios. Codes are released at https://github.com/ gaohangcodes/CAN4DDD.
引用
收藏
页数:12
相关论文
共 53 条
[1]  
Abbas Tahir, 2020, 2020 IEEE 23 INT MUL, P1
[2]  
Abouelnaga Y, 2018, NEURAL INFORM PROCES
[3]   Towards Computationally Efficient and Realtime Distracted Driver Detection With MobileVGG Network [J].
Baheti, Bhakti ;
Talbar, Sanjay ;
Gajre, Suhas .
IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2020, 5 (04) :565-574
[4]   Deep CNN, Body Pose, and Body-Object Interaction Features for Drivers' Activity Monitoring [J].
Behera, Ardhendu ;
Wharton, Zachary ;
Keidel, Alexander ;
Debnath, Bappaditya .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (03) :2874-2881
[5]  
Behera A, 2018, 2018 15TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE (AVSS), P343, DOI 10.1109/AVSS.2018.8639158
[6]   Attend and Guide (AG-Net): A Keypoints-Driven Attention-Based Deep Network for Image Recognition [J].
Bera, Asish ;
Wharton, Zachary ;
Liu, Yonghuai ;
Bessis, Nik ;
Behera, Ardhendu .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 :3691-3704
[7]   POSEidon: Face-from-Depth for Driver Pose Estimation [J].
Borghi, Guido ;
Venturelli, Marco ;
Vezzani, Roberto ;
Cucchiara, Rita .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :5494-5503
[8]   Grad-CAM plus plus : Generalized Gradient-based Visual Explanations for Deep Convolutional Networks [J].
Chattopadhay, Aditya ;
Sarkar, Anirban ;
Howlader, Prantik ;
Balasubramanian, Vineeth N. .
2018 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2018), 2018, :839-847
[9]   Driver Monitoring Using Sparse Representation With Part-Based Temporal Face Descriptors [J].
Chiou, Chien-Yu ;
Wang, Wei-Cheng ;
Lu, Shueh-Chou ;
Huang, Chun-Rong ;
Chung, Pau-Choo ;
Lai, Yun-Yang .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2020, 21 (01) :346-361
[10]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848