An Integrated Framework for Multi-State Driver Monitoring Using Heterogeneous Loss and Attention-Based Feature Decoupling

被引:2
作者
Hu, Zhongxu [1 ]
Zhang, Yiran [1 ]
Xing, Yang [2 ]
Li, Qinghua [3 ]
Lv, Chen [1 ]
机构
[1] Nanyang Technol Univ, Sch Mech & Aerosp Engn, Singapore 639798, Singapore
[2] Cranfield Univ, Ctr Autonomous & Cyber Phys Syst, Cranfield MK43 0AL, Beds, England
[3] Alibaba DAMO Acad Autonomous Driving Lab, Hangzhou 311121, Peoples R China
关键词
driver state; feature decoupling; cascade cross-entropy; gaze consistency; HEAD POSE ESTIMATION; GAZE ESTIMATION; VEHICLES; NETWORK;
D O I
10.3390/s22197415
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Multi-state driver monitoring is a key technique in building human-centric intelligent driving systems. This paper presents an integrated visual-based multi-state driver monitoring framework that incorporates head rotation, gaze, blinking, and yawning. To solve the challenge of head pose and gaze estimation, this paper proposes a unified network architecture that tackles these estimations as soft classification tasks. A feature decoupling module was developed to decouple the extracted features from different axis domains. Furthermore, a cascade cross-entropy was designed to restrict large deviations during the training phase, which was combined with the other features to form a heterogeneous loss function. In addition, gaze consistency was used to optimize its estimation, which also informed the model architecture design of the gaze estimation task. Finally, the proposed method was verified on several widely used benchmark datasets. Comprehensive experiments were conducted to evaluate the proposed method and the experimental results showed that the proposed method could achieve a state-of-the-art performance compared to other methods.
引用
收藏
页数:14
相关论文
共 53 条
[1]   img2pose: Face Alignment and Detection via 6DoF, Face Pose Estimation [J].
Albiero, Vitor ;
Chen, Xingyu ;
Yin, Xi ;
Pang, Guan ;
Hassner, Tal .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :7613-7623
[2]   Constraining Design: Applying the Insights of Cognitive Work Analysis to the Design of Novel In-Car Interfaces to Support Eco-Driving [J].
Allison, Craig K. ;
Stanton, Neville A. .
AUTOMOTIVE INNOVATION, 2020, 3 (01) :30-41
[3]  
Bao Y., 2021, ARXIV, DOI [10.1609/aaai.v36i1.19921, DOI 10.1609/AAAI.V36I1.19921]
[4]   Future Directions of Intelligent Vehicles: Potentials, Possibilities, and Perspectives [J].
Cao, Dongpu ;
Wang, Xiao ;
Li, Lingxi ;
Lv, Chen ;
Na, Xiaoxiang ;
Xing, Yang ;
Li, Xuan ;
Li, Ying ;
Chen, Yuanyuan ;
Wang, Fei-Yue .
IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2022, 7 (01) :7-10
[5]   A Vector-based Representation to Enhance Head Pose Estimation [J].
Cao, Zhiwen ;
Chu, Zongcheng ;
Liu, Dongfang ;
Chen, Yingjie .
2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021), 2021, :1187-1196
[6]   TensorFI: A Flexible Fault Injection Framework for TensorFlow Applications [J].
Chen, Zitao ;
Narayanan, Niranjhana ;
Fang, Bo ;
Li, Guanpeng ;
Pattabiraman, Karthik ;
DeBardeleben, Nathan .
2020 IEEE 31ST INTERNATIONAL SYMPOSIUM ON SOFTWARE RELIABILITY ENGINEERING (ISSRE 2020), 2020, :426-435
[7]   Gaze Estimation by Exploring Two-Eye Asymmetry [J].
Cheng, Yihua ;
Zhang, Xucong ;
Lu, Feng ;
Sato, Yoichi .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 :5259-5272
[8]   Automated Vehicle Handover Interface Design: Focus Groups with Learner, Intermediate and Advanced Drivers [J].
Clark, Jediah R. ;
Stanton, Neville A. ;
Revell, Kirsten M. A. .
AUTOMOTIVE INNOVATION, 2020, 3 (01) :14-29
[9]   MS-SSPCANet: A powerful deep learning framework for tool wear prediction [J].
Duan, Jian ;
Hu, Cheng ;
Zhan, Xiaobin ;
Zhou, Hongdi ;
Liao, Guanglan ;
Shi, Tielin .
ROBOTICS AND COMPUTER-INTEGRATED MANUFACTURING, 2022, 78
[10]   Unsupervised Learning of Eye Gaze Representation from the Web [J].
Dubey, Neeru ;
Ghosh, Shreya ;
Dhall, Abhinav .
2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,