Driver Emotion Recognition With a Hybrid Attentional Multimodal Fusion Framework

被引:28
作者
Mou, Luntian [1 ]
Zhao, Yiyuan [1 ]
Zhou, Chao [1 ]
Nakisa, Bahareh [2 ]
Rastgoo, Mohammad Naim [3 ]
Ma, Lei [4 ,5 ]
Huang, Tiejun [4 ,5 ]
Yin, Baocai [1 ]
Jain, Ramesh [6 ]
Gao, Wen [7 ,8 ]
机构
[1] Beijing Univ Technol, Beijing Inst Artificial Intelligence, Fac Informat Technol, Beijing Key Lab Multimedia & Intelligent Software, Beijing 100124, Peoples R China
[2] Deakin Univ, Fac Sci Engn & Built Environm, Sch Informat Technol, Burwood, Vic 3125, Australia
[3] Queensland Univ Technol, Sch Elect Engn & Comp Sci, Brisbane, Qld 4000, Australia
[4] Beijing Acad Artificial Intelligence, Beijing 100875, Peoples R China
[5] Peking Univ, Natl Engn Lab Video Technol, Beijing 100871, Peoples R China
[6] Univ Calif Irvine, Inst Future Hlth, Bren Sch Informat & Comp Sci, Irvine, CA 92697 USA
[7] Peking Univ, Inst Digital Media, Beijing 100871, Peoples R China
[8] Peking Univ, Sch Elect & Comp Engn, Shenzhen Grad Sch, Shenzhen 518055, Guangdong, Peoples R China
基金
中国国家自然科学基金;
关键词
Emotion recognition; Vehicles; Feature extraction; Accidents; Anxiety disorders; Physiology; Data mining; Attention mechanism; convolutional long short term memory; driver emotion recognition; driver stress; multimodal fusion; STRESS; EEG;
D O I
10.1109/TAFFC.2023.3250460
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Negative emotions may induce dangerous driving behaviors leading to extremely serious traffic accidents. Therefore, it is necessary to establish a system that can automatically recognize driver emotions so that some actions can be taken to avoid traffic accidents. Existing studies on driver emotion recognition have mainly used facial data and physiological data. However, there are fewer studies on multimodal data with contextual characteristics of driving. In addition, fully fusing multimodal data in the feature fusion layer to improve the performance of emotion recognition is still a challenge. To this end, we propose to recognize driver emotion using a novel multimodal fusion framework based on convolutional long-short term memory network (ConvLSTM), and hybrid attention mechanism to fuse non-invasive multimodal data of eye, vehicle, and environment. In order to verify the effectiveness of the proposed method, extensive experiments have been carried out on a dataset collected using an advanced driving simulator. The experimental results demonstrate the effectiveness of the proposed method. Finally, a preliminary exploration on the correlation between driver emotion and stress is performed.
引用
收藏
页码:2970 / 2981
页数:12
相关论文
共 56 条
[1]  
[Anonymous], 2022, Annual global road crash statistics
[2]   Bi-Directional ConvLSTM U-Net with Densley Connected Convolutions [J].
Azad, Reza ;
Asadi-Aghbolaghi, Maryam ;
Fathy, Mahmood ;
Escalera, Sergio .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, :406-415
[3]  
Bahdanau D, 2016, Arxiv, DOI [arXiv:1409.0473, DOI 10.48550/ARXIV.1409.0473]
[4]   Multimodal Machine Learning: A Survey and Taxonomy [J].
Baltrusaitis, Tadas ;
Ahuja, Chaitanya ;
Morency, Louis-Philippe .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2019, 41 (02) :423-443
[5]  
Berg N., 2020, J. Int. NEur.sychol. Soc., V27, P1
[6]   MEASURING EMOTION - THE SELF-ASSESSMENT MANNEQUIN AND THE SEMANTIC DIFFERENTIAL [J].
BRADLEY, MM ;
LANG, PJ .
JOURNAL OF BEHAVIOR THERAPY AND EXPERIMENTAL PSYCHIATRY, 1994, 25 (01) :49-59
[7]   Effect of Passenger Presence on Older Drivers' Risk of Fatal Crash Involvement [J].
Braitman, Keli A. ;
Chaudhary, Neil K. ;
McCartt, Anne T. .
TRAFFIC INJURY PREVENTION, 2014, 15 (05) :451-456
[8]   Predicting Driver Self-Reported Stress by Analyzing the Road Scene [J].
Bustos, Cristina ;
Elhaouij, Neska ;
Sole-Ribalta, Albert ;
Borge-Holthoefer, Javier ;
Lapedriza, Agata ;
Picard, Rosalind .
2021 9TH INTERNATIONAL CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION (ACII), 2021,
[9]   Multiple Transferable Recursive Feature Elimination Technique for Emotion Recognition Based on EEG Signals [J].
Cai, Jiahui ;
Chen, Wei ;
Yin, Zhong .
SYMMETRY-BASEL, 2019, 11 (05)
[10]   Why Drivers Feel the Way they Do:An On-the-Road Study Using Self-Reports and Geo-Tagging [J].
Dittrich, Monique .
AUTOMOTIVEUI '21: 13TH INTERNATIONAL ACM CONFERENCE ON AUTOMOTIVE USER INTERFACES AND INTERACTIVE VEHICULAR APPLICATIONS, 2021, :116-125