A Hybrid Multimodal Emotion Recognition Framework for UX Evaluation Using Generalized Mixture Functions

被引:7
作者
Razzaq, Muhammad Asif [1 ,2 ]
Hussain, Jamil [3 ]
Bang, Jaehun [4 ]
Hua, Cam-Hao [2 ]
Satti, Fahad Ahmed [2 ,5 ]
Rehman, Ubaid Ur [2 ,5 ]
Bilal, Hafiz Syed Muhammad [5 ]
Kim, Seong Tae [2 ]
Lee, Sungyoung [2 ]
机构
[1] Fatima Jinnah Women Univ, Dept Comp Sci, Rawalpindi 46000, Pakistan
[2] Kyung Hee Univ, Dept Comp Sci & Engn, Ubiquitous Comp Lab, Yongin 17104, South Korea
[3] Sejong Univ, Dept Data Sci, Seoul 30019, South Korea
[4] Hanwha Corp Momentum, Hanwha Bldg, 86 Cheonggyecheon Ro, Seoul 04541, South Korea
[5] Natl Univ Sci & Technol NUST, Sch Elect Engn & Comp Sci SEECS, Dept Comp, Islamabad 44000, Pakistan
关键词
emotion recognition; user experience; audio-based emotion recognition; feature fusioning; decision fusioning; generalized mixture function; FUSION; NETWORK; MODEL;
D O I
10.3390/s23094373
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Multimodal emotion recognition has gained much traction in the field of affective computing, human-computer interaction (HCI), artificial intelligence (AI), and user experience (UX). There is growing demand to automate analysis of user emotion towards HCI, AI, and UX evaluation applications for providing affective services. Emotions are increasingly being used, obtained through the videos, audio, text or physiological signals. This has led to process emotions from multiple modalities, usually combined through ensemble-based systems with static weights. Due to numerous limitations like missing modality data, inter-class variations, and intra-class similarities, an effective weighting scheme is thus required to improve the aforementioned discrimination between modalities. This article takes into account the importance of difference between multiple modalities and assigns dynamic weights to them by adapting a more efficient combination process with the application of generalized mixture (GM) functions. Therefore, we present a hybrid multimodal emotion recognition (H-MMER) framework using multi-view learning approach for unimodal emotion recognition and introducing multimodal feature fusion level, and decision level fusion using GM functions. In an experimental study, we evaluated the ability of our proposed framework to model a set of four different emotional states (Happiness, Neutral, Sadness, and Anger) and found that most of them can be modeled well with significantly high accuracy using GM functions. The experiment shows that the proposed framework can model emotional states with an average accuracy of 98.19% and indicates significant gain in terms of performance in contrast to traditional approaches. The overall evaluation results indicate that we can identify emotional states with high accuracy and increase the robustness of an emotion classification system required for UX measurement.
引用
收藏
页数:25
相关论文
共 54 条
[11]   Robust Human Activity Recognition Using Multimodel Feature-Level Fusion [J].
Ehatisham-Ul-Haq, Muhammad ;
Javed, Ali ;
Azam, Muhammad Awais ;
Malik, Hafiz M. A. ;
Irtaza, Aun ;
Lee, Ik Hyun ;
Mahmood, Muhammad Tariq .
IEEE ACCESS, 2019, 7 :60736-60751
[12]   Perception of Emotions and Body Movement in the Emilya Database [J].
Fourati, Nesrine ;
Pelachaud, Catherine .
IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2018, 9 (01) :90-101
[13]   Facial expression recognition using densely connected convolutional neural network and hierarchical spatial attention [J].
Gan, Chenquan ;
Xiao, Junhao ;
Wang, Zhangyi ;
Zhang, Zufan ;
Zhu, Qingyi .
IMAGE AND VISION COMPUTING, 2022, 117
[14]   Multimodal sentiment analysis: A systematic review of history, datasets, multimodal fusion methods, applications, challenges and future directions [J].
Gandhi, Ankita ;
Adhvaryu, Kinjal ;
Poria, Soujanya ;
Cambria, Erik ;
Hussain, Amir .
INFORMATION FUSION, 2023, 91 :424-444
[15]  
Ghoniem RM., 2019, INFORMATION, V10, P239, DOI [10.3390/info10070239, DOI 10.3390/info10070239]
[16]  
github, LEAN UX MIX METH APP
[17]   Multi-sensor fusion in body sensor networks: State-of-the-art and research challenges [J].
Gravina, Raffaele ;
Alinia, Parastoo ;
Ghasemzadeh, Hassan ;
Fortino, Giancarlo .
INFORMATION FUSION, 2017, 35 :68-80
[18]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[19]  
Hua C.H., 2020, P 2020 14 INT C UBIQ, P1
[20]  
Huang J., 2017, P 7 ANN WORKSH AUD V, P11