Recognizing Emotion in the Wild using Multimodal Data

Cited by: 6
Authors
Srivastava, Shivam [1]
Lakshminarayan, Saandeep Aathreya Sidhapur [1]
Hinduja, Saurabh [1]
Jannat, Sk Rahatul [1]
Elhamdadi, Hamza [1]
Canavan, Shaun [1]
Affiliations
[1] Univ S Florida, Tampa, FL 33620 USA
Source
PROCEEDINGS OF THE 2020 INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, ICMI 2020 | 2020
Keywords
Gaze Detection; Group Emotion; Engagement Prediction; Physiological Signals; AUGMENTATION; PERFORMANCE; GAN
DOI
10.1145/3382507.3417970
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In this work, we present our approach for all four tracks of the eighth Emotion Recognition in the Wild Challenge (EmotiW 2020). The four tasks are group emotion recognition, driver gaze prediction, predicting engagement in the wild, and emotion recognition using physiological signals. We explore multiple approaches, including classical machine learning tools such as random forests, state-of-the-art deep neural networks, and multiple fusion and ensemble-based approaches. We also show that similar approaches can be used across tracks, as many of the features (e.g., facial features) generalize well to the different problems. We detail evaluation results that are either comparable to or outperform the baseline results on both validation and testing for most of the tracks.
Pages: 849-857
Number of pages: 9