Multimodal First Impression Analysis with Deep Residual Networks

被引:43
作者
Gucluturk, Yagmur [1 ]
Guclu, Umut [1 ]
Baro, Xavier [2 ,3 ]
Escalante, Hugo Jair [4 ]
Guyon, Isabelle [5 ,6 ]
Escalera, Sergio [7 ,8 ]
van Gerven, Marcel A. J. [1 ]
van Lier, Rob [1 ]
机构
[1] Radboud Univ Nijmegen, Donders Inst Brain Cognit & Behav, NL-6525 HR Nijmegen, Netherlands
[2] Open Univ Catalonia, Barcelona 08018, Spain
[3] Comp Vis Ctr, Barcelona 08018, Spain
[4] Inst Nacl Astrofis Opt & Eectron, Puebla 72840, Mexico
[5] Univ Paris Saclay, INRIA, UPSud, F-91190 St Aubin, France
[6] ChaLearn, Berkeley, CA 94708 USA
[7] Univ Barcelona, Barcelona 08007, Spain
[8] Comp Vis Ctr, Barcelona 08007, Spain
关键词
Big Five personality traits; deep learning; explainability; first impression; multimodal; PERSONALITY;
D O I
10.1109/TAFFC.2017.2751469
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
People form first impressions about the personalities of unfamiliar individuals even after very brief interactions with them. In this study we present and evaluate several models that mimic this automatic social behavior. Specifically, we present several models trained on a large dataset of short YouTube video blog posts for predicting apparent Big Five personality traits of people and whether they seem suitable to be recommended to a job interview. Along with presenting our audiovisual approach and results that won the third place in the ChaLearn First Impressions Challenge, we investigate modeling in different modalities including audio only, visual only, language only, audiovisual, and combination of audiovisual and language. Our results demonstrate that the best performance could be obtained using a fusion of all data modalities. Finally, in order to promote explainability in machine learning and to provide an example for the upcoming ChaLearn challenges, we present a simple approach for explaining the predictions for job interview recommendations.
引用
收藏
页码:316 / 329
页数:14
相关论文
共 61 条
[1]  
Anderson Keith, 2013, Advances in Computer Entertainment. 10th International Conference, ACE 2013. Proceedings: LNCS 8253, P476, DOI 10.1007/978-3-319-03161-3_35
[2]  
[Anonymous], P 14 ACM INT C MULT
[3]  
[Anonymous], P INT C AFF COMP INT
[4]  
[Anonymous], 2015, ARXIV150606724
[5]  
[Anonymous], 2016, ARXIV160308155
[6]  
[Anonymous], 2016, ADV NEURAL INF PROCE
[7]  
[Anonymous], 2011, P INT AAAI C WEB SOC
[8]  
[Anonymous], P BRIT MACH VIS C JU
[9]  
[Anonymous], DEEP IMPRESSION AUDI
[10]  
[Anonymous], 2015, P INT C MACHINE LEAR