Evaluation of Different Word Embeddings to Create Personality Models in Spanish

被引:1
作者
Orlando Lopez-Pabon, Felipe [1 ]
Rafael Orozco-Arroyave, Juan [1 ,2 ]
机构
[1] Univ Antioquia UdeA, Fac Engn, Medellin, Colombia
[2] Friedrich Alexander Univ Erlangen Nurnberg, Pattern Recognit Lab, Erlangen, Germany
来源
APPLIED COMPUTER SCIENCES IN ENGINEERING, WEA 2021 | 2021年 / 1431卷
关键词
Personality traits; OCEAN model; Word embeddings; Regression; Classification;
D O I
10.1007/978-3-030-86702-7_11
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Research in psychology has shown that personality directly influences the way people think, feel and communicate. It also has consequences on behavior and indirectly affects work effectiveness and job performance. Automatic personality assessment has gained attention in the last decade, and one of the most common models in psychology for automatic personality analysis is the Big Five model, also called as OCEAN model. Different works that study personality traits are based on English texts; conversely, very few studies focus on creating Spanish models. This paper proposes a methodology for the automatic modeling of personality in Spanish texts. Transliterations of videos from YouTube are translated to Spanish to create and evaluate the models. Classical word embeddings are considered, including Wor2Vec, GloVe, BERT, and BETO. Classification and regression experiments are performed to predict the labels of the five traits in the OCEAN model. The results show that 3 out of the five traits can be predicted with high reliability. Additionally, embeddings created with transformer-based models (i.e., BERT and BETO) yield the highest accuracies.
引用
收藏
页码:121 / 132
页数:12
相关论文
共 24 条
[1]  
Alammar Jay., 2019, The Illustrated Word2vec
[2]  
[Anonymous], 2013, COMPUTING RES REPOSI
[3]   Hi YouTube! Personality Impressions and Verbal Content in Social Video [J].
Biel, Joan-Isaac ;
Tsiminaki, Vagia ;
Dines, John ;
Gatica-Perez, Daniel .
ICMI'13: PROCEEDINGS OF THE 2013 ACM INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, 2013, :119-126
[4]  
Celli F., 2012, P ICDS VAL
[5]   Personality Recognition from Facebook Text [J].
Claudino da Silva, Barbara Barbosa ;
Paraboni, Ivandre .
COMPUTATIONAL PROCESSING OF THE PORTUGUESE LANGUAGE, PROPOR 2018, 2018, 11122 :107-114
[6]   Influence of Personality on Satisfaction with Mobile Phone Services [J].
de Oliveira, Rodrigo ;
Cherubini, Mauro ;
Oliver, Nuria .
ACM TRANSACTIONS ON COMPUTER-HUMAN INTERACTION, 2013, 20 (02)
[7]  
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[8]   Personality Recognition Using Convolutional Neural Networks [J].
Gimenez, Maite ;
Paredes, Roberto ;
Rosso, Paolo .
COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, CICLING 2017, PT II, 2018, 10762 :313-323
[9]   AN ALTERNATIVE DESCRIPTION OF PERSONALITY - THE BIG-5 FACTOR STRUCTURE [J].
GOLDBERG, LR .
JOURNAL OF PERSONALITY AND SOCIAL PSYCHOLOGY, 1990, 59 (06) :1216-1229
[10]  
John O. P., 2010, Handbook of Personality: Theory and Research (Chapter 4)