Personality Prediction from Social Media Posts using Text Embedding and Statistical Features

Cited by: 3
Authors
Majima, Seiyu [1]
Markov, Konstantin [1]
Affiliations
[1] Univ Aizu, Aizu Wakamatsu, Fukushima, Japan
Source
PROCEEDINGS OF THE 2022 17TH CONFERENCE ON COMPUTER SCIENCE AND INTELLIGENCE SYSTEMS (FEDCSIS) | 2022
Keywords
RECOGNITION;
DOI
10.15439/2022F133
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Recent advances in deep-learning-based language models have boosted performance in many downstream tasks such as sentiment analysis, text summarization, and question answering. Personality prediction from text is a relatively new task that has attracted researchers' attention due to the increased interest in personalized services as well as the availability of social media data. In this study, we propose a personality prediction system where text embeddings from large language models such as BERT are combined with multiple statistical features extracted from the input text. For the combination, we use the self-attention mechanism, which is a popular choice when several information sources need to be merged. Our experiments with the Kaggle dataset for MBTI clearly show that adding text statistical features improves the system performance relative to using only BERT embeddings. We also analyze the influence of the personality type words on the overall results.
Pages: 235-240
Number of pages: 6