Demographics and Personality Discovery on Social Media: A Machine Learning Approach

Cited by: 3
Authors
Tuomchomtam, Sarach [1]
Soonthornphisaj, Nuanwan [1]
Affiliations
[1] Kasetsart Univ, Dept Comp Sci, Artificial Intelligence & Knowledge Discovery Lab, Fac Sci, Bangkok 10900, Thailand
Keywords
demographic attributes; personality prediction; social media; machine learning; BRIGGS TYPE INDICATOR; MODEL
DOI
10.3390/info12090353
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
This research proposes a new feature extraction algorithm that uses aggregated user engagements on social media to perform demographics and personality discovery. Our proposed framework can discover seven essential attributes: gender identity, age group, residential area, education level, political affiliation, religious belief, and personality type. Multiple feature sets are developed, including comment text, community activity, and hybrid features. Various machine learning algorithms are explored, such as support vector machines, random forest, multi-layer perceptron, and naive Bayes. An empirical analysis is performed on various aspects, including correctness, robustness, training time, and the class imbalance problem. We obtained the highest prediction performance by using our proposed feature extraction algorithm. The result on personality type prediction was 87.18%. For the demographic attribute prediction task, our feature sets also outperformed the baseline, at 98.1% for residential area, 94.7% for education level, 92.1% for gender identity, 91.5% for political affiliation, 60.6% for religious belief, and 52.0% for age group. Moreover, this paper provides guidelines for choosing classifiers with appropriate feature sets.
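The abstract names three feature sets (comment text, community activity, and hybrid) but does not give the extraction algorithm itself. The following is only a minimal sketch of how such feature vectors could be assembled, not the authors' method. The function names, the fixed `vocabulary` and `communities` lists, the whitespace tokenizer, and the normalization by total count are all assumptions made for illustration.

```python
from collections import Counter


def community_activity_features(engagements, communities):
    """Share of a user's engagements in each listed community (assumed scheme)."""
    counts = Counter(engagements)
    total = sum(counts[c] for c in communities)
    if total == 0:
        return [0.0] * len(communities)
    return [counts[c] / total for c in communities]


def comment_text_features(comments, vocabulary):
    """Relative term frequency over a fixed vocabulary (assumed scheme)."""
    tokens = [tok for comment in comments for tok in comment.lower().split()]
    counts = Counter(tokens)
    total = sum(counts[w] for w in vocabulary)
    if total == 0:
        return [0.0] * len(vocabulary)
    return [counts[w] / total for w in vocabulary]


def hybrid_features(user, communities, vocabulary):
    """Concatenate text and community-activity features into one vector."""
    text = comment_text_features(user["comments"], vocabulary)
    activity = community_activity_features(user["engagements"], communities)
    return text + activity


# Hypothetical user record, invented purely for illustration.
user = {
    "comments": ["I love hiking", "hiking is great"],
    "engagements": ["hiking", "hiking", "politics", "music"],
}
vector = hybrid_features(user, ["hiking", "politics"], ["hiking", "love"])
print(vector)
```

A vector of this kind could then be passed to any of the classifiers the abstract lists (support vector machines, random forest, multi-layer perceptron, naive Bayes). Which classifier suits which feature set is the question the paper itself addresses.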
Pages: 16