Demographics and Personality Discovery on Social Media: A Machine Learning Approach

Cited by: 3
Authors
Tuomchomtam, Sarach [1]
Soonthornphisaj, Nuanwan [1]
Affiliations
[1] Kasetsart Univ, Dept Comp Sci, Artificial Intelligence & Knowledge Discovery Lab, Fac Sci, Bangkok 10900, Thailand
Keywords
demographic attributes; personality prediction; social media; machine learning; BRIGGS TYPE INDICATOR; MODEL
DOI
10.3390/info12090353
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
This research proposes a new feature extraction algorithm that uses aggregated user engagements on social media to perform demographic and personality discovery tasks. The proposed framework can discover seven essential attributes: gender identity, age group, residential area, education level, political affiliation, religious belief, and personality type. Multiple feature sets are developed, including comment text, community activity, and hybrid features. Various machine learning algorithms are explored, such as support vector machines, random forest, multi-layer perceptron, and naive Bayes. An empirical analysis covers several aspects, including correctness, robustness, training time, and the class imbalance problem. The highest prediction performance was obtained with the proposed feature extraction algorithm. The result on personality type prediction was 87.18%. For the demographic attribute prediction task, the proposed feature sets also outperformed the baseline, at 98.1% for residential area, 94.7% for education level, 92.1% for gender identity, 91.5% for political affiliation, 60.6% for religious belief, and 52.0% for age group. Moreover, the paper provides guidelines for choosing classifiers with appropriate feature sets.
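The abstract names three feature sets (comment text, community activity, and hybrid) without giving the extraction algorithm itself. The following is only a minimal sketch of what aggregating user engagements into such feature vectors could look like. It assumes a hypothetical input of (user, community) events and (user, comment) pairs, a relative-frequency encoding for community activity, and a simple bag-of-words encoding for comment text. None of these choices, function names, or data are taken from the paper.

```python
from collections import Counter, defaultdict


def community_activity_features(engagements, communities):
    """Aggregate (user, community) events into per-user relative-frequency
    vectors over a fixed community ordering (an assumed encoding)."""
    counts = defaultdict(Counter)
    for user, community in engagements:
        counts[user][community] += 1
    features = {}
    for user, per_community in counts.items():
        total = sum(per_community.values())
        features[user] = [per_community[c] / total for c in communities]
    return features


def comment_text_features(comments, vocabulary):
    """Bag-of-words term counts per user over a fixed vocabulary
    (whitespace tokenization, lowercased; an assumed encoding)."""
    counts = defaultdict(Counter)
    for user, text in comments:
        counts[user].update(text.lower().split())
    return {user: [c[w] for w in vocabulary] for user, c in counts.items()}


def hybrid_features(activity, text):
    """Concatenate both feature sets for users present in both."""
    return {u: activity[u] + text[u] for u in activity if u in text}


# Hypothetical toy data, for illustration only.
engagements = [
    ("alice", "python"),
    ("alice", "python"),
    ("alice", "cooking"),
    ("bob", "cooking"),
]
comments = [("alice", "I love Python"), ("bob", "love cooking pasta")]
communities = ["python", "cooking"]
vocabulary = ["love", "python", "pasta"]

activity = community_activity_features(engagements, communities)
text = comment_text_features(comments, vocabulary)
hybrid = hybrid_features(activity, text)
```

Each resulting vector could then be passed to any of the classifiers the abstract lists. Which classifier suits which feature set is the question the paper itself addresses, so the sketch does not assume an answer.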
Pages: 16
Related papers
50 in total
  • [21] Detecting suicidality on social media: Machine learning at rescue
    Rabani, Syed Tanzeel
    Khanday, Akib Mohi Ud Din
    Khan, Qamar Rayees
    Hajam, Umar Ayoub
    Imran, Ali Shariq
    Kastrati, Zenun
    EGYPTIAN INFORMATICS JOURNAL, 2023, 24 (02) : 291 - 302
  • [22] Employer ratings in social media and firm performance: Evidence from an explainable machine learning approach
    Ylinen, Mika
    Ranta, Mikko
    ACCOUNTING AND FINANCE, 2024, 64 (01) : 247 - 276
  • [23] Sentiment analysis of Arabic social media texts: A machine learning approach to deciphering customer perceptions
    Alsemaree, Ohud
    Alam, Atm S.
    Gill, Sukhpal Singh
    Uhlig, Steve
    HELIYON, 2024, 10 (09)
  • [24] Machine Learning for Social Science: An Agnostic Approach
    Grimmer, Justin
    Roberts, Margaret E.
    Stewart, Brandon M.
    ANNUAL REVIEW OF POLITICAL SCIENCE, 2021, 24 : 395 - 419
  • [25] Multi-Class Sentiment Analysis of Social Media Data with Machine Learning Algorithms
    Mutanov, Galimkair
    Karyukin, Vladislav
    Mamykova, Zhanl
    CMC-COMPUTERS MATERIALS & CONTINUA, 2021, 69 (01) : 913 - 930
  • [26] Personality Prediction from Social Media Images: A Content Driven Approach
    Sahu, Yuktee
    Ramani, Yash
    Parekh, Viral
    Maru, Nishit
    PROCEEDINGS OF THE 2019 6TH INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT (INDIACOM), 2019 : 517 - 521
  • [27] Melting of Privacy with Machine Learning, Big Data, and Social Media
    Canbay, Pelin
    Demircioglu, Zubeyde
    ACTA INFOLOGICA, 2023, 7 (01)
  • [28] A Machine Learning Technique for Detection of Social Media Fake News
    Arowolo, Micheal Olaolu
    Misra, Sanjay
    Ogundokun, Roseline Oluwaseun
    INTERNATIONAL JOURNAL ON SEMANTIC WEB AND INFORMATION SYSTEMS, 2023, 19 (01)
  • [29] Rumor Detection Using Machine Learning Techniques on Social Media
    Kumar, Akshi
    Sangwan, Saurabh Raj
    INTERNATIONAL CONFERENCE ON INNOVATIVE COMPUTING AND COMMUNICATIONS, VOL 2, 2019, 56 : 213 - 221
  • [30] Machine Learning for Mental Health in Social Media: Bibliometric Study
    Kim, Jina
    Lee, Daeun
    Park, Eunil
    JOURNAL OF MEDICAL INTERNET RESEARCH, 2021, 23 (03)