Novel Transformer Based Contextualized Embedding and Probabilistic Features for Depression Detection From Social Media

Cited by: 5
Authors
Abbas, Muhammad Asad [1]
Munir, Kashif [1]
Raza, Ali [2]
Samee, Nagwan Abdel [3]
Jamjoom, Mona M. [4]
Ullah, Zahid [5]
Affiliations
[1] Khwaja Fareed Univ Engn & Informat Technol, Inst Informat Technol, Rahim Yar Khan 64200, Pakistan
[2] Univ Lahore, Dept Software Engn, Lahore 54000, Pakistan
[3] Princess Nourah Bint Abdulrahman Univ, Coll Comp & Informat Sci, Dept Informat Technol, POB 84428, Riyadh 11671, Saudi Arabia
[4] Princess Nourah Bint Abdulrahman Univ, Coll Comp & Informat Sci, Dept Comp Sci, Riyadh 11671, Saudi Arabia
[5] King Abdulaziz Univ, Dept Informat Syst, Jeddah 21589, Saudi Arabia
Keywords
Depression detection; machine learning; deep learning; text mining; BERT; transformer
DOI
10.1109/ACCESS.2024.3387695
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Depression is a significant mental health condition that affects an individual's emotional state, thought processes, and ability to carry out everyday tasks. It is characterized by persistent sadness, diminished interest in previously enjoyed activities, changes in appetite, sleep disturbances, low energy, and difficulty concentrating. Its impact extends beyond the individual, affecting society at large through decreased productivity and higher healthcare costs. On social media, users often express their thoughts and emotions through posts, which can provide useful data for identifying patterns of depression. This research aims to detect depression early by analyzing social media user content with machine learning techniques. We built machine learning models using a benchmark depression dataset containing 20,000 labeled tweets from user profiles identified as depressed or non-depressed. We introduce a novel BERT-RF feature engineering method that extracts contextualized embeddings and probabilistic features from textual input. The Bidirectional Encoder Representations from Transformers (BERT) model, based on the Transformer architecture, is used to extract contextualized embedding features. These features are then fed into a random forest model to generate class-probability features, which aid the identification of depression from social media. To classify tweets using the features derived from the BERT-RF feature engineering step, we used five popular classifiers: Random Forest (RF), Multilayer Perceptron (MLP), K-Neighbors Classifier (KNC), Logistic Regression (LR), and Long Short-Term Memory (LSTM). Evaluation experiments show that, with BERT-RF feature engineering, the Logistic Regression model outperforms state-of-the-art methods with an accuracy of 99%. We validated the results through k-fold cross-validation and statistical t-tests, achieving 99% k-fold accuracy for the proposed approach. This research contributes to computational linguistics and mental health analytics by providing a robust approach to the early detection of user depression from social media content.
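The pipeline described in the abstract (embeddings, then random-forest class probabilities, then a final classifier checked with k-fold cross-validation) might be sketched roughly as follows. This is an illustrative sketch only, not the authors' implementation: the embeddings are synthetic stand-ins for BERT output, the function name `rf_probability_features`, the dimensions, and the hyperparameters are hypothetical, and computing the random-forest probabilities out-of-fold is an assumption made here to avoid label leakage (the abstract does not say how this was done).

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import (
    StratifiedKFold,
    cross_val_predict,
    cross_val_score,
)

# Stand-in for BERT sentence embeddings (assumption): in practice each row
# would be the contextualized embedding of one tweet.
rng = np.random.default_rng(0)
n_tweets, dim = 400, 32
labels = rng.integers(0, 2, size=n_tweets)   # 1 = depressed, 0 = non-depressed
embeddings = rng.normal(size=(n_tweets, dim))
embeddings[:, 0] += 1.5 * labels             # inject a weak synthetic class signal


def rf_probability_features(X, y, n_splits=5, seed=0):
    """Return out-of-fold class probabilities from a random forest.

    Each row is predicted by a forest that never saw that row's label
    (an assumption of this sketch, not a detail taken from the paper).
    """
    rf = RandomForestClassifier(n_estimators=100, random_state=seed)
    cv = StratifiedKFold(n_splits=n_splits, shuffle=True, random_state=seed)
    return cross_val_predict(rf, X, y, cv=cv, method="predict_proba")


# Step 1: embeddings -> class-probability features (one column per class).
prob_features = rf_probability_features(embeddings, labels)

# Step 2: logistic regression on the probability features, scored with k-fold CV.
cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=1)
scores = cross_val_score(LogisticRegression(), prob_features, labels, cv=cv)
print("fold accuracies:", np.round(scores, 3))
```

Because the data here is synthetic, the printed accuracies say nothing about the 99% figure reported in the abstract; the sketch only shows how the stages connect.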
Pages: 54087-54100
Number of pages: 14