Early Risk Prediction of Depression Based on Social Media Posts in Arabic

被引:1
作者
Sabaneh, Kefaya [1 ]
Abu Salameh, Momen [2 ]
Khaleel, Fatima [2 ]
Herzallah, Mohammad M. [3 ]
Natsheh, Joman Y. [4 ]
Maree, Mohammed [1 ]
机构
[1] Arab Amer Univ Palestine, Fac Informat Technol, Jenin, Palestine
[2] Arab Amer Univ Palestine, Fac Grad Studies, Jenin, Palestine
[3] Al Quds Univ, Palestinian Neurosci Initiat, Jerusalem, Palestine
[4] Childrens Specialized Hosp Res Ctr, Newark, NJ USA
来源
2023 IEEE 35TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, ICTAI | 2023年
关键词
Social Media; Depression; Prediction; UMLS; QuickUMLS; Machine Learning; Feature Extraction; TF-IDF; TEXT;
D O I
10.1109/ICTAI59109.2023.00094
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Depression is a prevalent global health issue, impacting various aspects of individuals' lives, including home and social interactions. In the Arabic environment, the stigma surrounding mental disorders and the limited awareness in the psychiatry domain has made the early diagnosis of depression a challenging task. However, social media platforms have enabled individuals to express their thoughts and personal experiences, making these platforms a valuable resource for mental health monitoring. In this paper, we propose an approach to predict the early signs of depression utilizing posts expressed in Arabic on the Twitter platform. The proposed methodology integrates knowledge extracted using an LLM-based transformer, the UMLS medical knowledge resource, and machine learning prediction algorithms. To the best of our knowledge, this is the first research study that maps LLM-based translated texts to external medical knowledge resources to improve the accuracy of the prediction model. The proposed model consists of four phases. Firstly, NLP-based data preprocessing pipeline is employed to ensure the input dataset is in a suitable format for analysis. Secondly, the ChatGPT transformer is utilized to translate Arabic tweets into English, enabling further processing and analysis in English. Thirdly, relevant medical concepts are extracted from the translated text using the quickUMLS tool and UMLS metathesaurus, aiding in identifying important terms related to mental health. Fourthly, TF-IDF and Bag of Words (BOW) algorithms are used to assign weights to the extracted features, highlighting the significance of concepts. Finally, classification algorithms, including Support Vector Machine (SVM), Logistic Regression (LR), Random Forest (RF), Naive Bayes (NB), and Stochastic Gradient Descent (SGD), are trained using the extracted concepts. Among these classifiers, Random Forest with Bag of Words demonstrated the best performance, achieving an accuracy of 80.24%.
引用
收藏
页码:595 / 602
页数:8
相关论文
共 34 条
  • [1] Predicting Depression Symptoms in an Arabic Psychological Forum
    Alghamdi, Norah Saleh
    Mahmoud, Hanan A. Hosni
    Abraham, Ajith
    Alanazi, Samar Awadh
    Garcia-Hernandez, Laura
    [J]. IEEE ACCESS, 2020, 8 (08): : 57317 - 57334
  • [2] Attention-Based Bi-LSTM Model for Arabic Depression Classification
    Almars, Abdulqader M.
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 71 (02): : 3091 - 3106
  • [3] Almouzini S., 2019, PROCEDIA COMPUTER SC, V163, P257, DOI DOI 10.1016/J.PROCS.2019.12.107
  • [4] Alsudias L, 2020, PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), P4842
  • [5] UMLS users and uses: a current overview
    Amos, Liz
    Anderson, David
    Brody, Stacy
    Ripple, Anna
    Humphreys, Betsy L.
    [J]. JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2020, 27 (10) : 1606 - 1611
  • [6] Aronson AR, 2001, J AM MED INFORM ASSN, P17
  • [7] Depression and quality of life in older adults: Mediation effect of steep quality
    Becker, Nathalia Brandolim
    de Jesus, Saul Neves
    Viseu, Joao N.
    Stobaus, Claus Dieter
    Guerreiro, Mariana
    Domingues, Rita B.
    [J]. INTERNATIONAL JOURNAL OF CLINICAL AND HEALTH PSYCHOLOGY, 2018, 18 (01) : 8 - 17
  • [8] Cross-lingual Unified Medical Language System entity linking in online health communities
    Bitton, Yonatan
    Cohen, Raphael
    Schifter, Tamar
    Bachmat, Eitan
    Elhadad, Michael
    Elhadad, Noemie
    [J]. JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2020, 27 (10) : 1585 - 1592
  • [9] Accuracy of Google Translate in translating the directions and counseling points for top-selling drugs from English to Arabic, Chinese, and Spanish
    Cornelison, Bernadette R.
    Al-Mohaish, Sulaiman
    Sun, Yizhou
    Edwards, Christopher J.
    [J]. AMERICAN JOURNAL OF HEALTH-SYSTEM PHARMACY, 2021, 78 (22) : 2053 - 2058
  • [10] El-Ramly M., 2021, 2021 10 INT C INT CO, DOI [10.1109/icicis52592.2021.9694178, DOI 10.1109/ICICIS52592.2021.9694178]