Early Risk Prediction of Depression Based on Social Media Posts in Arabic

被引:1
作者
Sabaneh, Kefaya [1 ]
Abu Salameh, Momen [2 ]
Khaleel, Fatima [2 ]
Herzallah, Mohammad M. [3 ]
Natsheh, Joman Y. [4 ]
Maree, Mohammed [1 ]
机构
[1] Arab Amer Univ Palestine, Fac Informat Technol, Jenin, Palestine
[2] Arab Amer Univ Palestine, Fac Grad Studies, Jenin, Palestine
[3] Al Quds Univ, Palestinian Neurosci Initiat, Jerusalem, Palestine
[4] Childrens Specialized Hosp Res Ctr, Newark, NJ USA
来源
2023 IEEE 35TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, ICTAI | 2023年
关键词
Social Media; Depression; Prediction; UMLS; QuickUMLS; Machine Learning; Feature Extraction; TF-IDF; TEXT;
D O I
10.1109/ICTAI59109.2023.00094
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Depression is a prevalent global health issue, impacting various aspects of individuals' lives, including home and social interactions. In the Arabic environment, the stigma surrounding mental disorders and the limited awareness in the psychiatry domain has made the early diagnosis of depression a challenging task. However, social media platforms have enabled individuals to express their thoughts and personal experiences, making these platforms a valuable resource for mental health monitoring. In this paper, we propose an approach to predict the early signs of depression utilizing posts expressed in Arabic on the Twitter platform. The proposed methodology integrates knowledge extracted using an LLM-based transformer, the UMLS medical knowledge resource, and machine learning prediction algorithms. To the best of our knowledge, this is the first research study that maps LLM-based translated texts to external medical knowledge resources to improve the accuracy of the prediction model. The proposed model consists of four phases. Firstly, NLP-based data preprocessing pipeline is employed to ensure the input dataset is in a suitable format for analysis. Secondly, the ChatGPT transformer is utilized to translate Arabic tweets into English, enabling further processing and analysis in English. Thirdly, relevant medical concepts are extracted from the translated text using the quickUMLS tool and UMLS metathesaurus, aiding in identifying important terms related to mental health. Fourthly, TF-IDF and Bag of Words (BOW) algorithms are used to assign weights to the extracted features, highlighting the significance of concepts. Finally, classification algorithms, including Support Vector Machine (SVM), Logistic Regression (LR), Random Forest (RF), Naive Bayes (NB), and Stochastic Gradient Descent (SGD), are trained using the extracted concepts. Among these classifiers, Random Forest with Bag of Words demonstrated the best performance, achieving an accuracy of 80.24%.
引用
收藏
页码:595 / 602
页数:8
相关论文
共 34 条
[21]   Head Concepts Selection for Verbose Medical Queries Expansion [J].
Maree, Mohammed ;
Noor, Israa ;
Rabayah, Khaled S. ;
Belkhatir, Mohammed ;
Alhashmi, Saadat M. .
IEEE ACCESS, 2020, 8 :93987-93999
[22]  
McCann T., 2023, Understanding chatgpt as explained by Chatgpt, Advancing Analytics
[23]   Twitter Arabic Sentiment Analysis to Detect Depression Using Machine Learning [J].
Musleh, Dhiaa A. ;
Alkhales, Taef A. ;
Almakki, Reem A. ;
Alnajim, Shahad E. ;
Almarshad, Shaden K. ;
Alhasaniah, Rana S. ;
Aljameel, Sumayh S. ;
Almuqhim, Abdullah A. .
CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 71 (02) :3463-3477
[24]   Ethics and Privacy in Social Media Research for Mental Health [J].
Nicholas, Jennifer ;
Onie, Sandersan ;
Larsen, Mark E. .
CURRENT PSYCHIATRY REPORTS, 2020, 22 (12)
[25]  
Paul Sayanta, 2018, CLEF
[26]  
Rabie E.M., 2022, PREPRINT, DOI [10.21203/rs.3.rs-2281584/v1, DOI 10.21203/RS.3.RS-2281584/V1]
[27]  
Raja MS, 2022, Webology, V19, P250, DOI [10.14704/web/v19i1/web19019, DOI 10.14704/WEB/V19I1/WEB19019]
[28]   Comparison of MetaMap and cTAKES for entity extraction in clinical notes [J].
Reategui, Ruth ;
Ratte, Sylvie .
BMC MEDICAL INFORMATICS AND DECISION MAKING, 2018, 18
[29]   Detecting Depression Signs on Social Media: A Systematic Literature Review [J].
Salas-Zarate, Rafael ;
Alor-Hernandez, Giner ;
del Pilar Salas-Zarate, Maria ;
Andres Paredes-Valverde, Mario ;
Bustos-Lopez, Maritza ;
Luis Sanchez-Cervantes, Jose .
HEALTHCARE, 2022, 10 (02)
[30]   Mayo clinical Text Analysis and Knowledge Extraction System (cTAKES): architecture, component evaluation and applications [J].
Savova, Guergana K. ;
Masanz, James J. ;
Ogren, Philip V. ;
Zheng, Jiaping ;
Sohn, Sunghwan ;
Kipper-Schuler, Karin C. ;
Chute, Christopher G. .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2010, 17 (05) :507-513