Leveraging web scraping and stacking ensemble machine learning techniques to enhance detection of major depressive disorder from social media posts

被引:0
|
作者
Hridoy, Md. Tanvir Ahammed [1 ]
Saha, Susmita Rani [1 ]
Islam, Md Manowarul [1 ]
Uddin, Md Ashraf [1 ]
Mahmud, Md. Zulfiker [1 ]
机构
[1] Jagannath Univ, Dept Comp Sci & Engn, Dhaka, Bangladesh
关键词
Major depressive disorder; Suicide; Early detection; Machine learning; Deep learning; Web scraping; Twitter; Reddit; Stacking ensemble;
D O I
10.1007/s13278-024-01392-w
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Social media has become a platform for people to express emotions, including happiness and sadness, to their followers. Major Depressive Disorder (MDD), a common mental health disorder, is characterized by sadness and loss of interest in activities, leading to physical, emotional, cognitive, and social suicidal thoughts. Early detection and intervention of MDD are crucial for effective management and treatment. The study investigates the potential of detecting MDD on social media platforms like Facebook, Twitter and Reddit by analyzing text using advanced machine learning and deep learning algorithms. In order to collect dataset, we employed both web scraping techniques and publically existing datasets (Twitter, Reddit) that are available on the Kaggle website. Natural language processing (NLP) techniques are applied to preprocess and excerpt meaningful features from the textual data. Several machine learning algorithms are employed to make prophetic models for MDD discovery grounded on verbal patterns, sentiment analysis, and verbal labels associated with depressive symptoms. We analyse our models using three datasets. The two online datasets for which the LSTM algorithm performs best are Reddit with 93.72% accuracy, Twitter with 99.85% accuracy, and our dataset which is extracted using web scraping technologies from Reddit gets 96.47% accuracy utilizing Stacking ensemble. The model's performance is thoroughly assessed using a variety of criteria, such as accuracy, precision, recall, and F1-score. Additionally, We find an approach with a more effective ML framework for enhancing MDD detection.
引用
收藏
页数:21
相关论文
共 50 条
  • [41] Prediction of postpartum depression using machine learning techniques from social media text
    Fatima, Iram
    Abbasi, Burhan Ud Din
    Khan, Sharifullah
    Al-Saeed, Majed
    Ahmad, Hafiz Farooq
    Mumtaz, Rafia
    EXPERT SYSTEMS, 2019, 36 (04)
  • [42] Got Many Labels? Deriving Topic Labels from Multiple Sources for Social Media Posts using Crowdsourcing and Ensemble Learning
    Chang, Shuo
    Dai, Peng
    Chen, Jilin
    Chi, Ed H.
    WWW'15 COMPANION: PROCEEDINGS OF THE 24TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB, 2015, : 397 - 406
  • [43] Machine learning classifiers with pre-processing techniques for rumour detection on social media: an empirical study
    Al-Sarem M.
    Al-Harby M.
    Saeed F.
    Hezzam E.A.
    International Journal of Cloud Computing, 2022, 11 (04) : 330 - 344
  • [44] Identifying neuroimaging biomarkers of major depressive disorder from cortical hemodynamic responses using machine learning approaches
    Li, Zhifei
    McIntyre, Roger S.
    Husain, Syeda F.
    Ho, Roger
    Tran, Bach X.
    Nguyen, Hien Thu
    Soo, Shuenn-Chiang
    Ho, Cyrus S.
    Chen, Nanguang
    EBIOMEDICINE, 2022, 79
  • [45] Machine Learning Insights from Enigma's Studies of Major Depressive Disorder: Classification via Distributed Analysis
    Zhu, Dajiang
    Thompson, Paul M.
    Schmaal, Lianne
    Veltman, Dick
    BIOLOGICAL PSYCHIATRY, 2017, 81 (10) : S307 - S307
  • [46] Automatic detection of eating disorder-related social media posts that could benefit from a mental health intervention
    Yan, Hao
    Fitzsimmons-Craft, Ellen E.
    Goodman, Micah
    Krauss, Melissa
    Das, Sanmay
    Cavazos-Rehg, Patricia
    INTERNATIONAL JOURNAL OF EATING DISORDERS, 2019, 52 (10) : 1150 - 1156
  • [47] Classical-quantum hybrid transfer learning for adverse drug reaction detection from social media posts
    Dey, Arijit
    Shrivastava, Jitendra Nath
    Kumar, Chandan
    JOURNAL OF COMPUTATIONAL SOCIAL SCIENCE, 2024, 7 (02): : 1433 - 1450
  • [48] Validation of Machine Learning-Based Assessment of Major Depressive Disorder from Paralinguistic Speech Characteristics in Routine Care
    Bauer, Jonathan F.
    Gerczuk, Maurice
    Schindler-Gmelch, Lena
    Amiriparian, Shahin
    Ebert, David Daniel
    Krajewski, Jarek
    Schuller, Bjoern
    Berking, Matthias
    DEPRESSION AND ANXIETY, 2024, 2024
  • [49] The role of senescence-related genes in major depressive disorder: insights from machine learning and single cell analysis
    Lian, Kun
    Yang, Wei
    Ye, Jing
    Chen, Yilan
    Zhang, Lei
    Xu, Xiufeng
    BMC PSYCHIATRY, 2025, 25 (01)
  • [50] DECEN: A deep learning model enhanced by depressive emotions for depression detection from social media content
    Yan, Zhijun
    Peng, Fei
    Zhang, Dongsong
    DECISION SUPPORT SYSTEMS, 2025, 191