Leveraging web scraping and stacking ensemble machine learning techniques to enhance detection of major depressive disorder from social media posts

被引:0
|
作者
Hridoy, Md. Tanvir Ahammed [1 ]
Saha, Susmita Rani [1 ]
Islam, Md Manowarul [1 ]
Uddin, Md Ashraf [1 ]
Mahmud, Md. Zulfiker [1 ]
机构
[1] Jagannath Univ, Dept Comp Sci & Engn, Dhaka, Bangladesh
关键词
Major depressive disorder; Suicide; Early detection; Machine learning; Deep learning; Web scraping; Twitter; Reddit; Stacking ensemble;
D O I
10.1007/s13278-024-01392-w
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Social media has become a platform for people to express emotions, including happiness and sadness, to their followers. Major Depressive Disorder (MDD), a common mental health disorder, is characterized by sadness and loss of interest in activities, leading to physical, emotional, cognitive, and social suicidal thoughts. Early detection and intervention of MDD are crucial for effective management and treatment. The study investigates the potential of detecting MDD on social media platforms like Facebook, Twitter and Reddit by analyzing text using advanced machine learning and deep learning algorithms. In order to collect dataset, we employed both web scraping techniques and publically existing datasets (Twitter, Reddit) that are available on the Kaggle website. Natural language processing (NLP) techniques are applied to preprocess and excerpt meaningful features from the textual data. Several machine learning algorithms are employed to make prophetic models for MDD discovery grounded on verbal patterns, sentiment analysis, and verbal labels associated with depressive symptoms. We analyse our models using three datasets. The two online datasets for which the LSTM algorithm performs best are Reddit with 93.72% accuracy, Twitter with 99.85% accuracy, and our dataset which is extracted using web scraping technologies from Reddit gets 96.47% accuracy utilizing Stacking ensemble. The model's performance is thoroughly assessed using a variety of criteria, such as accuracy, precision, recall, and F1-score. Additionally, We find an approach with a more effective ML framework for enhancing MDD detection.
引用
收藏
页数:21
相关论文
共 50 条
  • [21] Stacked ensemble machine learning approach for electroencephalography based major depressive disorder classification using temporal statistics
    Ahmed, Nader Nisar
    Bhat, Tejas Kadengodlu
    Powar, Omkar S.
    SYSTEMS SCIENCE & CONTROL ENGINEERING, 2024, 12 (01)
  • [22] Detection of major depressive disorder using vocal acoustic analysis and machine learning—an exploratory study
    Espinola C.W.
    Gomes J.C.
    Pereira J.M.S.
    dos Santos W.P.
    Research on Biomedical Engineering, 2021, 37 (01) : 53 - 64
  • [23] Empowering machine learning models with contextual knowledge for enhancing the detection of eating disorders in social media posts
    Alberto Benitez-Andrades, Jose
    Teresa Garcia-Ordas, Maria
    Russo, Mayra
    Sakor, Ahmad
    Rotger, Luis Daniel Fernandes
    Vidal, Maria-Esther
    SEMANTIC WEB, 2023, 14 (05) : 873 - 892
  • [24] Using an Interpretable Amino Acid-Based Machine Learning Method to Enhance the Diagnosis of Major Depressive Disorder
    Ho, Cyrus Su Hui
    Tan, Trevor Wei Kiat
    Khoe, Howard Cai Hao
    Chan, Yee Ling
    Tay, Gabrielle Wann Nii
    Tang, Tong Boon
    JOURNAL OF CLINICAL MEDICINE, 2024, 13 (05)
  • [25] Innovative Use of Self-Attention-Based Ensemble Deep Learning for Suicide Risk Detection in Social Media Posts
    Choi, Hoan-Suk
    Yang, Jinhong
    APPLIED SCIENCES-BASEL, 2024, 14 (02):
  • [26] Identifying major depressive disorder among US adults living alone using stacked ensemble machine learning algorithms
    Chen, Zhao
    Liu, Hao
    Zhang, Yao
    Xing, Fei
    Jiang, Jiabao
    Xiang, Zhou
    Duan, Xin
    FRONTIERS IN PUBLIC HEALTH, 2025, 13
  • [27] Ensemble machine learning technique-based plagiarism detection over opinions in social media
    Vadivu, Sethu Vinayaga
    Nagaraj, Palanigurupackiam
    Murugan, Bagavathi Ammai Shanmugam
    AUTOMATIKA, 2024, 65 (03) : 983 - 991
  • [28] A diagnostic model based on bioinformatics and machine learning to differentiate bipolar disorder from schizophrenia and major depressive disorder
    Shen, Jing
    Xiao, Chenxu
    Qiao, Xiwen
    Zhu, Qichen
    Yan, Hanfei
    Pan, Julong
    Feng, Yu
    SCHIZOPHRENIA, 2024, 10 (01)
  • [29] Demystifying Black-box Learning Models of Rumor Detection from Social Media Posts
    Tafannum, Faiza
    Shopnil, Mir Nafis Sharear
    Salsabil, Anika
    Ahmed, Navid
    Alam, Md Golam Rabiul
    Reza, Md Tanzim
    2021 IEEE 12TH ANNUAL UBIQUITOUS COMPUTING, ELECTRONICS & MOBILE COMMUNICATION CONFERENCE (UEMCON), 2021, : 358 - 364
  • [30] Detecting Pain Points from User-Generated Social Media Posts Using Machine Learning
    Salminen, Joni
    Mustak, Mekhail
    Corporan, Juan
    Jung, Soon-gyo
    Jansen, Bernard J.
    JOURNAL OF INTERACTIVE MARKETING, 2022, 57 (03) : 517 - 539