Using Large Language Models to Understand Suicidality in a Social Media-Based Taxonomy of Mental Health Disorders: Linguistic Analysis of Reddit Posts

被引:2
|
作者
Bauer, Brian [1 ]
Norel, Raquel [2 ]
Leow, Alex [3 ,4 ]
Rached, Zad Abi [5 ]
Wen, Bo [2 ]
Cecchi, Guillermo [2 ]
机构
[1] Univ Georgia, Dept Psychol, 125 Baldwin St, Athens, GA 30602 USA
[2] IBM Res, Digital Hlth, Yorktown Hts, NY USA
[3] Univ Illinois, Dept Psychiat, Chicago, IL USA
[4] Univ Illinois, Dept Biomed Engn & Comp Sci, Chicago, IL USA
[5] Coll Louise Wegmann, Beirut, Lebanon
来源
JMIR MENTAL HEALTH | 2024年 / 11卷
关键词
natural language processing; explainable AI; suicide; mental health disorders; mental health disorder; mental health; social media; online discussions; online; large language model; LLM; downstream analyses; trauma; stress; depression; anxiety; AI; artificial intelligence; explainable artificial intelligence; web-based discussions; METAANALYSIS; DEPRESSION; RISK;
D O I
10.2196/57234
中图分类号
R749 [精神病学];
学科分类号
100205 ;
摘要
Background: Rates of suicide have increased by over 35% since 1999. Despite concerted efforts, our ability to predict, explain, or treat suicide risk has not significantly improved over the past 50 years. Objective: The aim of this study was to use large language models to understand natural language use during public web-based discussions (on Reddit) around topics related to suicidality. Methods: We used large language model-based sentence embedding to extract the latent linguistic dimensions of user postings derived from several mental health-related subreddits, with a focus on suicidality. We then applied dimensionality reduction to these sentence embeddings, allowing them to be summarized and visualized in a lower-dimensional Euclidean space for further downstream analyses. We analyzed 2.9 million posts extracted from 30 subreddits, including r/SuicideWatch, between October 1 and December 31, 2022, and the same period in 2010. Results: Our results showed that, in line with existing theories of suicide, posters in the suicidality community (r/SuicideWatch) predominantly wrote about feelings of disconnection, burdensomeness, hopeless, desperation, resignation, and trauma. Further, we identified distinct latent linguistic dimensions (well-being, seeking support, and severity of distress) among all mental health subreddits, and many of the resulting subreddit clusters were in line with a statistically driven diagnostic classification system-namely, the Hierarchical Taxonomy of Psychopathology (HiTOP)-by mapping onto the proposed Conclusions: Overall, our findings provide data-driven support for several language-based theories of suicide, as well processing techniques can assist researchers in gaining deeper insights about emotions and experiences shared on the web and may aid in the validation and refutation of different mental health theories.
引用
收藏
页数:13
相关论文
共 12 条
  • [1] Tracking pregnant women's mental health through social media: an analysis of reddit posts
    Dhankar, Abhishek
    Katz, Alan
    JAMIA OPEN, 2023, 6 (04)
  • [2] Understanding Mental Health Issues in Different Subdomains of Social Networking Services: Computational Analysis of Text-Based Reddit Posts
    Kim, Seoyun
    Cha, Junyeop
    Kim, Dongjae
    Park, Eunil
    JOURNAL OF MEDICAL INTERNET RESEARCH, 2023, 25 : e49074
  • [3] An emotion and cognitive based analysis of mental health disorders from social media data
    Uban, Ana-Sabina
    Chulvi, Berta
    Rosso, Paolo
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2021, 124 : 480 - 494
  • [4] A platform for connecting social media data to domain-specific topics using large language models: an application to student mental health
    Ruocco, Leonard
    Zhuang, Yuqian
    Ng, Raymond
    Munthali, Richard J.
    Hudec, Kristen L.
    Wang, Angel Y.
    Vereschagin, Melissa
    Vigo, Daniel V.
    JAMIA OPEN, 2024, 7 (01)
  • [5] Sentence Level Analysis for Detecting Mental Health Causes Using Social Media Posts
    Azmee, Abm Adnan
    Murikipudi, Manohar
    Khan, Md Abdullah Al Hatiz
    Pei, Yong
    2023 IEEE 47TH ANNUAL COMPUTERS, SOFTWARE, AND APPLICATIONS CONFERENCE, COMPSAC, 2023, : 1388 - 1393
  • [6] Detection and Prediction of Future Mental Disorder From Social Media Data Using Machine Learning, Ensemble Learning, and Large Language Models
    Abdullah, Mohammed
    Negied, Nermin
    IEEE ACCESS, 2024, 12 : 120553 - 120569
  • [7] Balanced and Explainable Social Media Analysis for Public Health with Large Language Models
    Jiang, Yan
    Qiu, Ruihong
    Zhang, Yi
    Zhang, Peng-Fei
    DATABASES THEORY AND APPLICATIONS, ADC 2023, 2024, 14386 : 73 - 86
  • [8] Detection of Depression Severity Using Bengali Social Media Posts on Mental Health: Study Using Natural Language Processing Techniques
    Kabir, Muhammad Khubayeeb
    Islam, Maisha
    Kabir, Anika Nahian Binte
    Haque, Adiba
    Rhaman, Md Khalilur
    JMIR FORMATIVE RESEARCH, 2022, 6 (09)
  • [9] Leveraging Prompt-Based Large Language Models: Predicting Pandemic Health Decisions and Outcomes Through Social Media Language
    Ding, Xiaohan
    Carik, Buse
    Gunturi, Uma Sushmitha
    Reyna, Valerie
    Rho, Eugenia H.
    PROCEEDINGS OF THE 2024 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYTEMS, CHI 2024, 2024,
  • [10] Sentiment analysis of the United States public support of nuclear power on social media using large language models
    Kwon, O. Hwang
    Vu, Katie
    Bhargava, Naman
    Radaideh, Mohammed I.
    Cooper, Jacob
    Joynt, Veda
    Radaideh, Majdi I.
    RENEWABLE & SUSTAINABLE ENERGY REVIEWS, 2024, 200