Predicting suicidal behavior outcomes: an analysis of key factors and machine learning models

被引:0
作者
Bazrafshan, Mohammad [1 ]
Sayehmiri, Kourosh [2 ]
机构
[1] Ilam Univ Med Sci, Fac Med, Ilam, Iran
[2] Ilam Univ Med Sci, Fac Hlth, Dept Biostat, Ilam, Iran
关键词
Suicide; Suicide attempt; Suicidal behavior; Suicide risk factors; Machine learning; Classification algorithms; RISK-FACTORS; LOGISTIC-REGRESSION; GENERAL-POPULATION; COMPLETED SUICIDE; MEANS RESTRICTION; MORTALITY;
D O I
10.1186/s12888-024-06273-2
中图分类号
R749 [精神病学];
学科分类号
100205 ;
摘要
Background Suicidal behaviors, which may lead to death (suicide) or survival (suicide attempt), are influenced by various factors. Identifying the specific risk factors for suicidal behavior mortality is critical for improving prevention strategies and clinical interventions. Predicting the outcomes of suicidal behaviors can help identify individuals at higher risk of death, enabling timely and targeted interventions. This study aimed to determine the critical risk factors associated with suicidal behavior mortality and identify an effective classification model for predicting suicidal behavior outcomes. Materials and methods This study utilized data recorded in the suicidal behavior registry system of hospitals in Ilam Province. In the first phase, duplicate records were removed, and the data was numerically encoded via Python version 3.11; then, the data was analyzed using chi-square and Fisher's exact tests in SPSS version 22 software to identify the factors influencing suicidal behavior mortality. In the second phase, missing data were removed, and the dataset was standardized. Five binary classification algorithms were utilized, including Random Forest, Logistic Regression, and Decision Trees, with hyperparameters optimized using the area under the receiver operating characteristic curve (AUC) and F1 score metrics. These models were compared based on accuracy, recall, precision, F1 score, and AUC. Results Among 3833 cases of suicidal behavior in various hospitals in Ilam Province, the results indicated that the method of suicidal behavior (P < 0.001), reason for suicidal behavior (P < 0.001), age group (P < 0.001), education level (P < 0.001), marital status (P = 0.004), and employment status (P = 0.042) were significantly associated with suicide. Variables such as the season of suicidal behavior, gender, father's education, and mother's education were not significantly related to suicidal behavior mortality. Furthermore, the random forest model demonstrated the highest area under the ROC curve (0.79) and the highest classification accuracy and F1 score on both the training data (0.85 and 0.2, respectively) and test data (0.86 and 0.31, respectively) for predicting suicidal behaviors outcomes among the models tested. Conclusion This study identified key factors such as older age, lower education, divorce or widowhood, employment, physical methods, and socioeconomic issues as significant predictors of suicidal behavior outcomes. A combination of statistical models for feature selection and machine learning algorithms for prediction was used, with Random Forest showing the best performance. This approach highlights the potential of integrating statistical methods with machine learning to improve suicide risk prediction and intervention strategies.
引用
收藏
页数:12
相关论文
共 56 条
[1]   AN INTRODUCTION TO KERNEL AND NEAREST-NEIGHBOR NONPARAMETRIC REGRESSION [J].
ALTMAN, NS .
AMERICAN STATISTICIAN, 1992, 46 (03) :175-185
[2]  
Amini P, 2016, IRAN J PUBLIC HEALTH, V45, P1179
[3]  
[Anonymous], 2021, Suicide worldwide in 2019: global health estimates
[4]   Prediction of Number of Suicidal People Based on KNN [J].
Aslan, Haci Ismail ;
Yilmaz, Adnan Berat ;
Jeong, Namgyu ;
Lee, Saebom ;
Choi, Chang .
2022 INTERNATIONAL CONFERENCE ON ELECTRONICS, INFORMATION, AND COMMUNICATION (ICEIC), 2022,
[5]   Epidemiology of Suicide and the Psychiatric Perspective [J].
Bachmann, Silke .
INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2018, 15 (07)
[6]   The prediction model of suicidal thoughts in Korean adults using Decision Tree Analysis: A nationwide cross-sectional study [J].
Bae, Sung-Man .
PLOS ONE, 2019, 14 (10)
[7]  
Bakirarar B, 2023, Turk. Klin. J. Biostat., V15, P19, DOI [10.5336/biostatic.2022-93961, DOI 10.5336/BIOSTATIC.2022-93961]
[8]   Risk factors for fatal and nonfatal repetition of suicide attempts: a literature review [J].
Beghi, Massimiliano ;
Rosenbaum, Jerrold F. ;
Cerri, Cesare ;
Cornaggia, Cesare M. .
NEUROPSYCHIATRIC DISEASE AND TREATMENT, 2013, 9 :1725-1735
[9]   Prediction Models for Suicide Attempts and Deaths: A Systematic Review and Simulation [J].
Belsher, Bradley E. ;
Smolenski, Derek J. ;
Pruitt, Larry D. ;
Bush, Nigel E. ;
Beech, Erin H. ;
Workman, Don E. ;
Morgan, Rebecca L. ;
Evatt, Daniel P. ;
Tucker, Jennifer ;
Skopp, Nancy A. .
JAMA PSYCHIATRY, 2019, 76 (06) :642-651
[10]  
Bergstra J, 2012, J MACH LEARN RES, V13, P281