Prediction of Mental Health Problem Using Annual Student Health Survey: Machine Learning Approach

被引：6

作者：

Baba, Ayako ^{[1
,3
]}

Bunji, Kyosuke ^{[2
]}

机构：

[1] Kanazawa Univ, Hlth Serv Ctr, Kanazawa, Ishikawa, Japan

[2] Kobe Univ, Grad Sch Business Adm, Kobe, Hyogo, Japan

[3] Kanazawa Univ, Hlth Serv Ctr, Kakuma Machi, Kanazawa, Ishikawa 9201192, Japan

来源：

JMIR MENTAL HEALTH | 2023年 / 10卷

关键词：

student counseling; health survey; machine learning; mental health problem; response time; PERFORMANCE; IDEATION; JULIA;

D O I：

10.2196/42420

中图分类号：

R749 [精神病学];

学科分类号：

100205 ;

摘要：

Background: One of the reasons why students go to counseling is being called on based on self-reported health survey results. However, there is no concordant standard for such calls.Objective: This study aims to develop a machine learning (ML) model to predict students' mental health problems in 1 year and the following year using the health survey's content and answering time (response time, response time stamp, and answer date).Methods: Data were obtained from the responses of 3561 (62.58%) of 5690 undergraduate students from University A in Japan (a national university) who completed the health survey in 2020 and 2021. We performed 2 analyses; in analysis 1, a mental health problem in 2020 was predicted from demographics, answers for the health survey, and answering time in the same year, and in analysis 2, a mental health problem in 2021 was predicted from the same input variables as in analysis 1. We compared the results from different ML models, such as logistic regression, elastic net, random forest, XGBoost, and LightGBM. The results with and without answering time conditions were compared using the adopted model.Results: On the basis of the comparison of the models, we adopted the LightGBM model. In this model, both analyses and conditions achieved adequate performance (eg, Matthews correlation coefficient [MCC] of with answering time condition in analysis 1 was 0.970 and MCC of without answering time condition in analysis 1 was 0.976; MCC of with answering time condition in analysis 2 was 0.986 and that of without answering time condition in analysis 2 was 0.971). In both analyses and in both conditions, the response to the questions about campus life (eg, anxiety and future) had the highest impact (Gain 0.131-0.216; Shapley additive explanations 0.018-0.028). Shapley additive explanations of 5 to 6 input variables from questions about campus life were included in the top 10. In contrast to our expectation, the inclusion of answering time-related variables did not exhibit substantial improvement in the prediction of students' mental health problems. However, certain variables generated based on the answering time are apparently helpful in improving the prediction and affecting the prediction probability. Conclusions: These results demonstrate the possibility of predicting mental health across years using health survey data. Demographic and behavioral data, including answering time, were effective as well as self-rating items. This model demonstrates the possibility of synergistically using the characteristics of health surveys and advantages of ML. These findings can improve health survey items and calling criteria.

引用

页数：23

共 50 条

[41] Prediction of drinking water quality with machine learning models: A public health nursing approach
Ozsezer, Gozde
Mermer, Gulengul
PUBLIC HEALTH NURSING, 2024, 41 (01) : 175 - 191
[42] A Review of Machine Learning and Deep Learning Approaches on Mental Health Diagnosis
Iyortsuun, Ngumimi Karen
Kim, Soo-Hyung
Jhon, Min
Yang, Hyung-Jeong
Pant, Sudarshan
HEALTHCARE, 2023, 11 (03)
[43] Survey of Machine Learning Techniques for Student Profile Modelling
Hamim, Touria
Benabbou, Faouzia
Sael, Nawal
INTERNATIONAL JOURNAL OF EMERGING TECHNOLOGIES IN LEARNING, 2021, 16 (04) : 136 - 151
[44] Health status prediction for the elderly based on machine learning
Qin, Fang-Yu
Lv, Zhe-Qi
Wang, Dan-Ni
Hu, Bo
Wu, Chao
ARCHIVES OF GERONTOLOGY AND GERIATRICS, 2020, 90
[45] Hepatocellular Carcinoma Risk Prediction in the NIH-AARP Diet and Health Study Cohort: A Machine Learning Approach
Thomas, Jonathan
Liao, Linda M.
Sinha, Rashmi
Patel, Tushar
Antwi, Samuel O.
JOURNAL OF HEPATOCELLULAR CARCINOMA, 2022, 9 : 69 - 81
[46] A comparative study on student performance prediction using machine learning
Chen, Yawen
Zhai, Linbo
EDUCATION AND INFORMATION TECHNOLOGIES, 2023, 28 (09) : 12039 - 12057
[47] Application of Machine Learning in Transformer Health Index Prediction
Alqudsi, Alhaytham
El-Hag, Ayman
ENERGIES, 2019, 12 (14)
[48] A comparative study on student performance prediction using machine learning
Yawen Chen
Linbo Zhai
Education and Information Technologies, 2023, 28 : 12039 - 12057
[49] Student Performance Prediction and Classification Using Machine Learning Algorithms
Sekeroglu, Boran
Dimililer, Kamil
Tuncal, Kubra
PROCEEDINGS OF 2019 8TH INTERNATIONAL CONFERENCE ON EDUCATIONAL AND INFORMATION TECHNOLOGY (ICEIT 2019), 2019, : 7 - 11
[50] Identification of Suicidal Ideation in the Canadian Community Health Survey-Mental Health Component Using Deep Learning
Desai, Sneha
Tanguay-Sela, Myriam
Benrimoh, David
Fratila, Robert
Brown, Eleanor
Perlman, Kelly
John, Ann
DelPozo-Banos, Marcos
Low, Nancy
Israel, Sonia
Palladini, Lisa
Turecki, Gustavo
FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2021, 4

← 1 2 3 4 5 →