Prediction of Mental Health Problem Using Annual Student Health Survey: Machine Learning Approach

Cited by: 6
Authors
Baba, Ayako [1, 3]
Bunji, Kyosuke [2]
Affiliations
[1] Kanazawa Univ, Hlth Serv Ctr, Kanazawa, Ishikawa, Japan
[2] Kobe Univ, Grad Sch Business Adm, Kobe, Hyogo, Japan
[3] Kanazawa Univ, Hlth Serv Ctr, Kakuma Machi, Kanazawa, Ishikawa 9201192, Japan
Source
JMIR MENTAL HEALTH | 2023 / Volume 10
Keywords
student counseling; health survey; machine learning; mental health problem; response time; PERFORMANCE; IDEATION; JULIA;
DOI
10.2196/42420
Chinese Library Classification number
R749 [Psychiatry];
Discipline classification code
100205;
Abstract
Background: One reason students attend counseling is that they are called in on the basis of self-reported health survey results. However, there is no agreed standard for such calls.
Objective: This study aims to develop a machine learning (ML) model that predicts students' mental health problems in the same year and in the following year, using the content of the health survey and the answering time (response time, response time stamp, and answer date).
Methods: Data were obtained from the responses of 3561 (62.58%) of 5690 undergraduate students at University A in Japan (a national university) who completed the health survey in 2020 and 2021. We performed 2 analyses: in analysis 1, a mental health problem in 2020 was predicted from demographics, answers to the health survey, and answering time in the same year; in analysis 2, a mental health problem in 2021 was predicted from the same input variables as in analysis 1. We compared the results of different ML models, such as logistic regression, elastic net, random forest, XGBoost, and LightGBM. The results with and without the answering time variables were then compared using the adopted model.
Results: On the basis of the model comparison, we adopted the LightGBM model. With this model, both analyses and both conditions achieved adequate performance (eg, in analysis 1, the Matthews correlation coefficient [MCC] was 0.970 with answering time and 0.976 without it; in analysis 2, the MCC was 0.986 with answering time and 0.971 without it). In both analyses and in both conditions, the responses to the questions about campus life (eg, anxiety and future) had the highest impact (Gain 0.131-0.216; Shapley additive explanations 0.018-0.028). By Shapley additive explanations, 5 to 6 of the top 10 input variables came from questions about campus life. Contrary to our expectation, including answering time-related variables did not substantially improve the prediction of students' mental health problems. However, certain variables derived from the answering time appeared to help improve the prediction and affected the prediction probability.
Conclusions: These results demonstrate the possibility of predicting mental health across years using health survey data. Demographic and behavioral data, including answering time, were effective alongside self-rating items. This model demonstrates the possibility of synergistically using the characteristics of health surveys and the advantages of ML. These findings can improve health survey items and calling criteria.
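The Matthews correlation coefficient (MCC) reported in the Results can be computed from the four cells of a binary confusion matrix. The following is a minimal Python sketch of that standard formula, not the authors' code; the function name `matthews_corrcoef` and the toy labels are hypothetical and chosen only for illustration.

```python
import math


def matthews_corrcoef(y_true, y_pred):
    """Return the MCC for binary labels coded as 0/1.

    Returns 0.0 when the denominator is zero (for example, when every
    prediction belongs to one class), which is a common convention.
    """
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # true positives
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # true negatives
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # false positives
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # false negatives

    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denom == 0:
        return 0.0
    return (tp * tn - fp * fn) / denom


# Hypothetical labels: 1 = mental health problem, 0 = no problem.
y_true = [1, 1, 1, 0, 0, 0, 0, 0]
y_pred = [1, 1, 0, 0, 0, 0, 0, 1]
mcc = matthews_corrcoef(y_true, y_pred)
```

MCC ranges from -1 to 1 and uses all four confusion matrix cells, so it is less easily inflated than accuracy when one class is much rarer than the other.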
Pages: 23