Machine Learning Methods to Identify Predictors of Psychological Distress

被引:2
作者
Chen, Yang [1 ]
Zhang, Xiaomei [1 ]
Lu, Lin [2 ]
Wang, Yinzhi [1 ]
Liu, Jiajia [3 ]
Qin, Lei [1 ]
Ye, Linglong [4 ]
Zhu, Jianping [5 ,6 ]
Shia, Ben-Chang [7 ,8 ]
Chen, Ming-Chih [7 ,8 ]
机构
[1] Univ Int Business & Econ, Sch Stat, Beijing 100029, Peoples R China
[2] Univ Int Business & Econ, Inst Educ & Econ Res, Beijing 100029, Peoples R China
[3] Univ Int Business & Econ, Sch Int Relat, Beijing 100029, Peoples R China
[4] Xiamen Univ, Sch Publ Affairs, Xiamen 361005, Peoples R China
[5] Xiamen Univ, Sch Management, Xiamen 361005, Peoples R China
[6] Xiamen Univ, Natl Inst Data Sci Hlth & Med, Xiamen 361005, Peoples R China
[7] Fu Jen Catholic Univ, Coll Management, Grad Inst Business Adm, New Taipei 24205, Taiwan
[8] Fu Jen Catholic Univ, Artificial Intelligence Dev Ctr, New Taipei 24205, Taiwan
关键词
psychological distress; predictors; machine learning; HINTS; DISORDERS;
D O I
10.3390/pr10051030
中图分类号
TQ [化学工业];
学科分类号
0817 ;
摘要
As people pay ever-increasing attention to the problems caused by psychological stress, research on its influencing factors becomes crucial. This study analyzed the Health Information National Trends Survey (HINTS, Cycle 3 and Cycle 4) data (N = 5484) and assessed the outcomes using descriptive statistics, Chi-squared tests, and t-tests. Four machine learning algorithms were applied for modeling: logistic regression (linear), random forests (RF) (ensemble), the artificial neural network (ANN) (nonlinear), and gradient boosting (GB) (ensemble). The samples were randomly assigned to a 50% training set and a 50% validation set. Twenty-six preselected variables from the databases were used in the study as predictors, and the four models identified twenty predictors of psychological distress. The essence of this paper is a binary classification problem of judging whether an individual has psychological distress based on many different factors. Therefore, accuracy, precision, recall, F1-score, and AUC were used to evaluate the model performance. The logistic regression model selected predictors by forward selection, backward selection, and stepwise regression; variable importance values were used to identify predictors in the other three machine learning methods. Of the four machine learning models, the ANN exhibited the best predictive effect (AUC = 73.90%). A range of predictors of psychological distress was identified by combining the four machine learning models, which would help improve the performance of the existing mental health screening tools.
引用
收藏
页数:13
相关论文
共 23 条
  • [1] Understanding persons with psychological distress in primary health care
    Arvidsdotter, Tina
    Marklund, Bertil
    Kylen, Sven
    Taft, Charles
    Ekman, Inger
    [J]. SCANDINAVIAN JOURNAL OF CARING SCIENCES, 2016, 30 (04) : 687 - 694
  • [2] Breiman L., 2001, Machine Learning, V45, P5
  • [3] Cafri G., 2016, J DATA SCI, V14, P67, DOI DOI 10.6339/JDS.201601_14(1).0005
  • [4] XGBoost: A Scalable Tree Boosting System
    Chen, Tianqi
    Guestrin, Carlos
    [J]. KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, : 785 - 794
  • [5] Screening for mood and anxiety disorders with the five-item, the three-item, and the two-item Mental Health Inventory
    Cuijpers, Pim
    Smits, Niels
    Donker, Tara
    ten Have, Margreet
    de Graaf, Ron
    [J]. PSYCHIATRY RESEARCH, 2009, 168 (03) : 250 - 255
  • [6] A combined strategy of feature selection and machine learning to identify predictors of prediabetes
    De Silva, Kushan
    Jonsson, Daniel
    Demmer, Ryan T.
    [J]. JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2020, 27 (03) : 396 - 406
  • [7] Drapeau A., 2012, MENTAL ILLNESSES UND, P105, DOI [10.5772/30872, DOI 10.5772/30872, DOI 10.5772/1235]
  • [8] Goldberg D.P., 1972, DETECTION PSYCHIAT I, P21
  • [9] Hosmer DW, 2013, WILEY SER PROBAB ST, P1, DOI 10.1002/9781118548387
  • [10] The social consequences of psychiatric disorders, III: Probability of marital stability
    Kessler, RC
    Walters, EE
    Forthofer, MS
    [J]. AMERICAN JOURNAL OF PSYCHIATRY, 1998, 155 (08) : 1092 - 1096