Predicting self-perceived general health status using machine learning: an external exposome study

被引:5
作者
Hoekstra, Jurriaan [1 ]
Lenssen, Esther S. [2 ]
Wong, Albert [1 ]
Loef, Bette [1 ]
Herber, Gerrie-Cor M. [1 ]
Boshuizen, Hendriek C. [1 ,3 ]
Strak, Maciek [1 ]
Verschuren, W. M. Monique [1 ,4 ]
Janssen, Nicole A. H. [1 ]
机构
[1] Natl Inst Publ Hlth & Environm RIVM, Bilthoven, Netherlands
[2] Univ Utrecht, Inst Risk Assessment Sci, Utrecht, Netherlands
[3] Wageningen Univ & Res, Wageningen, Netherlands
[4] Univ Utrecht, Univ Med Ctr Utrecht, Julius Ctr Hlth Sci & Primary Care, Utrecht, Netherlands
关键词
Exposome; Machine learning; Random forest; Self-perceived general health; RATED HEALTH; RELIABILITY; MORTALITY; SCALES;
D O I
10.1186/s12889-023-15962-8
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
BackgroundSelf-perceived general health (SPGH) is a general health indicator commonly used in epidemiological research and is associated with a wide range of exposures from different domains. However, most studies on SPGH only investigated a limited set of exposures and did not take the entire external exposome into account. We aimed to develop predictive models for SPGH based on exposome datasets using machine learning techniques and identify the most important predictors of poor SPGH status.MethodsRandom forest (RF) was used on two datasets based on personal characteristics from the 2012 and 2016 editions of the Dutch national health survey, enriched with environmental and neighborhood characteristics. Model performance was determined using the area under the curve (AUC) score. The most important predictors were identified using a variable importance procedure and individual effects of exposures using partial dependence and accumulated local effect plots. The final 2012 dataset contained information on 199,840 individuals and 81 variables, whereas the final 2016 dataset had 244,557 individuals with 91 variables.ResultsOur RF models had overall good predictive performance (2012: AUC = 0.864 (CI: 0.852-0.876); 2016: AUC = 0.890 (CI: 0.883-0.896)) and the most important predictors were "Control of own life", "Physical activity", "Loneliness" and "Making ends meet". Subjects who felt insufficiently in control of their own life, scored high on the De Jong-Gierveld loneliness scale or had difficulty in making ends meet were more likely to have poor SPGH status, whereas increased physical activity per week reduced the probability of poor SPGH. We observed associations between some neighborhood and environmental characteristics, but these variables did not contribute to the overall predictive strength of the models.ConclusionsThis study identified that within an external exposome dataset, the most important predictors for SPGH status are related to mental wellbeing, physical exercise, loneliness, and financial status.
引用
收藏
页数:17
相关论文
共 50 条
  • [41] Elevated homocysteine is associated with poorer self-perceived physical health in older men: The Health in Men Study
    Wong, Yuen Y. E.
    Almeida, Osvaldo P.
    McCaul, Kieran A.
    Yeap, Bu B.
    Hankey, Graeme J.
    van Bockxmeer, Frank M.
    Flicker, Leon
    MATURITAS, 2012, 73 (02) : 158 - 163
  • [42] SELF-REPORTED HEALTH STATUS PREDICTING RESILIENCE AND BURNOUT IN LONGITUDINAL STUDY
    Solcova, Iva
    Kebza, Vladimir
    Kodl, Miloslav
    Kernova, Vera
    CENTRAL EUROPEAN JOURNAL OF PUBLIC HEALTH, 2017, 25 (03) : 222 - 227
  • [43] Predicting Phubbing Through Machine Learning: A Study of Internet Usage and Health Risks
    Yalman, Aysen
    Arik, Mehmet Arif
    Kayakus, Mehmet
    Karaduman, Murad
    Karaduman, Sibel
    Acikgoz, Fatma Yigit
    Livberber, Tuba
    Kayan, Fahrettin
    APPLIED SCIENCES-BASEL, 2025, 15 (03):
  • [44] Predicting Participation Willingness in Ecological Momentary Assessment of General Population Health and Behavior: Machine Learning Study
    Murray, Aja
    Ushakova, Anastasia
    Zhu, Xinxin
    Yang, Yi
    Xiao, Zhuoni
    Brown, Ruth
    Speyer, Lydia
    Ribeaud, Denis
    Eisner, Manuel
    JOURNAL OF MEDICAL INTERNET RESEARCH, 2023, 25
  • [45] A POI-Based Machine Learning Method for Predicting Residents' Health Status
    Cao, Shicong
    Zheng, Hao
    PROCEEDINGS OF THE 2021 DIGITALFUTURES, CDRF 2021, 2022, : 139 - 147
  • [46] Knowledge attainment, learning approaches, and self-perceived study burnout among European veterinary students
    Iivanainen, Antti
    Collares, Carlos Fernando
    Wandall, Jakob
    Parpala, Anna
    Nevgi, Anne
    Keto-Timonen, Riikka
    Tipold, Andrea
    Schaper, Elisabeth
    van Haeften, Theo
    Pihl, Tina Holberg
    Press, Charles McLean
    Holm, Peter
    FRONTIERS IN VETERINARY SCIENCE, 2024, 11
  • [47] Occupational stress and self-perceived oral health in Brazilian adults: a Pro-Saude study
    da Cunha Scalco, Giovana Pereira
    Abegg, Claides
    Celeste, Roger Keller
    Marques Hkerberg, Yara Hahr
    Faerstein, Eduardo
    CIENCIA & SAUDE COLETIVA, 2013, 18 (07): : 2069 - 2074
  • [48] Predicting perinatal mortality based on maternal health status and health insurance service using homogeneous ensemble machine learning methods
    Dawit S. Bogale
    Tesfamariam M. Abuhay
    Belayneh E. Dejene
    BMC Medical Informatics and Decision Making, 22
  • [49] Predicting perinatal mortality based on maternal health status and health insurance service using homogeneous ensemble machine learning methods
    Bogale, Dawit S.
    Abuhay, Tesfamariam M.
    Dejene, Belayneh E.
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2022, 22 (01)
  • [50] Self-perceived health in Spanish and Portuguese young seniors after the great recession according to the European Health Survey: A cross-sectional study
    Pereira-de-Sousa, Ana M.
    Lopez-Rodriguez, Juan A.
    ATENCION PRIMARIA, 2021, 53 (07):