Characterisation of cardiovascular disease (CVD) incidence and machine learning risk prediction in middle-aged and elderly populations: data from the China health and retirement longitudinal study (CHARLS)

被引:0
|
作者
Huang, Qing [1 ]
Jiang, Zihao [1 ]
Shi, Bo [2 ]
Meng, Jiaxu [2 ]
Shu, Li [1 ]
Hu, Fuyong [1 ]
Mi, Jing [1 ]
机构
[1] Bengbu Med Univ, Sch Publ Hlth, 2600 Donghai Ave, Bengbu 233030, Anhui, Peoples R China
[2] Bengbu Med Univ, Sch Med Imaging, 2600 Donghai Ave, Bengbu 233030, Anhui, Peoples R China
关键词
Cardiovascular disease; Middle-aged and elderly individuals; Morbidity characteristics; Machine learning; Predictive modelling; MORTALITY; UPDATE;
D O I
10.1186/s12889-025-21609-7
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
BackgroundDue to the ageing population and evolving lifestyles occurring in China, middle-aged and elderly populations have become high-risk groups for cardiovascular disease (CVD). The aim of this study was to analyse the incidence characteristics of CVD in these populations and develop a prediction model by using data from the China Health and Retirement Longitudinal Study (CHARLS).MethodsWe used follow-up data from the CHARLS to analyse CVD incidence in the Chinese middle-aged and elderly population over a time span of 9 years. Five machine learning (ML) algorithms were employed for risk prediction. Data preprocessing included missing value imputation via random forest. Feature selection was performed using the Least Absolute Shrinkage and Selection Operator (Lasso CV) method with cross-validation prior to model training. The application of the synthetic minority over-sampling technique (SMOTE) to address class imbalance. Model performance was evaluated via analyses including the area under the ROC curve (AUC), precision, recall, F1 score, and SHAP plots for interpretability.ResultsIn accordance with the exclusion criteria, 12,580, 12,061, 11,545, and 11,619 participants were enrolled in four follow-up rounds. The cumulative incidence (CI) of CVD at 2, 4, 7, and 9 years was 2.846%, 8.971%, 17.869% and 20.518%,, respectively. Significant differences in CVD incidence were observed across gender, age, ethnicity, and region, with higher rates observed in females and in the northeast region. Ultimately, 8,080 participants and 24 features were analysed for CVD risk prediction. Five ML models were built based on these features. Although the LGB model achieves an AUC of 0.818, indicating strong overall performance, its F1 score and recall rate are relatively low, at 0.509 and 43.1%, respectively. Shapley additive explanations (SHAP) analyses revealed the importance of key features, such as night sleep duration, TG levels, and waist circumference, in predicting outcomes, and highlighted the nonlinear relationships between these features and CVD risk.ConclusionsGender, age, ethnicity, and region are significant factors influencing CVD incidence. Although the LGB model demonstrates good overall performance, its low F1 score and recall rate reveal limitations in identifying high-risk cardiovascular disease patients.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Medication of diabetes in middle-aged and elderly: Survey data from the china health and retirement longitudinal study (CHARLS)
    Liu, Lili
    Wang, Shengfeng
    Zhan, Siyan
    PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2020, 29 : 227 - 228
  • [2] Prevalence and risk factors of hypertension for the middle-aged population in China - results from the China Health and Retirement Longitudinal Study (CHARLS)
    Li, Zhen
    Fu, Chang
    Yang, Fan
    Mao, Zongfu
    CLINICAL AND EXPERIMENTAL HYPERTENSION, 2019, 41 (01) : 80 - 86
  • [3] ASSOCIATION BETWEEN FRAILTY AND RISK OF FALL IN MIDDLE-AGED AND ELDERLY DIABETES PATIENTS: FINDINGS FROM THE CHINA HEALTH AND RETIREMENT LONGITUDINAL STUDY (CHARLS)
    Wang, X.
    Li, G.
    OSTEOPOROSIS INTERNATIONAL, 2020, 31 (SUPPL 1) : S177 - S177
  • [4] Health status of middle-aged and older cancer survivors in China: Results from the China Health and Retirement Longitudinal Study (CHARLS)
    Li, J.
    Zhao, L.
    Bai, C.
    Pang, H.
    Sun, Z.
    ANNALS OF ONCOLOGY, 2019, 30
  • [5] Comprehensive treatment of hypertension middle-aged and elderly people: cross-sectional survey data from the China Health and Retirement Longitudinal Study (CHARLS)
    Wang, Shengfeng
    Chen, Ru
    Liu, Qing
    Zhan, Siyan
    Li, Liming
    LANCET, 2015, 386 : 67 - 67
  • [6] Does Internet Use Impact the Health Status of Middle-Aged and Older Populations? Evidence from China Health and Retirement Longitudinal Study (CHARLS)
    Li, Liqing
    Ding, Haifeng
    Li, Zihan
    INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2022, 19 (06)
  • [7] Association of estimated cardiorespiratory fitness in middle-aged and elderly people with cardiovascular disease: Evidence from the China health and retirement longitudinal study
    Li, Yiqun
    Ren, Xiao
    Jiang, Minglan
    Han, Longyang
    Zheng, Xiaowei
    NUTRITION METABOLISM AND CARDIOVASCULAR DISEASES, 2024, 34 (10) : 2257 - 2265
  • [8] Does Education Influence Life-Course Depression in Middle-Aged and Elderly in China? Evidence from the China Health and Retirement Longitudinal Study (CHARLS)
    Xu, Xiwu
    Zhou, Yaodong
    Su, Dai
    Dang, Yuan
    Zhang, Xianwen
    INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2023, 20 (02)
  • [9] Temporal trends in the prevalence of metabolic syndrome among middle-aged and elderly adults from 2011 to 2015 in China: the China health and retirement longitudinal study (CHARLS)
    Bo Liu
    Guanqun Chen
    Ruijie Zhao
    Dan Huang
    Lixin Tao
    BMC Public Health, 21
  • [10] Temporal trends in the prevalence of metabolic syndrome among middle-aged and elderly adults from 2011 to 2015 in China: the China health and retirement longitudinal study (CHARLS)
    Liu, Bo
    Chen, Guanqun
    Zhao, Ruijie
    Huang, Dan
    Tao, Lixin
    BMC PUBLIC HEALTH, 2021, 21 (01)