Ensemble-learning approach improves fracture prediction using genomic and phenotypic data

被引:1
|
作者
Wu, Qing [1 ]
Jung, Jongyun [1 ]
机构
[1] Ohio State Univ, Coll Med, Dept Biomed Informat, Columbus, OH 43210 USA
基金
美国国家卫生研究院;
关键词
Accuracy; Ensemble learning; Fracture; Genomics; Machine learning; Osteoporosis; Sensitivity; Specificity; Super Learner; OSTEOPOROTIC FRACTURES; MEN; RISK; PROBABILITY; WOMEN; MODEL;
D O I
10.1007/s00198-025-07437-w
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
This study presents an innovative ensemble machine learning model integrating genomic and clinical data to enhance the prediction of major osteoporotic fractures in older men. The Super Learner (SL) model achieved superior performance (AUC = 0.76, accuracy = 95.6%, sensitivity = 94.5%, specificity = 96.1%) compared to individual models. Ensemble machine learning improves fracture prediction accuracy, demonstrating the potential for personalized osteoporosis management. fracture risk models have limitations in their accuracy and in integrating genomic data. This study developed and validated an innovative ensemble machine learning (ML) model that combines multiple algorithms and integrates clinical, lifestyle, skeletal, and genomic data to enhance prediction for major osteoporotic fractures (MOF) in older men. Methods This study analyzed data from 5130 participants in the Osteoporotic Fractures in Men cohort Study. The model incorporated 1103 individual genome-wide significant variants and conventional risk factors of MOF. The participants were randomly divided into training (80%) and testing (20%) sets. Seven ML algorithms were combined using the SL ensemble method with tenfold cross-validation MOF prediction. Model performance was evaluated on the testing set using the area under the curve (AUC), the area under the precision-recall curve, calibration, accuracy, sensitivity, specificity, negative predictive value (NPV), positive predictive value (PPV), and reclassification metrics. SL model performances were evaluated by comparison with baseline models and subgroup analyses by race. Results The SL model demonstrated the best performance with an AUC of 0.76, accuracy of 95.6%, sensitivity of 94.5%, specificity of 96.1%, NPV of 95.1%, and PPV of 94.7%. Among the individual ML, gradient boosting performed optimally. The SL model outperformed baseline models, and it also achieved accuracies of 93.1% for Whites and 91.6% for Minorities, outperforming single ML in subgroup analysis. Conclusion The ensemble learning approach significantly improved fracture prediction accuracy and model performance compared to individual ML. Integrating genomic and phenotypic data via the SL approach represents a promising advancement for personalized osteoporosis management.
引用
收藏
页码:811 / 821
页数:11
相关论文
共 50 条
  • [1] Machine Learning Approaches for Fracture Risk Assessment: A Comparative Analysis of Genomic and Phenotypic Data in 5130 Older Men
    Wu, Qing
    Nasoz, Fatma
    Jung, Jongyun
    Bhattarai, Bibek
    Han, Mira V.
    CALCIFIED TISSUE INTERNATIONAL, 2020, 107 (04) : 353 - 361
  • [2] An ensemble-learning model for failure rate prediction
    Marcello, Braglia
    Davide, Castellano
    Marco, Frosolini
    Roberto, Gabbrielli
    Leonardo, Marrazzini
    Luca, Padellini
    INTERNATIONAL CONFERENCE ON INDUSTRY 4.0 AND SMART MANUFACTURING (ISM 2019), 2020, 42 : 41 - 48
  • [3] A Stacking Ensemble Learning Framework for Genomic Prediction
    Liang, Mang
    Chang, Tianpeng
    An, Bingxing
    Duan, Xinghai
    Du, Lili
    Wang, Xiaoqiao
    Miao, Jian
    Xu, Lingyang
    Gao, Xue
    Zhang, Lupei
    Li, Junya
    Gao, Huijiang
    FRONTIERS IN GENETICS, 2021, 12
  • [4] Machine Learning Approaches for Fracture Risk Assessment: A Comparative Analysis of Genomic and Phenotypic Data in 5130 Older Men
    Qing Wu
    Fatma Nasoz
    Jongyun Jung
    Bibek Bhattarai
    Mira V. Han
    Calcified Tissue International, 2020, 107 : 353 - 361
  • [5] Prediction of Promotors in Agrobacterium and Klebsiella Using Novel Feature Engineering and Ensemble Learning Approach
    Samee, Nagwan Abdel
    Talaat, Rawan
    Raza, Ali
    Shaiba, Hadil
    Meshoul, Souham
    IEEE ACCESS, 2025, 13 : 42116 - 42128
  • [6] Development and validation of common data model-based fracture prediction model using machine learning algorithm
    Kong, Sung Hye
    Kim, Sihyeon
    Kim, Yisak
    Kim, Jung Hee
    Kim, Kwangsoo
    Shin, Chan Soo
    OSTEOPOROSIS INTERNATIONAL, 2023, 34 (08) : 1437 - 1451
  • [7] Prediction of Heart Disease using an Ensemble Learning Approach
    Alshehri G.A.
    Alharbi H.M.
    International Journal of Advanced Computer Science and Applications, 2023, 14 (08) : 1089 - 1097
  • [8] ELMV: an Ensemble-Learning Approach for Analyzing Electrical Health Records with Significant Missing Values
    Liu, Lucas Jing
    Zhang, Hongwei
    Di, Jianzhong
    Chen, Jin
    ACM-BCB 2020 - 11TH ACM CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY, AND HEALTH INFORMATICS, 2020,
  • [9] Prediction Model for a Good Learning Environment Using an Ensemble Approach
    Subha, S.
    Priya, S. Baghavathi
    COMPUTER SYSTEMS SCIENCE AND ENGINEERING, 2023, 44 (03): : 2081 - 2093
  • [10] Ensemble-Learning and Feature Selection Techniques for Enhanced Antisense Oligonucleotide Efficacy Prediction in Exon Skipping
    Zhu, Alex
    Chiba, Shuntaro
    Shimizu, Yuki
    Kunitake, Katsuhiko
    Okuno, Yasushi
    Aoki, Yoshitsugu
    Yokota, Toshifumi
    PHARMACEUTICS, 2023, 15 (07)