Ensemble-learning approach improves fracture prediction using genomic and phenotypic data

被引:3
作者
Wu, Qing [1 ]
Jung, Jongyun [1 ]
机构
[1] Ohio State Univ, Coll Med, Dept Biomed Informat, Columbus, OH 43210 USA
基金
美国国家卫生研究院;
关键词
Accuracy; Ensemble learning; Fracture; Genomics; Machine learning; Osteoporosis; Sensitivity; Specificity; Super Learner; OSTEOPOROTIC FRACTURES; MEN; RISK; PROBABILITY; WOMEN; MODEL;
D O I
10.1007/s00198-025-07437-w
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
This study presents an innovative ensemble machine learning model integrating genomic and clinical data to enhance the prediction of major osteoporotic fractures in older men. The Super Learner (SL) model achieved superior performance (AUC = 0.76, accuracy = 95.6%, sensitivity = 94.5%, specificity = 96.1%) compared to individual models. Ensemble machine learning improves fracture prediction accuracy, demonstrating the potential for personalized osteoporosis management. fracture risk models have limitations in their accuracy and in integrating genomic data. This study developed and validated an innovative ensemble machine learning (ML) model that combines multiple algorithms and integrates clinical, lifestyle, skeletal, and genomic data to enhance prediction for major osteoporotic fractures (MOF) in older men. Methods This study analyzed data from 5130 participants in the Osteoporotic Fractures in Men cohort Study. The model incorporated 1103 individual genome-wide significant variants and conventional risk factors of MOF. The participants were randomly divided into training (80%) and testing (20%) sets. Seven ML algorithms were combined using the SL ensemble method with tenfold cross-validation MOF prediction. Model performance was evaluated on the testing set using the area under the curve (AUC), the area under the precision-recall curve, calibration, accuracy, sensitivity, specificity, negative predictive value (NPV), positive predictive value (PPV), and reclassification metrics. SL model performances were evaluated by comparison with baseline models and subgroup analyses by race. Results The SL model demonstrated the best performance with an AUC of 0.76, accuracy of 95.6%, sensitivity of 94.5%, specificity of 96.1%, NPV of 95.1%, and PPV of 94.7%. Among the individual ML, gradient boosting performed optimally. The SL model outperformed baseline models, and it also achieved accuracies of 93.1% for Whites and 91.6% for Minorities, outperforming single ML in subgroup analysis. Conclusion The ensemble learning approach significantly improved fracture prediction accuracy and model performance compared to individual ML. Integrating genomic and phenotypic data via the SL approach represents a promising advancement for personalized osteoporosis management.
引用
收藏
页码:811 / 821
页数:11
相关论文
共 50 条
[31]   Prediction of Cavity Length Using an Interpretable Ensemble Learning Approach [J].
Guo, Ganggui ;
Li, Shanshan ;
Liu, Yakun ;
Cao, Ze ;
Deng, Yangyu .
INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2023, 20 (01)
[32]   Lightning prediction using an ensemble learning approach for northeast of Iran [J].
Pakdaman, Morteza ;
Naghab, Sina Samadi ;
Khazanedari, Leili ;
Malbousi, Sharare ;
Falamarzi, Yashar .
JOURNAL OF ATMOSPHERIC AND SOLAR-TERRESTRIAL PHYSICS, 2020, 209
[33]   Ensemble-learning for pressure prediction in vacuum circuit breaker using feature fusion of laser-induced plasma spectra and images [J].
Ke, Wei ;
Pan, Jianbin ;
Yuan, Huan ;
Wang, Xiaohua ;
Zhang, Dongzhi ;
Rong, Mingzhe .
SPECTROCHIMICA ACTA PART B-ATOMIC SPECTROSCOPY, 2025, 226
[34]   Ensemble-learning model based ultimate moment prediction of reinforced concrete members strengthened by UHPC [J].
Taffese, Woubishet Zewdu ;
Zhu, Yanping ;
Chen, Genda .
ENGINEERING STRUCTURES, 2024, 305
[35]   Malignant melanoma detection using multi-scale image decomposition and a new ensemble-learning scheme [J].
Asmae Ennaji ;
Hasnae El Khoukhi ;
My Abdelouahed Sabri ;
Abdellah Aarab .
Multimedia Tools and Applications, 2024, 83 :21213-21228
[36]   Machine Learning and Deep Learning for Loan Prediction in Banking: Exploring Ensemble Methods and Data Balancing [J].
Sayed, Eslam Hussein ;
Alabrah, Amerah ;
Rahouma, Kamel Hussein ;
Zohaib, Muhammad ;
Badry, Rasha M. .
IEEE ACCESS, 2024, 12 :193997-194019
[37]   SIMLIN: a bioinformatics tool for prediction of S-sulphenylation in the human proteome based on multi-stage ensemble-learning models [J].
Wang, Xiaochuan ;
Li, Chen ;
Li, Fuyi ;
Sharma, Varun S. ;
Song, Jiangning ;
Webb, Geoffrey, I .
BMC BIOINFORMATICS, 2019, 20 (01)
[38]   EnsembleNPPred: A Robust Approach to Neuropeptide Prediction and Recognition Using Ensemble Machine Learning and Deep Learning Methods [J].
Lertampaiporn, Supatcha ;
Wattanapornprom, Warin ;
Thammarongtham, Chinae ;
Hongsthong, Apiradee .
LIFE-BASEL, 2025, 15 (07)
[39]   Prediction of wart treatment response using a hybrid GA-ensemble learning approach [J].
Qasem, Ahmed Gailan ;
Lam, Sarah S. .
EXPERT SYSTEMS WITH APPLICATIONS, 2023, 221
[40]   Explainable Heart Disease Prediction Using Ensemble-Quantum Machine Learning Approach [J].
Abdulsalam, Ghada ;
Meshoul, Souham ;
Shaiba, Hadil .
INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2023, 36 (01) :761-779