Exploring the correlation between DNA methylation and biological age using an interpretable machine learning framework

被引:0
|
作者
Zhou, Sheng [1 ]
Chen, Jing [2 ]
Wei, Shanshan [1 ]
Zhou, Chengxing [3 ]
Wang, Die [4 ]
Yan, Xiaofan [5 ]
He, Xun [5 ]
Yan, Pengcheng [6 ]
机构
[1] Guizhou Med Univ, Dept Publ Hlth & Hlth, Guiyang, Guizhou, Peoples R China
[2] Guizhou Prov Drug Adm Inspect Ctr, Guiyang, Guizhou, Peoples R China
[3] Guizhou Med Univ, Sch Biology&Engineering, Sch Hlth Med Modern Ind, Guiyang, Guizhou, Peoples R China
[4] Guizhou Med Univ, Coll Anesthesia, Guiyang, Guizhou, Peoples R China
[5] Guizhou Med Univ, Sch Med & Hlth Management, Guiyang, Guizhou, Peoples R China
[6] Guizhou Med Univ, Sch Clin Med, Guiyang, Guizhou, Peoples R China
来源
SCIENTIFIC REPORTS | 2024年 / 14卷 / 01期
关键词
DNA methylation; Biological age; GO enrichment analysis; XGBoost; Interpretable machine learning; Shapley Additive exPlanations;
D O I
10.1038/s41598-024-75586-9
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
DNA methylation plays a significant role in regulating transcription and exhibits a systematic change with age. These changes can be used to predict an individual's age. First, to identify methylation sites associated with biological age; second, to construct a biological age prediction model and preliminarily explore the biological significance of methylation-associated genes using machine learning. A biological age prediction model was constructed using human methylation data through data preprocessing, feature selection procedures, statistical analysis, and machine learning techniques. Subsequently, 15 methylation data sets were subjected to in-depth analysis using SHAP, GO enrichment, and KEGG analysis. XGBoost, LightGBM, and CatBoost identified 15 groups of methylation sites associated with biological age. The cg23995914 locus was identified as the most significant contributor to predicting biological age by calculating SHAP values. Furthermore, GO enrichment and KEGG analyses were employed to initially explore the methylated loci's biological significance.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] Probabilistic framework for strain-based fatigue life prediction and uncertainty quantification using interpretable machine learning
    Deng, Xi
    Zhu, Shun-Peng
    Wang, Lanyi
    Luo, Changqi
    Fu, Sicheng
    Wang, Qingyuan
    INTERNATIONAL JOURNAL OF FATIGUE, 2025, 190
  • [32] Correlation between DNA methylation and chronological age of Moso bamboo (Phyllostachys heterocycla var. pubescens)
    Yuan, Jin-Ling
    Sun, Hui-Min
    Guo, Guang-Ping
    Yue, Jin-Jun
    Gu, Xiao-Ping
    BOTANICAL STUDIES, 2014, 55
  • [33] Correlation between DNA methylation and chronological age of Moso bamboo (Phyllostachys heterocycla var. pubescens)
    Jin-Ling Yuan
    Hui-Min Sun
    Guang-Ping Guo
    Jin-Jun Yue
    Xiao-Ping Gu
    Botanical Studies, 55
  • [34] DNA methylation signature of psychological resilience in young adults: Constructing a methylation risk score using a machine learning method
    Lu, Andrew Ke-Ming
    Hsieh, Shulan
    Yang, Cheng-Ta
    Wang, Xin-Yu
    Lin, Sheng-Hsiang
    FRONTIERS IN GENETICS, 2023, 13
  • [35] A Transparent and Valid Framework for Rockburst Assessment: Unifying Interpretable Machine Learning and Conformal Prediction
    Ibrahim, Bemah
    Tetteh-Asare, Abigail
    Ahenkorah, Isaac
    ROCK MECHANICS AND ROCK ENGINEERING, 2024, 57 (08) : 6211 - 6225
  • [36] Estimation aboveground biomass in subtropical bamboo forests based on an interpretable machine learning framework
    Li, Xuejian
    Du, Huaqiang
    Mao, Fangjie
    Xu, Yanxin
    Huang, Zihao
    Xuan, Jie
    Zhou, Yongxia
    Hu, Mengchen
    ENVIRONMENTAL MODELLING & SOFTWARE, 2024, 178
  • [37] Interpretable machine learning framework for catalyst performance prediction and validation with dry reforming of methane
    Roh, Jiwon
    Park, Hyundo
    Kwon, Hyukwon
    Joo, Chonghyo
    Moon, Il
    Cho, Hyungtae
    Ro, Insoo
    Kim, Junghwan
    APPLIED CATALYSIS B-ENVIRONMENT AND ENERGY, 2024, 343
  • [38] Explainable Human-Machine Teaming using Model Checking and Interpretable Machine Learning
    Bersani, Marcello M.
    Camilli, Matteo
    Lestingi, Livia
    Mirandola, Raffaela
    Rossi, Matteo
    2023 IEEE/ACM 11TH INTERNATIONAL CONFERENCE ON FORMAL METHODS IN SOFTWARE ENGINEERING, FORMALISE, 2023, : 18 - 28
  • [39] The correlation between sperm DNA methylation and DNA damage: a comparison of comet and TUNEL
    Zimmerman, Hailey
    Jenkins, Tim
    FRONTIERS IN REPRODUCTIVE HEALTH, 2025, 7
  • [40] Traffic signal optimization framework using interpretable machine learning technique under heterogeneous-autonomy traffic environment
    Al-Turki M.
    Kashifi M.T.
    Ratrout N.T.
    Rahman S.M.
    Neural Computing and Applications, 2024, 36 (22) : 13761 - 13781