Exploring the correlation between DNA methylation and biological age using an interpretable machine learning framework

Authors
Zhou, Sheng [1]
Chen, Jing [2]
Wei, Shanshan [1]
Zhou, Chengxing [3]
Wang, Die [4]
Yan, Xiaofan [5]
He, Xun [5]
Yan, Pengcheng [6]
Affiliations
[1] Guizhou Med Univ, Dept Publ Hlth & Hlth, Guiyang, Guizhou, Peoples R China
[2] Guizhou Prov Drug Adm Inspect Ctr, Guiyang, Guizhou, Peoples R China
[3] Guizhou Med Univ, Sch Biology&Engineering, Sch Hlth Med Modern Ind, Guiyang, Guizhou, Peoples R China
[4] Guizhou Med Univ, Coll Anesthesia, Guiyang, Guizhou, Peoples R China
[5] Guizhou Med Univ, Sch Med & Hlth Management, Guiyang, Guizhou, Peoples R China
[6] Guizhou Med Univ, Sch Clin Med, Guiyang, Guizhou, Peoples R China
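Source
SCIENTIFIC REPORTS | 2024 / Volume 14 / Issue 01
Keywords
DNA methylation; Biological age; GO enrichment analysis; XGBoost; Interpretable machine learning; Shapley Additive exPlanations
DOI
10.1038/s41598-024-75586-9
Chinese Library Classification
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences]
Subject classification codes
07; 0710; 09
Abstract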
DNA methylation plays a significant role in regulating transcription and changes systematically with age. These changes can be used to predict an individual's age. This study had two aims: first, to identify methylation sites associated with biological age; second, to construct a biological age prediction model and preliminarily explore the biological significance of methylation-associated genes using machine learning. A biological age prediction model was constructed from human methylation data through data preprocessing, feature selection, statistical analysis, and machine learning techniques. Subsequently, 15 methylation data sets were analyzed in depth using SHAP, GO enrichment, and KEGG analysis. XGBoost, LightGBM, and CatBoost identified 15 groups of methylation sites associated with biological age. Based on the calculated SHAP values, the cg23995914 locus was identified as the most significant contributor to the prediction of biological age. Furthermore, GO enrichment and KEGG analyses were employed for a preliminary exploration of the biological significance of the methylated loci.
Pages: 13
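The abstract ranks methylation sites by their SHAP values. As a rough illustration of what such a value measures, the sketch below computes exact Shapley values by brute-force enumeration for a toy age model. Everything in it is an assumption made for illustration: the model `toy_age_model`, its coefficients, the site names `site_a`, `site_b`, `site_c`, and the beta values are all hypothetical and are not taken from the paper. The record does not describe how the authors computed their values, and tree-model explainers in SHAP tooling use faster algorithms than this enumeration.

```python
from itertools import combinations
from math import factorial


def toy_age_model(betas):
    """Hypothetical age predictor over three made-up methylation sites."""
    return (30.0 + 40.0 * betas["site_a"] - 10.0 * betas["site_b"]
            + 20.0 * betas["site_a"] * betas["site_c"])


def shapley_values(predict, x, baseline):
    """Exact Shapley values of each feature of x, relative to a baseline sample.

    Features outside the current coalition are set to their baseline value.
    """
    names = list(x)
    n = len(names)
    values = {}
    for name in names:
        others = [f for f in names if f != name]
        total = 0.0
        for k in range(n):
            # Shapley weight for coalitions of size k that exclude `name`.
            weight = factorial(k) * factorial(n - k - 1) / factorial(n)
            for subset in combinations(others, k):
                without = {f: (x[f] if f in subset else baseline[f]) for f in names}
                with_f = dict(without)
                with_f[name] = x[name]
                total += weight * (predict(with_f) - predict(without))
        values[name] = total
    return values


# Hypothetical beta values for one sample and for a reference sample.
sample = {"site_a": 0.9, "site_b": 0.4, "site_c": 0.7}
reference = {"site_a": 0.5, "site_b": 0.5, "site_c": 0.5}

contrib = shapley_values(toy_age_model, sample, reference)
# Rank sites by absolute contribution, largest first.
ranked = sorted(contrib, key=lambda f: abs(contrib[f]), reverse=True)
print(ranked, contrib)
```

The contributions sum to the difference between the prediction for the sample and the prediction for the reference, which is the property that makes a per-site ranking of this kind interpretable.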