Integration of datasets for individual prediction of DNA methylation-based biomarkers

被引:1
|
作者
Merzbacher, Charlotte [1 ]
Ryan, Barry [1 ]
Goldsborough, Thibaut [1 ]
Hillary, Robert F. [2 ]
Campbell, Archie [2 ]
Murphy, Lee [3 ]
Mcintosh, Andrew M. [2 ,4 ]
Liewald, David [5 ]
Harris, Sarah E. [5 ]
Mcrae, Allan F. [6 ]
Cox, Simon R. [5 ]
Cannings, Timothy I. [7 ]
Vallejos, Catalina A. [8 ,9 ]
Mccartney, Daniel L. [2 ]
Marioni, Riccardo E. [2 ]
机构
[1] Univ Edinburgh, Sch Informat, Edinburgh EH8 9AB, Scotland
[2] Univ Edinburgh, Inst Genet & Canc, Ctr Genom & Expt Med, Edinburgh EH4 2XU, Scotland
[3] Univ Edinburgh, Edinburgh Clin Res Facil, Edinburgh EH4 2XU, Scotland
[4] Univ Edinburgh, Ctr Clin Brain Sci, Div Psychiat, Edinburgh, Scotland
[5] Univ Edinburgh, Dept Psychol, Lothian Birth Cohorts, Edinburgh EH8 9JZ, Scotland
[6] Univ Queensland, Inst Mol Biosci, Brisbane, Australia
[7] Univ Edinburgh, Maxwell Inst Math Sci, Sch Math, Edinburgh EH9 3FD, Scotland
[8] Univ Edinburgh, Inst Genet & Canc, MRC Human Genet Unit, Edinburgh EH4 2XU, Scotland
[9] Alan Turing Inst, London, England
基金
英国惠康基金;
关键词
DNA methylation; Prediction; Biomarker; QUANTILE NORMALIZATION; PACKAGE; DESIGN;
D O I
10.1186/s13059-023-03114-5
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
BackgroundEpigenetic scores (EpiScores) can provide biomarkers of lifestyle and disease risk. Projecting new datasets onto a reference panel is challenging due to separation of technical and biological variation with array data. Normalisation can standardise data distributions but may also remove population-level biological variation.ResultsWe compare two birth cohorts (Lothian Birth Cohorts of 1921 and 1936 - nLBC1921 = 387 and nLBC1936 = 498) with blood-based DNA methylation assessed at the same chronological age (79 years) and processed in the same lab but in different years and experimental batches. We examine the effect of 16 normalisation methods on a novel BMI EpiScore (trained in an external cohort, n = 18,413), and Horvath's pan-tissue DNA methylation age, when the cohorts are normalised separately and together. The BMI EpiScore explains a maximum variance of R2=24.5% in BMI in LBC1936 (SWAN normalisation). Although there are cross-cohort R2 differences, the normalisation method makes a minimal difference to within-cohort estimates. Conversely, a range of absolute differences are seen for individual-level EpiScore estimates for BMI and age when cohorts are normalised separately versus together. While within-array methods result in identical EpiScores whether a cohort is normalised on its own or together with the second dataset, a range of differences is observed for between-array methods.ConclusionsNormalisation methods returning similar EpiScores, whether cohorts are analysed separately or together, will minimise technical variation when projecting new data onto a reference panel. These methods are important for cases where raw data is unavailable and joint normalisation of cohorts is computationally expensive.
引用
收藏
页数:12
相关论文
共 50 条
  • [31] Application of droplet digital PCR method for DNA methylation-based age prediction from saliva
    Lee, Min Ho
    Hwang, Jung Hee
    Seong, Ki Min
    Ahn, Jeong Jin
    Kim, Seung Jun
    Hwang, Seung Yong
    Lim, Si-Keun
    LEGAL MEDICINE, 2022, 54
  • [32] DNA methylation-based variation between human populations
    Farzeen Kader
    Meenu Ghai
    Molecular Genetics and Genomics, 2017, 292 : 5 - 35
  • [33] A DNA methylation-based test for esophageal cancer detection
    Salta, Sofia
    Macedo-Silva, Catarina
    Miranda-Goncalves, Vera
    Lopes, Nair
    Gigliano, Davide
    Guimaraes, Rita
    Farinha, Monica
    Sousa, Olga
    Henrique, Rui
    Jeronimo, Carmen
    BIOMARKER RESEARCH, 2020, 8 (01)
  • [34] Are we ready for DNA methylation-based prenatal testing?
    Yuen, Ryan K. C.
    Manokhina, Irina
    Robinson, Wendy P.
    EPIGENOMICS, 2011, 3 (04) : 387 - 390
  • [35] Novel multiplex strategy for DNA methylation-based age prediction from small amounts of DNA via Pyrosequencing
    Fleckhaus, Jan
    Schneider, Peter M.
    FORENSIC SCIENCE INTERNATIONAL-GENETICS, 2020, 44
  • [36] Methylation-based epigenetic studies and gene integration analysis of preeclampsia
    Jiang, Lei
    Chang, Ruijing
    Liu, Jing
    Xin, Hong
    ANNALS OF TRANSLATIONAL MEDICINE, 2022, 10 (24)
  • [37] DNA Methylation-Based Biomarkers of Protein Levels and Cardiovascular Disease Risk: Opportunities and Challenges for Precision Cardiology
    Bozack, Anne K.
    Navas-Acien, Ana
    Cardenas, Andres
    CIRCULATION-GENOMIC AND PRECISION MEDICINE, 2024, 17 (02):
  • [38] Methylation-Based Biological Age and Breast Cancer Risk
    Kresovich, Jacob K.
    Xu, Zongli
    O'Brien, Katie M.
    Weinberg, Clarice R.
    Sandler, Dale P.
    Taylor, Jack A.
    JNCI-JOURNAL OF THE NATIONAL CANCER INSTITUTE, 2019, 111 (10): : 1051 - 1058
  • [39] Performance of DNA methylation-based biomarkers in the cervical cancer screening program of northern Portugal: A feasibility study
    Salta, Sofia
    Maia-Moco, Leonardo
    Estevao-Pereira, Helena
    Sequeira, Jose Pedro
    Vieira, Renata
    Bartosch, Carla
    Petronilho, Sara
    Monteiro, Paula
    Sousa, Ana
    Baldaque, Ines
    Rodrigues, Jessica
    Sousa, Hugo
    Tavares, Fernando
    Henrique, Rui
    Jeronimo, Carmen
    INTERNATIONAL JOURNAL OF CANCER, 2021, 149 (11) : 1916 - 1925
  • [40] A validation study of DNA methylation-based age prediction using semen in forensic casework samples
    Lee, Jee Won
    Choung, Chong Min
    Jung, Ju Yeon
    Lee, Hwan Young
    Lim, Si-Keun
    LEGAL MEDICINE, 2018, 31 : 74 - 77