Improved Genetic Profiling of Anthropometric Traits Using a Big Data Approach

被引:8
|
作者
Canela-Xandri, Oriol [1 ]
Rawlik, Konrad [1 ]
Woolliams, John A. [1 ]
Tenesa, Albert [1 ,2 ]
机构
[1] Univ Edinburgh, Royal Dick Sch Vet Studies, Roslin Inst, Easter Bush Campus, Edinburgh, Midlothian, Scotland
[2] Univ Edinburgh, Western Gen Hosp, MRC IGMM, MRC HGU, Edinburgh, Midlothian, Scotland
来源
PLOS ONE | 2016年 / 11卷 / 12期
基金
英国生物技术与生命科学研究理事会; 英国医学研究理事会;
关键词
COMPLEX TRAITS; PREDICTION; ASSOCIATION;
D O I
10.1371/journal.pone.0166755
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Genome-wide association studies (GWAS) promised to translate their findings into clinically beneficial improvements of patient management by tailoring disease management to the individual through the prediction of disease risk. However, the ability to translate genetic findings from GWAS into predictive tools that are of clinical utility and which may inform clinical practice has, so far, been encouraging but limited. Here we propose to use a more powerful statistical approach, the use of which has traditionally been limited due to computational requirements and lack of sufficiently large individual level genotyped cohorts, but which improve the prediction of multiple medically relevant phenotypes using the same panel of SNPs. As a proof of principle, we used a shared panel of 319,038 common SNPs with MAF > 0.05 to train the prediction models in 114,264 unrelated White-British individuals for height and four obesity related traits (body mass index, basal metabolic rate, body fat percentage, and waist-to-hip ratio). We obtained prediction accuracies that ranged between 46% and 75% of the maximum achievable given the captured heritable component. For height, this represents an improvement in prediction accuracy of up to 68% (184% more phenotypic variance explained) over SNPs reported to be robustly associated with height in a previous GWAS meta-analysis of similar size. Across-population predictions in White non-British individuals were similar to those in White-British whilst those in Asian and Black individuals were informative but less accurate. We estimate that the genotyping of circa 500,000 unrelated individuals will yield predictions between 66% and 82% of the SNP-heritability captured by common variants in our array. Prediction accuracies did not improve when including rarer SNPs or when fitting multiple traits jointly in multivariate models.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] A New Approach for Wrapper Feature Selection Using Genetic Algorithm for Big Data
    Bouaguel, Waad
    INTELLIGENT AND EVOLUTIONARY SYSTEMS, IES 2015, 2016, 5 : 75 - 83
  • [2] Phenotypic variance partitioning by transcriptomic gene expression levels and environmental variables for anthropometric traits using GTEx data
    Fabres, Pastor Jullian
    Lee, S. Hong
    GENETIC EPIDEMIOLOGY, 2023, 47 (07) : 465 - 474
  • [3] Genetic Correlations Among Corneal Biophysical Parameters and Anthropometric Traits
    Cousins, Henry C.
    Cousins, Clara C.
    Valluru, Girish
    Altman, Russ B.
    Liu, Yutao
    Pasquale, Louis R.
    Ahmad, Sumayya
    TRANSLATIONAL VISION SCIENCE & TECHNOLOGY, 2023, 12 (08):
  • [4] Cricket Match Analytics Using the Big Data Approach
    Awan, Mazhar Javed
    Gilani, Syed Arbaz Haider
    Ramzan, Hamza
    Nobanee, Haitham
    Yasin, Awais
    Zain, Azlan Mohd
    Javed, Rabia
    ELECTRONICS, 2021, 10 (19)
  • [5] A new tool called DISSECT for analysing large genomic data sets using a Big Data approach
    Canela-Xandri, Oriol
    Law, Andy
    Gray, Alan
    Woolliams, John A.
    Tenesa, Albert
    NATURE COMMUNICATIONS, 2015, 6
  • [6] Performance prediction models based on anthropometric, genetic and psychological traits of Croatian sprinters
    Zaccagni, Luciana
    Lunghi, Barbara
    Barbieri, Davide
    Rinaldo, Natascia
    Missoni, Sasa
    Saric, Tena
    Sarac, Jelena
    Babic, Vesna
    Rakovac, Marija
    Bernardi, Francesco
    Gualdi-Russo, Emanuela
    BIOLOGY OF SPORT, 2019, 36 (01) : 17 - 23
  • [7] Improved genetic prediction of complex traits from individual-level data or summary statistics
    Zhang, Qianqian
    Prive, Florian
    Vilhjalmsson, Bjarni
    Speed, Doug
    NATURE COMMUNICATIONS, 2021, 12 (01)
  • [8] Analysis commons, a team approach to discovery in a big-data environment for genetic epidemiology
    Brody, Jennifer A.
    Morrison, Alanna C.
    Bis, Joshua C.
    O'Connell, Jeffrey R.
    Brown, Michael R.
    Huffman, Jennifer E.
    Ames, Darren C.
    Carroll, Andrew
    Conomos, Matthew P.
    Gabriel, Stacey
    Gibbs, Richard A.
    Gogarten, Stephanie M.
    Gupta, Namrata
    Jaquish, Cashell E.
    Johnson, Andrew D.
    Lewis, Joshua P.
    Liu, Xiaoming
    Manning, Alisa K.
    Papanicolaou, George J.
    Pitsillides, Achilleas N.
    Rice, Kenneth M.
    Salerno, William
    Sitlani, Colleen M.
    Smith, Nicholas L.
    Heckbert, Susan R.
    Laurie, Cathy C.
    Mitchell, Braxton D.
    Vasan, Ramachandran S.
    Rich, Stephen S.
    Rotter, Jerome I.
    Wilson, James G.
    Boerwinkle, Eric
    Psaty, Bruce M.
    Cupples, L. Adrienne
    NATURE GENETICS, 2017, 49 (11) : 1560 - 1563
  • [9] Reduced Time Compression in Big Data Using MapReduce Approach and Hadoop
    Meena, K.
    Sujatha, J.
    JOURNAL OF MEDICAL SYSTEMS, 2019, 43 (08)
  • [10] A calibrated data-driven approach for small area estimation using big data
    Tam, Siu-Ming
    Sharmeen, Shaila
    AUSTRALIAN & NEW ZEALAND JOURNAL OF STATISTICS, 2024, 66 (02) : 125 - 145