Improved Genetic Profiling of Anthropometric Traits Using a Big Data Approach

被引:8
作者
Canela-Xandri, Oriol [1 ]
Rawlik, Konrad [1 ]
Woolliams, John A. [1 ]
Tenesa, Albert [1 ,2 ]
机构
[1] Univ Edinburgh, Royal Dick Sch Vet Studies, Roslin Inst, Easter Bush Campus, Edinburgh, Midlothian, Scotland
[2] Univ Edinburgh, Western Gen Hosp, MRC IGMM, MRC HGU, Edinburgh, Midlothian, Scotland
来源
PLOS ONE | 2016年 / 11卷 / 12期
基金
英国医学研究理事会; 英国生物技术与生命科学研究理事会;
关键词
COMPLEX TRAITS; PREDICTION; ASSOCIATION;
D O I
10.1371/journal.pone.0166755
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Genome-wide association studies (GWAS) promised to translate their findings into clinically beneficial improvements of patient management by tailoring disease management to the individual through the prediction of disease risk. However, the ability to translate genetic findings from GWAS into predictive tools that are of clinical utility and which may inform clinical practice has, so far, been encouraging but limited. Here we propose to use a more powerful statistical approach, the use of which has traditionally been limited due to computational requirements and lack of sufficiently large individual level genotyped cohorts, but which improve the prediction of multiple medically relevant phenotypes using the same panel of SNPs. As a proof of principle, we used a shared panel of 319,038 common SNPs with MAF > 0.05 to train the prediction models in 114,264 unrelated White-British individuals for height and four obesity related traits (body mass index, basal metabolic rate, body fat percentage, and waist-to-hip ratio). We obtained prediction accuracies that ranged between 46% and 75% of the maximum achievable given the captured heritable component. For height, this represents an improvement in prediction accuracy of up to 68% (184% more phenotypic variance explained) over SNPs reported to be robustly associated with height in a previous GWAS meta-analysis of similar size. Across-population predictions in White non-British individuals were similar to those in White-British whilst those in Asian and Black individuals were informative but less accurate. We estimate that the genotyping of circa 500,000 unrelated individuals will yield predictions between 66% and 82% of the SNP-heritability captured by common variants in our array. Prediction accuracies did not improve when including rarer SNPs or when fitting multiple traits jointly in multivariate models.
引用
收藏
页数:12
相关论文
共 50 条
  • [31] A Big Data Based Dynamic Weight Approach for RFM Segmentation
    Lang, Lin
    Zhou, Shuang
    Zhong, Minjuan
    Sun, Guang
    Pan, Bin
    Guo, Peng
    CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 74 (02): : 3503 - 3513
  • [32] Big Data Analytics in Immunology: A Knowledge-Based Approach
    Zhang, Guang Lan
    Sun, Jing
    Chitkushev, Lou
    Brusic, Vladimir
    BIOMED RESEARCH INTERNATIONAL, 2014, 2014
  • [33] Big Data Approach on Polymer Materials: Fundamental, Progress and Challenge
    Liu, Lun-yang
    Ding, Fang
    Li, Yun-qi
    ACTA POLYMERICA SINICA, 2022, 53 (06): : 564 - 580
  • [34] Big Data Approach to Characterize Restricted and Repetitive Behaviors in Autism
    Uljarevic, Mirko
    Frazier, Thomas W.
    Jo, Booil
    Billingham, Wesley D.
    Cooper, Matthew N.
    Youngstrom, Eric A.
    Scahill, Lawrence
    Hardan, Antonio Y.
    JOURNAL OF THE AMERICAN ACADEMY OF CHILD AND ADOLESCENT PSYCHIATRY, 2022, 61 (03) : 446 - 457
  • [35] Genetic dissection of complex traits using hierarchical biological knowledge
    Tanaka, Hidenori
    Kreisberg, Jason F.
    Ideker, Trey
    PLOS COMPUTATIONAL BIOLOGY, 2021, 17 (09)
  • [36] Statistical modeling of spatial big data: An approach from a functional data analysis perspective
    Giraldo, Ramon
    Dabo-Niang, Sophie
    Martinez, Sergio
    STATISTICS & PROBABILITY LETTERS, 2018, 136 : 126 - 129
  • [37] Bayesian estimation of genetic parameters for superovulatory response traits in Japanese Black donor cows using count data models
    Ogawa, Shinichiro
    Zoda, Atsushi
    Kagawa, Rino
    Obinata, Rui
    Urakawa, Manami
    Oono, Yoshio
    THERIOGENOLOGY, 2022, 190 : 38 - 45
  • [38] Inference of causal relationships between sleep-related traits and 1,527 phenotypes using genetic data
    Garcia-Marin, Luis M.
    Campos, Adrian, I
    Martin, Nicholas G.
    Cuellar-Partida, Gabriel
    Renteria, Miguel E.
    SLEEP, 2021, 44 (01)
  • [39] Survey on Weather Prediction using Big Data Analystics
    Reddy, P. Chandrashaker
    Babu, A. Suresh
    PROCEEDINGS OF THE 2017 IEEE SECOND INTERNATIONAL CONFERENCE ON ELECTRICAL, COMPUTER AND COMMUNICATION TECHNOLOGIES (ICECCT), 2017,
  • [40] Cultivating historical heritage area vitality using urban morphology approach based on big data and machine learning
    Wu, Jiayu
    Lu, Yutian
    Gao, Hei
    Wang, Mingshu
    COMPUTERS ENVIRONMENT AND URBAN SYSTEMS, 2022, 91