Bridging the diversity gap: Analytical and study design considerations for improving the accuracy of trans-ancestry genetic prediction

被引:3
作者
Bocher, Ozvan [1 ]
Gilly, Arthur [1 ]
Park, Young-Chan [1 ]
Zeggini, Eleftheria [1 ,2 ,3 ]
Morris, Andrew P. [1 ,4 ]
机构
[1] Helmholtz Zentrum Munchen, ITG, Munich, Germany
[2] Tech Univ Munich, Munich, Germany
[3] Klinikum Rechts Der Isar, Munich, Germany
[4] Univ Manchester, Ctr Genet & Genom Versus Arthrit, Ctr Musculoskeletal Res, Manchester, England
来源
HUMAN GENETICS AND GENOMICS ADVANCES | 2023年 / 4卷 / 03期
关键词
POLYGENIC RISK SCORES; METAANALYSIS; ASSOCIATION; DISCOVERY; INSIGHTS; HISTORY;
D O I
10.1016/j.xhgg.2023.100214
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Genetic prediction of common complex disease risk is an essential component of precision medicine. Currently, genome-wide association studies (GWASs) are mostly composed of European-ancestry samples and resulting polygenic scores (PGSs) have been shown to poorly transfer to other ancestries partly due to heterogeneity of allelic effects between populations. Fixed-effects (FETA) and random-effects (RETA) trans-ancestry meta-analyses do not model such ancestry-related heterogeneity, while ancestry-specific (AS) scores may suffer from low power due to low sample sizes. In contrast, trans-ancestry meta-regression (TAMR) builds ancestry-aware PGS that account for more complex trans-ancestry architectures. Here, we examine the predictive performance of these four PGSs under multiple genetic architectures and ancestry configurations. We show that the predictive performance of FETA and RETA is strongly affected by cross-ancestry genetic heterogeneity, while AS PGS performance decreases in under-represented target populations. TAMR PGS is also impacted by heterogeneity but maintains good prediction performance in most situations, especially in ancestry-diverse scenarios. In simulations of human complex traits, TAMR scores currently explain 25% more phenotypic variance than AS in triglyceride levels and 33% more phenotypic variance than FETA in type 2 diabetes in most non-European populations. Importantly, a high proportion of non-European-ancestry individuals is needed to reach prediction levels that are comparable in those populations to the one observed in European-ancestry studies. Our results highlight the need to rebalance the ancestral composition of GWAS to enable accurate prediction in non-European-ancestry groups, and demonstrate the relevance of meta-regression approaches for compensating some of the current population biases in GWAS.
引用
收藏
页数:8
相关论文
共 48 条
[1]   Heritability and familiality of type 2 diabetes and related quantitative traits in the Botnia Study [J].
Almgren, P. ;
Lehtovirta, M. ;
Isomaa, B. ;
Sarelin, L. ;
Taskinen, M. R. ;
Lyssenko, V. ;
Tuomi, T. ;
Groop, L. .
DIABETOLOGIA, 2011, 54 (11) :2811-2819
[2]   Heritability and genetic associations of triglyceride and HDL-C levels using pedigree-based and empirical kinships [J].
Nicholas B. Blackburn ;
Arthur Porto ;
Juan M. Peralta ;
John Blangero .
BMC Proceedings, 12 (Suppl 9)
[3]   Transethnic Genetic-Correlation Estimates from Summary Statistics [J].
Brown, Brielin C. ;
Ye, Chun Jimmie ;
Price, Alkes L. ;
Zaitlen, Noah .
AMERICAN JOURNAL OF HUMAN GENETICS, 2016, 99 (01) :76-88
[4]   The NHGRI-EBI GWAS Catalog of published genome-wide association studies, targeted arrays and summary statistics 2019 [J].
Buniello, Annalisa ;
MacArthur, Jacqueline A. L. ;
Cerezo, Maria ;
Harris, Laura W. ;
Hayhurst, James ;
Malangone, Cinzia ;
McMahon, Aoife ;
Morales, Joannella ;
Mountjoy, Edward ;
Sollis, Elliot ;
Suveges, Daniel ;
Vrousgou, Olga ;
Whetzel, Patricia L. ;
Amode, Ridwan ;
Guillen, Jose A. ;
Riat, Harpreet S. ;
Trevanion, Stephen J. ;
Hall, Peggy ;
Junkins, Heather ;
Flicek, Paul ;
Burdett, Tony ;
Hindorff, Lucia A. ;
Cunningham, Fiona ;
Parkinson, Helen .
NUCLEIC ACIDS RESEARCH, 2019, 47 (D1) :D1005-D1012
[5]   Inclusion of variants discovered from diverse populations improves polygenic risk score transferability [J].
Cavazos, Taylor B. ;
Witte, John S. .
HUMAN GENETICS AND GENOMICS ADVANCES, 2021, 2 (01)
[6]   Tutorial: a guide to performing polygenic risk score analyses [J].
Choi, Shing Wan ;
Mak, Timothy Shin-Heng ;
O'Reilly, Paul F. .
NATURE PROTOCOLS, 2020, 15 (09) :2759-2772
[7]   Polygenic risk score for schizophrenia is more strongly associated with ancestry than with schizophrenia [J].
Curtis, David .
PSYCHIATRIC GENETICS, 2018, 28 (05) :85-89
[8]  
Ding Y, 2022, bioRxiv, DOI [10.1101/2022.09.28.509988, 10.1101/2022.09.28.509988, DOI 10.1101/2022.09.28.509988, DOI 10.1101/2022.09.28.509988V1]
[9]   A combined polygenic score of 21,293 rare and 22 common variants improves diabetes diagnosis based on hemoglobin A1C levels [J].
Dornbos, Peter ;
Koesterer, Ryan ;
Ruttenburg, Andrew ;
Trang Nguyen ;
Cole, Joanne B. ;
Leong, Aaron ;
Meigs, James B. ;
Florez, Jose C. ;
Rotter, Jerome, I ;
Udler, Miriam S. ;
Flannick, Jason .
NATURE GENETICS, 2022, 54 (11) :1609-+
[10]   Analysis of polygenic risk score usage and performance in diverse human populations [J].
Duncan, L. ;
Shen, H. ;
Gelaye, B. ;
Meijsen, J. ;
Ressler, K. ;
Feldman, M. ;
Peterson, R. ;
Domingue, B. .
NATURE COMMUNICATIONS, 2019, 10 (1)