Recent advances and challenges of rare variant association analysis in the biobank sequencing era

被引:21
作者
Chen, Wenan [1 ]
Coombes, Brandon J. J. [2 ]
Larson, Nicholas B. B. [2 ]
机构
[1] St Jude Childrens Res Hosp, Ctr Appl Bioinformat, Memphis, TN 38105 USA
[2] Mayo Clin, Dept Quantitat Hlth Sci, Rochester, MN 55905 USA
关键词
rare variant; sequencing data; variant annotations; population structure; external controls; family-based design; complex phenotypes; case-control; POPULATION-STRUCTURE; DISEQUILIBRIUM TEST; GENOME; TESTS; GENE; ROBUST; COMMON; TRAITS; METAANALYSIS; STATISTICS;
D O I
10.3389/fgene.2022.1014947
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Causal variants for rare genetic diseases are often rare in the general population. Rare variants may also contribute to common complex traits and can have much larger per-allele effect sizes than common variants, although power to detect these associations can be limited. Sequencing costs have steadily declined with technological advancements, making it feasible to adopt whole-exome and whole-genome profiling for large biobank-scale sample sizes. These large amounts of sequencing data provide both opportunities and challenges for rare-variant association analysis. Herein, we review the basic concepts of rare-variant analysis methods, the current state-of-the-art methods in utilizing variant annotations or external controls to improve the statistical power, and particular challenges facing rare variant analysis such as accounting for population structure, extremely unbalanced case-control design. We also review recent advances and challenges in rare variant analysis for familial sequencing data and for more complex phenotypes such as survival data. Finally, we discuss other potential directions for further methodology investigation.
引用
收藏
页数:14
相关论文
共 104 条
[1]   A global reference for human genetic variation [J].
Altshuler, David M. ;
Durbin, Richard M. ;
Abecasis, Goncalo R. ;
Bentley, David R. ;
Chakravarti, Aravinda ;
Clark, Andrew G. ;
Donnelly, Peter ;
Eichler, Evan E. ;
Flicek, Paul ;
Gabriel, Stacey B. ;
Gibbs, Richard A. ;
Green, Eric D. ;
Hurles, Matthew E. ;
Knoppers, Bartha M. ;
Korbel, Jan O. ;
Lander, Eric S. ;
Lee, Charles ;
Lehrach, Hans ;
Mardis, Elaine R. ;
Marth, Gabor T. ;
McVean, Gil A. ;
Nickerson, Deborah A. ;
Wang, Jun ;
Wilson, Richard K. ;
Boerwinkle, Eric ;
Doddapaneni, Harsha ;
Han, Yi ;
Korchina, Viktoriya ;
Kovar, Christie ;
Lee, Sandra ;
Muzny, Donna ;
Reid, Jeffrey G. ;
Zhu, Yiming ;
Chang, Yuqi ;
Feng, Qiang ;
Fang, Xiaodong ;
Guo, Xiaosen ;
Jian, Min ;
Jiang, Hui ;
Jin, Xin ;
Lan, Tianming ;
Li, Guoqing ;
Li, Jingxiang ;
Li, Yingrui ;
Liu, Shengmao ;
Liu, Xiao ;
Lu, Yao ;
Ma, Xuedi ;
Tang, Meifang ;
Wang, Bo .
NATURE, 2015, 526 (7571) :68-+
[2]   Exome sequencing and analysis of 454,787 UK Biobank participants [J].
Backman, Joshua D. ;
Li, Alexander H. ;
Marcketta, Anthony ;
Sun, Dylan ;
Mbatchou, Joelle ;
Kessler, Michael D. ;
Benner, Christian ;
Liu, Daren ;
Locke, Adam E. ;
Balasubramanian, Suganthi ;
Yadav, Ashish ;
Banerjee, Nilanjana ;
Gillies, Christopher E. ;
Damask, Amy ;
Liu, Simon ;
Bai, Xiaodong ;
Hawes, Alicia ;
Maxwell, Evan ;
Gurski, Lauren ;
Watanabe, Kyoko ;
Kosmicki, Jack A. ;
Rajagopal, Veera ;
Mighty, Jason ;
Jones, Marcus ;
Mitnaul, Lyndon ;
Stahl, Eli ;
Coppola, Giovanni ;
Jorgenson, Eric ;
Habegger, Lukas ;
Salerno, William J. ;
Shuldiner, Alan R. ;
Lotta, Luca A. ;
Overton, John D. ;
Cantor, Michael N. ;
Reid, Jeffrey G. ;
Yancopoulos, George ;
Kang, Hyun M. ;
Marchini, Jonathan ;
Baras, Aris ;
Abecasis, Goncalo R. ;
Ferreira, Manuel A. R. .
NATURE, 2021, 599 (7886) :628-+
[3]   The Generalized Higher Criticism for Testing SNP-Set Effects in Genetic Association Studies [J].
Barnett, Ian ;
Mukherjee, Rajarshi ;
Lin, Xihong .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2017, 112 (517) :64-76
[4]   A Fast and Accurate Method for Genome-Wide Time-to-Event Data Analysis and Its Application to UK Biobank [J].
Bi, Wenjian ;
Fritsche, Lars G. ;
Mukherjee, Bhramar ;
Kim, Sehee ;
Lee, Seunggeun .
AMERICAN JOURNAL OF HUMAN GENETICS, 2020, 107 (02) :222-233
[5]   A Fast and Accurate Method for Genome-wide Scale Phenome-wide G x E Analysis and Its Application to UK Biobank [J].
Bi, Wenjian ;
Zhao, Zhangchen ;
Dey, Rounak ;
Fritsche, Lars G. ;
Mukherjee, Bhramar ;
Lee, Seunggeun .
AMERICAN JOURNAL OF HUMAN GENETICS, 2019, 105 (06) :1182-1192
[6]   Controlling for human population stratification in rare variant association studies [J].
Bouaziz, Matthieu ;
Mullaert, Jimmy ;
Bigio, Benedetta ;
Seeleuthner, Yoann ;
Casanova, Jean-Laurent ;
Alcai, Alexandre ;
Abel, Laurent ;
Cobat, Aurelie .
SCIENTIFIC REPORTS, 2021, 11 (01)
[7]   The Value of Statistical or Bioinformatics Annotation for Rare Variant Association With Quantitative Trait [J].
Byrnes, Andrea E. ;
Wu, Michael C. ;
Wright, Fred A. ;
Li, Mingyao ;
Li, Yun .
GENETIC EPIDEMIOLOGY, 2013, 37 (07) :666-674
[8]   Pan-cancer analysis of whole genomes [J].
Campbell, Peter J. ;
Getz, Gad ;
Korbel, Jan O. ;
Stuart, Joshua M. ;
Jennings, Jennifer L. ;
Stein, Lincoln D. ;
Perry, Marc D. ;
Nahal-Bose, Hardeep K. ;
Ouellette, B. F. Francis ;
Li, Constance H. ;
Rheinbay, Esther ;
Nielsen, G. Petur ;
Sgroi, Dennis C. ;
Wu, Chin-Lee ;
Faquin, William C. ;
Deshpande, Vikram ;
Boutros, Paul C. ;
Lazar, Alexander J. ;
Hoadley, Katherine A. ;
Louis, David N. ;
Dursi, L. Jonathan ;
Yung, Christina K. ;
Bailey, Matthew H. ;
Saksena, Gordon ;
Raine, Keiran M. ;
Buchhalter, Ivo ;
Kleinheinz, Kortine ;
Schlesner, Matthias ;
Zhang, Junjun ;
Wang, Wenyi ;
Wheeler, David A. ;
Ding, Li ;
Simpson, Jared T. ;
O'Connor, Brian D. ;
Yakneen, Sergei ;
Ellrott, Kyle ;
Miyoshi, Naoki ;
Butler, Adam P. ;
Royo, Romina ;
Shorser, Solomon, I ;
Vazquez, Miguel ;
Rausch, Tobias ;
Tiao, Grace ;
Waszak, Sebastian M. ;
Rodriguez-Martin, Bernardo ;
Shringarpure, Suyash ;
Wu, Dai-Ying ;
Demidov, German M. ;
Delaneau, Olivier ;
Hayashi, Shuto .
NATURE, 2020, 578 (7793) :82-+
[9]   CNS Tumors in Neurofibromatosis [J].
Campian, Jian ;
Gutmann, David H. .
JOURNAL OF CLINICAL ONCOLOGY, 2017, 35 (21) :2378-+
[10]   Control for Population Structure and Relatedness for Binary Traits in Genetic Association Studies via Logistic Mixed Models [J].
Chen, Han ;
Wang, Chaolong ;
Conomos, Matthew P. ;
Stilp, Adrienne M. ;
Li, Zilin ;
Sofer, Tamar ;
Szpiro, Adam A. ;
Chen, Wei ;
Brehm, John M. ;
Celedon, Juan C. ;
Redline, Susan ;
Papanicolaou, George J. ;
Thornton, Timothy A. ;
Laurie, Cathy C. ;
Rice, Kenneth ;
Lin, Xihong .
AMERICAN JOURNAL OF HUMAN GENETICS, 2016, 98 (04) :653-666