A complete reference genome improves analysis of human genetic variation

被引:212
作者
Aganezov, Sergey [1 ]
Yan, Stephanie M. [2 ]
Soto, Daniela C. [3 ]
Kirsche, Melanie [1 ]
Zarate, Samantha [1 ]
Avdeyev, Pavel [4 ]
Taylor, Dylan J. [2 ]
Shafin, Kishwar [5 ]
Shumate, Alaina [6 ]
Xiao, Chunlin [7 ]
Wagner, Justin [8 ]
McDaniel, Jennifer [8 ]
Olson, Nathan D. [8 ]
Sauria, Michael E. G. [2 ]
Vollger, Mitchell R. [9 ]
Rhie, Arang [4 ]
Meredith, Melissa [5 ]
Martin, Skylar [10 ,11 ]
Lee, Joyce [12 ]
Koren, Sergey [4 ]
Rosenfeld, Jeffrey A. [13 ]
Paten, Benedict [5 ]
Layer, Ryan [10 ,11 ]
Chin, Chen-Shan [14 ]
Sedlazeck, Fritz J. [15 ]
Hansen, Nancy F. [16 ]
Miller, Danny E. [9 ,17 ,18 ]
Phillippy, Adam M. [4 ]
Miga, Karen H. [5 ]
McCoy, Rajiv C. [2 ]
Dennis, Megan Y. [3 ]
Zook, Justin M. [8 ]
Schatz, Michael C. [1 ,2 ,19 ]
机构
[1] Johns Hopkins Univ, Dept Comp Sci, Baltimore, MD 21218 USA
[2] Johns Hopkins Univ, Dept Biol, Baltimore, MD 21218 USA
[3] Univ Calif Davis, Dept Biochem & Mol Med, Genome Ctr, MIND Inst, Davis, CA 95616 USA
[4] NHGRI, Genome Informat Sect, Bethesda, MD 20892 USA
[5] Univ Calif Santa Cruz, UC Santa Cruz Genom Inst, Santa Cruz, CA 95064 USA
[6] Johns Hopkins Univ, Dept Biomed Engn, Baltimore, MD USA
[7] Natl Ctr Biotechnol Informat, Natl Lib Med, Bethesda, MD USA
[8] NIST, Gaithersburg, MD 20899 USA
[9] Univ Washington, Dept Genome Sci, Seattle, WA 98195 USA
[10] Univ Colorado, Dept Comp Sci, Boulder, CO 80309 USA
[11] Univ Colorado, Biofrontiers Inst, Boulder, CO 80309 USA
[12] Bionano Genom, San Diego, CA USA
[13] Canc Inst New Jersey, New Brunswick, NJ USA
[14] DNAnexus, Mountain View, CA USA
[15] Baylor Coll Med, Human Genome Sequencing Ctr, Houston, TX 77030 USA
[16] NHGRI, Comparat Genom Anal Unit, Rockville, MD USA
[17] Univ Washington, Dept Pediat, Div Genet Med, Seattle, WA 98195 USA
[18] Seattle Childrens Hosp, Seattle, WA USA
[19] Cold Spring Harbor Lab, Simons Ctr Quantitat Biol, POB 100, Cold Spring Harbor, NY 11724 USA
关键词
SEGMENTAL DUPLICATIONS; MUTATIONS; SEQUENCE; DIVERSITY; ANCESTRY; MAP;
D O I
10.1126/science.abl3533
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Compared to its predecessors, the Telomere-to-Telomere CHM13 genome adds nearly 200 million base pairs of sequence, corrects thousands of structural errors, and unlocks the most complex regions of the human genome for clinical and functional study. We show how this reference universally improves read mapping and variant calling for 3202 and 17 globally diverse samples sequenced with short and long reads, respectively. We identify hundreds of thousands of variants per sample in previously unresolved regions, showcasing the promise of the T2T-CHM13 reference for evolutionary and biomedical discovery. Simultaneously, this reference eliminates tens of thousands of spurious variants per sample, including reduction of false positives in 269 medically relevant genes by up to a factor of 12. Because of these improvements in variant discovery coupled with population and functional genomic resources, T2T-CHM13 is positioned to replace GRCh38 as the prevailing reference for human genetics.
引用
收藏
页码:54 / +
页数:113
相关论文
共 105 条
[1]   The GTEx Consortium atlas of genetic regulatory effects across human tissues [J].
Aguet, Francois ;
Barbeira, Alvaro N. ;
Bonazzola, Rodrigo ;
Brown, Andrew ;
Castel, Stephane E. ;
Jo, Brian ;
Kasela, Silva ;
Kim-Hellmuth, Sarah ;
Liang, Yanyu ;
Parsana, Princy ;
Flynn, Elise ;
Fresard, Laure ;
Gamazon, Eric R. ;
Hamel, Andrew R. ;
He, Yuan ;
Hormozdiari, Farhad ;
Mohammadi, Pejman ;
Munoz-Aguirre, Manuel ;
Ardlie, Kristin G. ;
Battle, Alexis ;
Bonazzola, Rodrigo ;
Brown, Christopher D. ;
Cox, Nancy ;
Dermitzakis, Emmanouil T. ;
Engelhardt, Barbara E. ;
Garrido-Martin, Diego ;
Gay, Nicole R. ;
Getz, Gad ;
Guigo, Roderic ;
Hamel, Andrew R. ;
Handsaker, Robert E. ;
He, Yuan ;
Hoffman, Paul J. ;
Hormozdiari, Farhad ;
Im, Hae Kyung ;
Jo, Brian ;
Kasela, Silva ;
Kashin, Seva ;
Kim-Hellmuth, Sarah ;
Kwong, Alan ;
Lappalainen, Tuuli ;
Li, Xiao ;
Liang, Yanyu ;
MacArthur, Daniel G. ;
Mohammadi, Pejman ;
Montgomery, Stephen B. ;
Munoz-Aguirre, Manuel ;
Rouhana, John M. ;
Hormozdiari, Farhad ;
Im, Hae Kyung .
SCIENCE, 2020, 369 (6509) :1318-1330
[2]   Signatures of mutational processes in human cancer [J].
Alexandrov, Ludmil B. ;
Nik-Zainal, Serena ;
Wedge, David C. ;
Aparicio, Samuel A. J. R. ;
Behjati, Sam ;
Biankin, Andrew V. ;
Bignell, Graham R. ;
Bolli, Niccolo ;
Borg, Ake ;
Borresen-Dale, Anne-Lise ;
Boyault, Sandrine ;
Burkhardt, Birgit ;
Butler, Adam P. ;
Caldas, Carlos ;
Davies, Helen R. ;
Desmedt, Christine ;
Eils, Roland ;
Eyfjord, Jorunn Erla ;
Foekens, John A. ;
Greaves, Mel ;
Hosoda, Fumie ;
Hutter, Barbara ;
Ilicic, Tomislav ;
Imbeaud, Sandrine ;
Imielinsk, Marcin ;
Jaeger, Natalie ;
Jones, David T. W. ;
Jones, David ;
Knappskog, Stian ;
Kool, Marcel ;
Lakhani, Sunil R. ;
Lopez-Otin, Carlos ;
Martin, Sancha ;
Munshi, Nikhil C. ;
Nakamura, Hiromi ;
Northcott, Paul A. ;
Pajic, Marina ;
Papaemmanuil, Elli ;
Paradiso, Angelo ;
Pearson, John V. ;
Puente, Xose S. ;
Raine, Keiran ;
Ramakrishna, Manasa ;
Richardson, Andrea L. ;
Richter, Julia ;
Rosenstiel, Philip ;
Schlesner, Matthias ;
Schumacher, Ton N. ;
Span, Paul N. ;
Teague, Jon W. .
NATURE, 2013, 500 (7463) :415-+
[3]   Complete genomic and epigenetic maps of human centromeres [J].
Altemose, Nicolas ;
Logsdon, Glennis A. ;
Bzikadze, Andrey, V ;
Sidhwani, Pragya ;
Langley, Sasha A. ;
Caldas, Gina, V ;
Hoyt, Savannah J. ;
Uralsky, Lev ;
Ryabov, Fedor D. ;
Shew, Colin J. ;
Sauria, Michael E. G. ;
Borchers, Matthew ;
Gershman, Ariel ;
Mikheenko, Alla ;
Shepelev, Valery A. ;
Dvorkina, Tatiana ;
Kunyavskaya, Olga ;
Vollger, Mitchell R. ;
Rhie, Arang ;
McCartney, Ann M. ;
Asri, Mobin ;
Lorig-Roach, Ryan ;
Shafin, Kishwar ;
Lucas, Julian K. ;
Aganezov, Sergey ;
Olson, Daniel ;
de Lima, Leonardo Gomes ;
Potapova, Tamara ;
Hartley, Gabrielle A. ;
Haukness, Marina ;
Kerpedjiev, Peter ;
Gusev, Fedor ;
Tigyi, Kristof ;
Brooks, Shelise ;
Young, Alice ;
Nurk, Sergey ;
Koren, Sergey ;
Salama, Sofie R. ;
Paten, Benedict ;
Rogaev, Evgeny, I ;
Streets, Aaron ;
Karpen, Gary H. ;
Dernburg, Abby F. ;
Sullivan, Beth A. ;
Straight, Aaron F. ;
Wheeler, Travis J. ;
Gerton, Jennifer L. ;
Eichler, Evan E. ;
Phillippy, Adam M. ;
Timp, Winston .
SCIENCE, 2022, 376 (6588) :56-+
[4]   The ENCODE Blacklist: Identification of Problematic Regions of the Genome [J].
Amemiya, Haley M. ;
Kundaje, Anshul ;
Boyle, Alan P. .
SCIENTIFIC REPORTS, 2019, 9 (1)
[5]  
[Anonymous], 2015, Nature, DOI [DOI 10.1038/NATURE15393, DOI 10.1038/nature15393]
[6]   Characterizing the Major Structural Variant Alleles of the Human Genome [J].
Audano, Peter A. ;
Sulovari, Arvis ;
Graves-Lindsay, Tina A. ;
Cantsilieris, Stuart ;
Sorensen, Melanie ;
Welch, AnneMarie E. ;
Dougherty, Max L. ;
Nelson, Bradley J. ;
Shah, Ankeeta ;
Dutcher, Susan K. ;
Warren, Wesley C. ;
Magrini, Vincent ;
McGrath, Sean D. ;
Li, Yang I. ;
Wilson, Richard K. ;
Eichler, Evan E. .
CELL, 2019, 176 (03) :663-+
[7]   Recent segmental duplications in the human genome [J].
Bailey, JA ;
Gu, ZP ;
Clark, RA ;
Reinert, K ;
Samonte, RV ;
Schwartz, S ;
Adams, MD ;
Myers, EW ;
Li, PW ;
Eichler, EE .
SCIENCE, 2002, 297 (5583) :1003-1007
[8]   Segmental duplications: Organization and impact within the current Human Genome Project assembly [J].
Bailey, JA ;
Yavor, AM ;
Massa, HF ;
Trask, BJ ;
Eichler, EE .
GENOME RESEARCH, 2001, 11 (06) :1005-1017
[9]   A public resource facilitating clinical use of genomes [J].
Ball, Madeleine P. ;
Thakuria, Joseph V. ;
Zaranek, Alexander Wait ;
Clegg, Tom ;
Rosenbaum, Abraham M. ;
Wu, Xiaodi ;
Angrist, Misha ;
Bhak, Jong ;
Bobe, Jason ;
Callow, Matthew J. ;
Cano, Carlos ;
Chou, Michael F. ;
Chung, Wendy K. ;
Douglas, Shawn M. ;
Estep, Preston W. ;
Gore, Athurva ;
Hulick, Peter ;
Labarga, Alberto ;
Lee, Je-Hyuk ;
Lunshof, Jeantine E. ;
Kim, Byung Chul ;
Kim, Jong-Il ;
Li, Zhe ;
Murray, Michael F. ;
Nilsen, Geoffrey B. ;
Peters, Brock A. ;
Raman, Anugraha M. ;
Rienhoff, Hugh Y. ;
Robasky, Kimberly ;
Wheeler, Matthew T. ;
Vandewege, Ward ;
Vorhaus, Daniel B. ;
Yang, Joyce L. ;
Yang, Luhan ;
Aach, John ;
Ashley, Euan A. ;
Drmanac, Radoje ;
Kim, Seong-Jin ;
Li, Jin Billy ;
Peshkin, Leonid ;
Seidman, Christine E. ;
Seo, Jeong-Sun ;
Zhang, Kun ;
Rehm, Heidi L. ;
Church, George M. .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2012, 109 (30) :11920-11927
[10]   Is it time to change the reference genome? [J].
Ballouz, Sara ;
Dobin, Alexander ;
Gillis, Jesse A. .
GENOME BIOLOGY, 2019, 20 (01)