Characterization and visualization of tandem repeats at genome scale

被引:27
作者
Dolzhenko, Egor [1 ]
English, Adam [2 ]
Dashnow, Harriet [3 ,4 ]
Brandine, Guilherme De Sena [1 ]
Mokveld, Tom [1 ]
Rowell, William J. [1 ]
Karniski, Caitlin [1 ]
Kronenberg, Zev [1 ]
Danzi, Matt C. [5 ,6 ]
Cheung, Warren A. [7 ]
Bi, Chengpeng [7 ]
Farrow, Emily [7 ]
Wenger, Aaron [1 ]
Chua, Khi Pin [1 ]
Martinez-Cerdeno, Veronica [8 ,9 ,10 ,11 ]
Bartley, Trevor D. [8 ,9 ,10 ]
Jin, Peng [12 ]
Nelson, David L. [13 ]
Zuchner, Stephan [5 ,6 ]
Pastinen, Tomi [7 ]
Quinlan, Aaron R. [3 ,4 ]
Sedlazeck, Fritz J. [2 ,13 ,14 ]
Eberle, Michael A. [1 ]
机构
[1] Pacific Biosci Calif, Menlo Pk, CA 94025 USA
[2] Baylor Coll Med, Human Genome Sequencing Ctr, Houston, TX USA
[3] Univ Utah, Dept Human Genet, Salt Lake City, UT USA
[4] Univ Utah, Dept Biomed Informat, Salt Lake City, UT USA
[5] Univ Miami, Dr John T Macdonald Fdn Dept Human Genet, Miller Sch Med, Miami, FL USA
[6] Univ Miami, John P Hussman Inst Human Genom, Miller Sch Med, Miami, FL USA
[7] Childrens Mercy Kansas City, Genom Med Ctr, Kansas City, MO USA
[8] Shriners Hosp Children, Inst Pediat Regenerat Med, Sacramento, CA USA
[9] Univ Calif Davis, Sch Med, Sacramento, CA USA
[10] Univ Calif Davis, Dept Pathol & Lab Med, Sch Med, Sacramento, CA USA
[11] Univ Calif Davis, MIND Inst, Sch Med, Sacramento, CA USA
[12] Emory Univ, Dept Human Genet, Sch Med, Atlanta, GA USA
[13] Baylor Coll Med, Dept Mol & Human Genet, Houston, TX USA
[14] Rice Univ, Dept Comp Sci, Houston, TX USA
关键词
EXPANSION; RFC1; GENE;
D O I
10.1038/s41587-023-02057-3
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Tandem repeat (TR) variation is associated with gene expression changes and numerous rare monogenic diseases. Although long-read sequencing provides accurate full-length sequences and methylation of TRs, there is still a need for computational methods to profile TRs across the genome. Here we introduce the Tandem Repeat Genotyping Tool (TRGT) and an accompanying TR database. TRGT determines the consensus sequences and methylation levels of specified TRs from PacBio HiFi sequencing data. It also reports reads that support each repeat allele. These reads can be subsequently visualized with a companion TR visualization tool. Assessing 937,122 TRs, TRGT showed a Mendelian concordance of 98.38%, allowing a single repeat unit difference. In six samples with known repeat expansions, TRGT detected all expansions while also identifying methylation signals and mosaicism and providing finer repeat length resolution than existing methods. Additionally, we released a database with allele sequences and methylation levels for 937,122 TRs across 100 genomes. A set of tools maps tandem repeats across complete genomes.
引用
收藏
页码:1606 / +
页数:15
相关论文
共 88 条
[1]   Investigation of the RFC1 Repeat Expansion in a Canadian and a Brazilian Ataxia Cohort: Identification of Novel Conformations [J].
Akcimen, Fulya ;
Ross, Jay P. ;
Bourassa, Cynthia, V ;
Liao, Calwing ;
Rochefort, Daniel ;
Drumond Gama, Maria Thereza ;
Dicaire, Marie-Josee ;
Barsottini, Orlando G. ;
Brais, Bernard ;
Pedroso, Jose Luiz ;
Dion, Patrick A. ;
Rouleau, Guy A. .
FRONTIERS IN GENETICS, 2019, 10
[2]  
[Anonymous], 2021, Homo sapiens: Human Pangenome Reference Consortium (HPRC)
[3]   Towards precision medicine [J].
Ashley, Euan A. .
NATURE REVIEWS GENETICS, 2016, 17 (09) :507-522
[4]  
Bakhtiari M., 2023, Github
[5]   Targeted genotyping of variable number tandem repeats with adVNTR [J].
Bakhtiari, Mehrdad ;
Shleizer-Burko, Sharona ;
Gymrek, Melissa ;
Bansal, Vikas ;
Bafna, Vineet .
GENOME RESEARCH, 2018, 28 (11) :1709-1719
[6]  
Caron NS., 1998, GENEREVIEWS
[7]   Direct haplotype-resolved 5-base HiFi sequencing for genome-wide profiling of hypermethylation outliers in a rare disease cohort [J].
Cheung, Warren A. ;
Johnson, Adam F. ;
Rowell, William J. ;
Farrow, Emily ;
Hall, Richard ;
Cohen, Ana S. A. ;
Means, John C. ;
Zion, Tricia N. ;
Portik, Daniel M. ;
Saunders, Christopher T. ;
Koseva, Boryana ;
Bi, Chengpeng ;
Truong, Tina K. ;
Schwendinger-Schreck, Carl ;
Yoo, Byunggil ;
Johnston, Jeffrey J. ;
Gibson, Margaret ;
Evrony, Gilad ;
Rizzo, William B. ;
Thiffault, Isabelle ;
Younger, Scott T. ;
Curran, Tom ;
Wenger, Aaron M. ;
Grundberg, Elin ;
Pastinen, Tomi .
NATURE COMMUNICATIONS, 2023, 14 (01)
[8]   Straglr: discovering and genotyping tandem repeat expansions using whole genome long-read sequences [J].
Chiu, Readman ;
Rajan-Babu, Indhu-Shree ;
Friedman, Jan M. ;
Birol, Inanc .
GENOME BIOLOGY, 2021, 22 (01)
[9]   Genomic answers for children: Dynamic analyses of >1000 pediatric rare disease genomes [J].
Cohen, Ana S. A. ;
Farrow, Emily G. ;
Abdelmoity, Ahmed T. ;
Alaimo, Joseph T. ;
Amudhavalli, Shivarajan M. ;
Anderson, John T. ;
Bansal, Lalit ;
Bartik, Lauren ;
Baybayan, Primo ;
Belden, Bradley ;
Berrios, Courtney D. ;
Biswell, Rebecca L. ;
Buczkowicz, Pawel ;
Buske, Orion ;
Chakraborty, Shreyasee ;
Cheung, Warren A. ;
Coffman, Keith A. ;
Cooper, Ashley M. ;
Cross, Laura A. ;
Curran, Tom ;
Dang, Thuy Tien T. ;
Elfrink, Mary M. ;
Engleman, Kendra L. ;
Fecske, Erin D. ;
Fieser, Cynthia ;
Fitzgerald, Keely ;
Fleming, Emily A. ;
Gadea, Randi N. ;
Gannon, Jennifer L. ;
Gelineau-Morel, Rose N. ;
Gibson, Margaret ;
Goldstein, Jeffrey ;
Grundberg, Elin ;
Halpin, Kelsee ;
Harvey, Brian S. ;
Heese, Bryce A. ;
Hein, Wendy ;
Herd, Suzanne M. ;
Hughes, Susan S. ;
Ilyas, Mohammed ;
Jacobson, Jill ;
Jenkins, Janda L. ;
Jiang, Shao ;
Johnston, Jeffrey J. ;
Keeler, Kathryn ;
Korlach, Jonas ;
Kussmann, Jennifer ;
Lambert, Christine ;
Lawson, Caitlin ;
Le Pichon, Jean-Baptiste .
GENETICS IN MEDICINE, 2022, 24 (06) :1336-1348
[10]   Biallelic expansion of an intronic repeat in RFC1 is a common cause of late-onset ataxia [J].
Cortese, Andrea ;
Simone, Roberto ;
Sullivan, Roisin ;
Vandrovcova, Jana ;
Tariq, Huma ;
Yan, Yau Way ;
Humphrey, Jack ;
Jaunmuktane, Zane ;
Sivakumar, Prasanth ;
Polke, James ;
Ilyas, Muhammad ;
Tribollet, Eloise ;
Tomaselli, Pedro J. ;
Devigili, Grazia ;
Callegari, Ilaria ;
Versino, Maurizio ;
Salpietrol, Vincenzo ;
Efthymiou, Stephanie ;
Kaski, Diego ;
Wood, Nick W. ;
Andrade, Nadja S. ;
Buglo, Elena ;
Rebelo, Adriana ;
Rossor, Alexander M. ;
Bronstein, Adolfo ;
Fratta, Pietro ;
Marques, Wilson J. ;
Zuchner, Stephan ;
Reilly, Mary M. ;
Houlden, Henry .
NATURE GENETICS, 2019, 51 (04) :649-+