ULTRA-effective labeling of tandem repeats in genomic sequence

被引:2
作者
Olson, Daniel R. [1 ]
Wheeler, Travis J. [1 ,2 ]
机构
[1] Univ Montana, Dept Comp Sci, 32 Campus Dr, Missoula, MT 59812 USA
[2] Univ Arizona, R Ken Coit Coll Pharm, Dept Pharm Practice & Sci, Tucson, AZ 85721 USA
来源
BIOINFORMATICS ADVANCES | 2024年 / 4卷 / 01期
关键词
IDENTIFICATION; EVOLUTION; FINDER;
D O I
10.1093/bioadv/vbae149
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
In the age of long read sequencing, genomics researchers now have access to accurate repetitive DNA sequence (including satellites) that, due to the limitations of short read-sequencing, could previously be observed only as unmappable fragments. Tools that annotate repetitive sequence are now more important than ever, so that we can better understand newly uncovered repetitive sequences, and also so that we can mitigate errors in bioinformatic software caused by those repetitive sequences. To that end, we introduce the 1.0 release of our tool for identifying and annotating locally repetitive sequence, ULTRA Locates Tandemly Repetitive Areas (ULTRA). ULTRA is fast enough to use as part of an efficient annotation pipeline, produces state-of-the-art reliable coverage of repetitive regions containing many mutations, and provides interpretable statistics and labels for repetitive regions.
引用
收藏
页数:10
相关论文
共 64 条
[1]   Complete genomic and epigenetic maps of human centromeres [J].
Altemose, Nicolas ;
Logsdon, Glennis A. ;
Bzikadze, Andrey, V ;
Sidhwani, Pragya ;
Langley, Sasha A. ;
Caldas, Gina, V ;
Hoyt, Savannah J. ;
Uralsky, Lev ;
Ryabov, Fedor D. ;
Shew, Colin J. ;
Sauria, Michael E. G. ;
Borchers, Matthew ;
Gershman, Ariel ;
Mikheenko, Alla ;
Shepelev, Valery A. ;
Dvorkina, Tatiana ;
Kunyavskaya, Olga ;
Vollger, Mitchell R. ;
Rhie, Arang ;
McCartney, Ann M. ;
Asri, Mobin ;
Lorig-Roach, Ryan ;
Shafin, Kishwar ;
Lucas, Julian K. ;
Aganezov, Sergey ;
Olson, Daniel ;
de Lima, Leonardo Gomes ;
Potapova, Tamara ;
Hartley, Gabrielle A. ;
Haukness, Marina ;
Kerpedjiev, Peter ;
Gusev, Fedor ;
Tigyi, Kristof ;
Brooks, Shelise ;
Young, Alice ;
Nurk, Sergey ;
Koren, Sergey ;
Salama, Sofie R. ;
Paten, Benedict ;
Rogaev, Evgeny, I ;
Streets, Aaron ;
Karpen, Gary H. ;
Dernburg, Abby F. ;
Sullivan, Beth A. ;
Straight, Aaron F. ;
Wheeler, Travis J. ;
Gerton, Jennifer L. ;
Eichler, Evan E. ;
Phillippy, Adam M. ;
Timp, Winston .
SCIENCE, 2022, 376 (6588) :56-+
[2]   BASIC LOCAL ALIGNMENT SEARCH TOOL [J].
ALTSCHUL, SF ;
GISH, W ;
MILLER, W ;
MYERS, EW ;
LIPMAN, DJ .
JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) :403-410
[3]   MISA-web: a web server for microsatellite prediction [J].
Beier, Sebastian ;
Thiel, Thomas ;
Muench, Thomas ;
Scholz, Uwe ;
Mascher, Martin .
BIOINFORMATICS, 2017, 33 (16) :2583-2585
[4]  
Bennett P, 2000, J CLIN PATHOL-MOL PA, V53, P177
[5]   Tandem repeats finder: a program to analyze DNA sequences [J].
Benson, G .
NUCLEIC ACIDS RESEARCH, 1999, 27 (02) :573-580
[6]   THE MOSAIC GENOME OF WARM-BLOODED VERTEBRATES [J].
BERNARDI, G ;
OLOFSSON, B ;
FILIPSKI, J ;
ZERIAL, M ;
SALINAS, J ;
CUNY, G ;
MEUNIERROTIVAL, M ;
RODIER, F .
SCIENCE, 1985, 228 (4702) :953-958
[7]  
Byung-Jun Yoon, 2004, Conference Record of the Thirty-Eighth Asilomar Conference on Signals, Systems and Computers (IEEE Cat. No.04CH37592), P1651
[8]  
Carey KM, 2021, bioRxiv, DOI [10.1101/2021.02.13.430877, 10.1101/2021.02.13.430877, DOI 10.1101/2021.02.13.430877]
[9]   Deciphering the biology of Mycobacterium tuberculosis from the complete genome sequence (vol 393, pg 537, 1998) [J].
Cole, ST ;
Brosch, R ;
Parkhill, J ;
Garnier, T ;
Churcher, C ;
Harris, D ;
Gordon, SV ;
Eiglmeier, K ;
Gas, S ;
Barry, CE ;
Tekaia, F ;
Badcock, K ;
Basham, D ;
Brown, D ;
Chillingworth, T ;
Conner, R ;
Davies, R ;
Devlin, K ;
Feltwell, T ;
Gentles, S ;
Hamlin, N ;
Holroyd, S ;
Hornsby, T ;
Jagels, K ;
Krogh, A ;
McLean, J ;
Moule, S ;
Murphy, L ;
Oliver, K ;
Osborne, J ;
Quail, MA ;
Rajandream, MA ;
Rogers, J ;
Rutter, S ;
Seeger, K ;
Skelton, J ;
Squares, R ;
Squares, S ;
Sulston, JE ;
Taylor, K ;
Whitehead, S ;
Barrell, BG .
NATURE, 1998, 396 (6707) :190-198
[10]   The Isochores as a Fundamental Level of Genome Structure and Organization: A General Overview [J].
Costantini, Maria ;
Musto, Hector .
JOURNAL OF MOLECULAR EVOLUTION, 2017, 84 (2-3) :93-103