STRScan: targeted profiling of short tandem repeats in whole-genome sequencing data

被引:10
|
作者
Tang, Haixu [1 ]
Nzabarushimana, Etienne [1 ]
机构
[1] Indiana Univ, Sch Informat & Comp, 150 S Woodlawn Ave, Bloomington, IN 47405 USA
来源
BMC BIOINFORMATICS | 2017年 / 18卷
基金
美国国家科学基金会;
关键词
Short tandem repeats; Whole-genome sequencing; Algorithm; DNA forensics; PERSONAL GENOMES; LOCI; MICROSATELLITES;
D O I
10.1186/s12859-017-1800-z
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Short tandem repeats (STRs) are found in many prokaryotic and eukaryotic genomes, and are commonly used as genetic markers, in particular for identity and parental testing in DNA forensics. The unstable expansion of some STRs was associated with various genetic disorders (e.g., the Huntington disease), and thus was used in genetic testing for screening individuals at high risk. Traditional STR analyses were based on the PCR amplification of STR loci followed by gel electrophoresis. With the availability of massive whole genome sequencing data, it becomes practical to mine STR profiles in silico from genome sequences. Software tools such as lobSTR and STR-FM have been developed to address these demands, which are, however, built upon whole genome reads mapping tools, and thus may not be sensitive enough. Results: In this paper, we present a standalone software tool STRScan that uses a greedy algorithm for targeted STR profiling in next-generation sequencing (NGS) data. STRScan was tested on the whole genome sequencing data from Venter genome sequencing and 1000 Genomes Project. The results showed that STRScan can profile 20% more STRs in the target set that are missed by lobSTR. Conclusion: STRScan is particularly useful for the NGS-based targeted STR profiling, e.g., in genetic and human identity testing. STRScan is available as open-source software at http://darwin.informatics.indiana.edu/str/.
引用
收藏
页数:6
相关论文
共 50 条
  • [31] Opportunities and challenges of whole-genome and -exome sequencing
    Petersen, Britt-Sabina
    Fredrich, Broder
    Hoeppner, Marc P.
    Ellinghaus, David
    Franke, Andre
    BMC GENETICS, 2017, 18
  • [32] A unified STR profiling system across multiple species with whole genome sequencing data
    Yilin Liu
    Jiao Xu
    Miaoxia Chen
    Changfa Wang
    Shuaicheng Li
    BMC Bioinformatics, 20
  • [33] eSCAN: scan regulatory regions for aggregate association testing using whole-genome sequencing data
    Yang, Yingxi
    Sun, Quan
    Huang, Le
    Broome, Jai G.
    Correa, Adolfo
    Reiner, Alexander
    Raffield, Laura M.
    Yang, Yuchen
    Li, Yun
    BRIEFINGS IN BIOINFORMATICS, 2022, 23 (01)
  • [34] Whole-Genome Sequencing in Severe Chronic Obstructive Pulmonary Disease
    Prokopenko, Dmitry
    Sakornsakolpat, Phuwanat
    Fier, Heide Loehlein
    Qiao, Dandi
    Parker, Margaret M.
    McDonald, Merry-Lynn N.
    Manichaikul, Ani
    Rich, Stephen S.
    Barr, R. Graham
    Williams, Christopher J.
    Brantly, Mark L.
    Lange, Christoph
    Beaty, Tern H.
    Crapo, James D.
    Silverman, Edwin K.
    Cho, Michael H.
    AMERICAN JOURNAL OF RESPIRATORY CELL AND MOLECULAR BIOLOGY, 2018, 59 (05) : 614 - 622
  • [35] A unified STR profiling system across multiple species with whole genome sequencing data
    Liu, Yilin
    Xu, Jiao
    Chen, Miaoxia
    Wang, Changfa
    Li, Shuai Cheng
    BMC BIOINFORMATICS, 2019, 20 (01)
  • [36] The Benefits of Whole-Genome Sequencing Now and in the Future
    Khromykh, Alina
    Solomon, Benjamin D.
    MOLECULAR SYNDROMOLOGY, 2015, 6 (03) : 108 - 109
  • [37] Opportunities and challenges of whole-genome and -exome sequencing
    Britt-Sabina Petersen
    Broder Fredrich
    Marc P. Hoeppner
    David Ellinghaus
    Andre Franke
    BMC Genetics, 18
  • [38] Selective Whole-Genome Amplification as a Tool to Enrich Specimens with Low Treponema pallidum Genomic DNA Copies for Whole-Genome Sequencing
    Thurlow, Charles M.
    Joseph, Sandeep J.
    Ganova-Raeva, Lilia
    Katz, Samantha S.
    Pereira, Lara
    Chen, Cheng
    Debra, Alyssa
    Vilfort, Kendra
    Workowski, Kimberly
    Cohen, Stephanie E.
    Reno, Hilary
    Sun, Yongcheng
    Burroughs, Mark
    Sheth, Mili
    Chi, Kai-Hua
    Danavall, Damien
    Philip, Susan S.
    Cao, Weiping
    Kersh, Ellen N.
    Pillay, Allan
    MSPHERE, 2022, 7 (03)
  • [39] Prediction of antimicrobial resistance in clinicalCampylobacter jejuniisolates from whole-genome sequencing data
    Dahl, Louise Gade
    Joensen, Katrine Grimstrup
    Osterlund, Mark Thomas
    Kiil, Kristoffer
    Nielsen, Eva Moller
    EUROPEAN JOURNAL OF CLINICAL MICROBIOLOGY & INFECTIOUS DISEASES, 2021, 40 (04) : 673 - 682
  • [40] STaRRRT: a table of short tandem repeats in regulatory regions of the human genome
    Bolton, Katherine A.
    Ross, Jason P.
    Grice, Desma M.
    Bowden, Nikola A.
    Holliday, Elizabeth G.
    Avery-Kiejda, Kelly A.
    Scott, Rodney J.
    BMC GENOMICS, 2013, 14