STRScan: targeted profiling of short tandem repeats in whole-genome sequencing data

被引:10
|
作者
Tang, Haixu [1 ]
Nzabarushimana, Etienne [1 ]
机构
[1] Indiana Univ, Sch Informat & Comp, 150 S Woodlawn Ave, Bloomington, IN 47405 USA
来源
BMC BIOINFORMATICS | 2017年 / 18卷
基金
美国国家科学基金会;
关键词
Short tandem repeats; Whole-genome sequencing; Algorithm; DNA forensics; PERSONAL GENOMES; LOCI; MICROSATELLITES;
D O I
10.1186/s12859-017-1800-z
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Short tandem repeats (STRs) are found in many prokaryotic and eukaryotic genomes, and are commonly used as genetic markers, in particular for identity and parental testing in DNA forensics. The unstable expansion of some STRs was associated with various genetic disorders (e.g., the Huntington disease), and thus was used in genetic testing for screening individuals at high risk. Traditional STR analyses were based on the PCR amplification of STR loci followed by gel electrophoresis. With the availability of massive whole genome sequencing data, it becomes practical to mine STR profiles in silico from genome sequences. Software tools such as lobSTR and STR-FM have been developed to address these demands, which are, however, built upon whole genome reads mapping tools, and thus may not be sensitive enough. Results: In this paper, we present a standalone software tool STRScan that uses a greedy algorithm for targeted STR profiling in next-generation sequencing (NGS) data. STRScan was tested on the whole genome sequencing data from Venter genome sequencing and 1000 Genomes Project. The results showed that STRScan can profile 20% more STRs in the target set that are missed by lobSTR. Conclusion: STRScan is particularly useful for the NGS-based targeted STR profiling, e.g., in genetic and human identity testing. STRScan is available as open-source software at http://darwin.informatics.indiana.edu/str/.
引用
收藏
页数:6
相关论文
共 50 条
  • [41] Short Insertion and Deletion Discoveries via Whole-Genome Sequencing of 101 Thoroughbred Racehorses
    Tozaki, Teruaki
    Ohnuma, Aoi
    Kikuchi, Mio
    Ishige, Taichiro
    Kakoi, Hironaga
    Hirota, Kei-ichi
    Takahashi, Yuji
    Nagata, Shun-ichi
    GENES, 2023, 14 (03)
  • [42] STRipy: A graphical application for enhanced genotyping of pathogenic short tandem repeats in sequencing data
    Halman, Andreas
    Dolzhenko, Egor
    Oshlack, Alicia
    HUMAN MUTATION, 2022, 43 (07) : 859 - 868
  • [43] STaRRRT: a table of short tandem repeats in regulatory regions of the human genome
    Katherine A Bolton
    Jason P Ross
    Desma M Grice
    Nikola A Bowden
    Elizabeth G Holliday
    Kelly A Avery-Kiejda
    Rodney J Scott
    BMC Genomics, 14
  • [44] A portable and scalable workflow for detecting structural variants in whole-genome sequencing data
    Kuzniar, Arnold
    Maassen, Jason
    Verhoeven, Stefan
    Santuari, Luca
    Shneider, Carl
    Kloosterman, Wigard
    de Bidder, Jeroen
    2018 IEEE 14TH INTERNATIONAL CONFERENCE ON E-SCIENCE (E-SCIENCE 2018), 2018, : 303 - 304
  • [45] Identifying and mitigating batch effects in whole genome sequencing data
    Tom, Jennifer A.
    Reeder, Jens
    Forrest, William F.
    Graham, Robert R.
    Hunkapiller, Julie
    Behrens, Timothy W.
    Bhangale, Tushar R.
    BMC BIOINFORMATICS, 2017, 18
  • [46] Identification and characterization of short tandem repeats in the Tibetan macaque genome based on resequencing data
    San-Xu Liu
    Wei Hou
    Xue-Yan Zhang
    Chang-Jun Peng
    Bi-Song Yue
    Zhen-Xin Fan
    Jing Li
    Zoological Research, 2018, 39 (04) : 291 - 300
  • [47] Identification and characterization of short tandem repeats in the Tibetan macaque genome based on resequencing data
    Liu, San-Xu
    Hou, Wei
    Zhang, Xue-Yan
    Peng, Chang-Jun
    Yue, Bi-Song
    Fan, Zhen-Xin
    Li, Jing
    ZOOLOGICAL RESEARCH, 2018, 39 (04) : 291 - 300
  • [48] Detecting short tandem repeats from genome data: opening the software black box
    Merkel, Angelika
    Gemmell, Neil
    BRIEFINGS IN BIOINFORMATICS, 2008, 9 (05) : 355 - 366
  • [49] Public perceptions of bacterial whole-genome sequencing for tuberculosis
    Davies, Anna
    Scott, Stephen
    Badger, Shirlene
    Toeroek, M. Estee
    Peacock, Sharon J.
    TRENDS IN GENETICS, 2015, 31 (02) : 58 - 60
  • [50] A custom hepatitis A virus assay for whole-genome sequencing
    Cleary, Nora G.
    Bryant, Patrick W.
    Lamson, Daryl M.
    Newman, Alexandra P.
    George, Kirsten St.
    JOURNAL OF VIROLOGICAL METHODS, 2023, 312