STRScan: targeted profiling of short tandem repeats in whole-genome sequencing data

被引:10
|
作者
Tang, Haixu [1 ]
Nzabarushimana, Etienne [1 ]
机构
[1] Indiana Univ, Sch Informat & Comp, 150 S Woodlawn Ave, Bloomington, IN 47405 USA
来源
BMC BIOINFORMATICS | 2017年 / 18卷
基金
美国国家科学基金会;
关键词
Short tandem repeats; Whole-genome sequencing; Algorithm; DNA forensics; PERSONAL GENOMES; LOCI; MICROSATELLITES;
D O I
10.1186/s12859-017-1800-z
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Short tandem repeats (STRs) are found in many prokaryotic and eukaryotic genomes, and are commonly used as genetic markers, in particular for identity and parental testing in DNA forensics. The unstable expansion of some STRs was associated with various genetic disorders (e.g., the Huntington disease), and thus was used in genetic testing for screening individuals at high risk. Traditional STR analyses were based on the PCR amplification of STR loci followed by gel electrophoresis. With the availability of massive whole genome sequencing data, it becomes practical to mine STR profiles in silico from genome sequences. Software tools such as lobSTR and STR-FM have been developed to address these demands, which are, however, built upon whole genome reads mapping tools, and thus may not be sensitive enough. Results: In this paper, we present a standalone software tool STRScan that uses a greedy algorithm for targeted STR profiling in next-generation sequencing (NGS) data. STRScan was tested on the whole genome sequencing data from Venter genome sequencing and 1000 Genomes Project. The results showed that STRScan can profile 20% more STRs in the target set that are missed by lobSTR. Conclusion: STRScan is particularly useful for the NGS-based targeted STR profiling, e.g., in genetic and human identity testing. STRScan is available as open-source software at http://darwin.informatics.indiana.edu/str/.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] STRScan: targeted profiling of short tandem repeats in whole-genome sequencing data
    Haixu Tang
    Etienne Nzabarushimana
    BMC Bioinformatics, 18
  • [2] Investigation of short tandem repeats in major depression using whole-genome sequencing data
    Yu, Chenglong
    Baune, Bernhard T.
    Wong, Ma-Li
    Licinio, Julio
    JOURNAL OF AFFECTIVE DISORDERS, 2018, 232 : 305 - 309
  • [3] STRsearch: a new pipeline for targeted profiling of short tandem repeats in massively parallel sequencing data
    Wang, Dong
    Tao, Ruiyang
    Li, Zhiqiang
    Pan, Dun
    Wang, Zhuo
    Li, Chengtao
    Shi, Yongyong
    HEREDITAS, 2020, 157 (01)
  • [4] STRsearch: a new pipeline for targeted profiling of short tandem repeats in massively parallel sequencing data
    Dong Wang
    Ruiyang Tao
    Zhiqiang Li
    Dun Pan
    Zhuo Wang
    Chengtao Li
    Yongyong Shi
    Hereditas, 157
  • [5] Systematic Profiling of Short Tandem Repeats in the Cattle Genome
    Xu, Lingyang
    Haasl, Ryan J.
    Sun, Jiajie
    Zhou, Yang
    Bickhart, Derek M.
    Li, Junya
    Song, Jiuzhou
    Sonstegard, Tad S.
    Van Tassell, Curtis P.
    Lewin, Harris A.
    Liu, George E.
    GENOME BIOLOGY AND EVOLUTION, 2017, 9 (01): : 20 - 31
  • [6] PennCNV in whole-genome sequencing data
    Lima, Leandro de Araujo
    Wang, Kai
    BMC BIOINFORMATICS, 2017, 18
  • [7] PennCNV in whole-genome sequencing data
    Leandro de Araújo Lima
    Kai Wang
    BMC Bioinformatics, 18
  • [8] Profiling the Genome-Wide Landscape of Short Tandem Repeats by Long-Read Sequencing
    Liu, Zhenhua
    Zhao, Guihu
    Xiao, Yuhui
    Zeng, Sheng
    Yuan, Yanchun
    Zhou, Xun
    Fang, Zhenghuan
    He, Runcheng
    Li, Bin
    Zhao, Yuwen
    Pan, Hongxu
    Wang, Yige
    Yu, Guoliang
    Peng, I-Feng
    Wang, Depeng
    Meng, Qingtuan
    Xu, Qian
    Sun, Qiying
    Yan, Xinxiang
    Shen, Lu
    Jiang, Hong
    Xia, Kun
    Wang, Junling
    Guo, Jifeng
    Liang, Fan
    Li, Jinchen
    Tang, Beisha
    FRONTIERS IN GENETICS, 2022, 13
  • [9] Personalized pharmacogenomics profiling using whole-genome sequencing
    Mizzi, Clint
    Peters, Brock
    Mitropoulou, Christina
    Mitropoulos, Konstantinos
    Katsila, Theodora
    Agarwal, Misha R.
    van Schaik, Ron H. N.
    Drmanac, Radoje
    Borg, Joseph
    Patrinos, George P.
    PHARMACOGENOMICS, 2014, 15 (09) : 1223 - 1234
  • [10] Aspergillus Outbreak in an Intensive Care Unit: Source Analysis with Whole Genome Sequencing and Short Tandem Repeats
    Hiel, Stephan J. P.
    Hendriks, Amber C. A.
    Eijkenboom, Jos J. A.
    Bosch, Thijs
    Coolen, Jordy P. M.
    Melchers, Willem J. G.
    Anrochte, Paul
    Camps, Simone M. T.
    Verweij, Paul E.
    Zhang, Jianhua
    van Dommelen, Laura
    JOURNAL OF FUNGI, 2024, 10 (01)