RF: A method for filtering short reads with tandem repeats for genome mapping

被引:7
|
作者
Misawa, Kazuharu [1 ]
机构
[1] RIKEN, Res Program Computat Sci, Res & Dev Grp Next Generat Integrated Living Matt, Fus Data & Anal Res & Dev Team, Yokohama, Kanagawa 2300045, Japan
关键词
Tandem repeats; Human genome; Mapping; Next-generation sequencing; REPETITIVE DNA; ALIGNMENT; PARAMETERS; ELEMENTS;
D O I
10.1016/j.ygeno.2013.03.002
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Next-generation sequencing platforms generate short (50-150 bp) reads that can be mapped onto the reference genome. Repetitive sequences in the genome, because of the presence of similar or identical sequences, cause mapping errors in the case of the short reads. By filtering short reads with repeats, mapping will be improved. I developed RF. RF is a new method that filters short reads with tandem repeats. A scoring scheme was developed that assigned higher scores to regions with tandem repeats and lower scores to regions without tandem repeats. In this study, IF was applied to filter out short reads with repeats, before short reads were mapped onto the same genomic contig by using a short read-mapping program. The result suggests RF improved the proportion of correctly mapped short reads on filtering the repeats. RF is a useful tool for reducing mapping errors of short reads onto reference genomes. (C) 2013 Elsevier Inc. All rights reserved.
引用
收藏
页码:35 / 37
页数:3
相关论文
共 50 条
  • [1] Tandem repeats in the rodent genome and their mapping
    Ostromyshenskii D.I.
    Kuznetsova I.S.
    Komissarov A.S.
    Kartavtseva I.V.
    Podgornaya O.I.
    Cell and Tissue Biology, 2015, 9 (3) : 217 - 225
  • [2] Genome (in)stability at tandem repeats
    Balzano, Elisa
    Pelliccia, Franca
    Giunta, Simona
    SEMINARS IN CELL & DEVELOPMENTAL BIOLOGY, 2021, 113 : 97 - 112
  • [3] Resolving complex tandem repeats with long reads
    Ummat, Ajay
    Bashir, Ali
    BIOINFORMATICS, 2014, 30 (24) : 3491 - 3498
  • [4] CoLoRMap: Correcting Long Reads by Mapping short reads
    Haghshenas, Ehsan
    Hach, Faraz
    Sahinalp, S. Cenk
    Chauve, Cedric
    BIOINFORMATICS, 2016, 32 (17) : 545 - 551
  • [5] LongTR: genome-wide profiling of genetic variation at tandem repeats from long reads
    Jam, Helyaneh Ziaei
    Zook, Justin M.
    Javadzadeh, Sara
    Park, Jonghun
    Sehgal, Aarushi
    Gymrek, Melissa
    GENOME BIOLOGY, 2024, 25 (01):
  • [6] Distinct Mutational Behaviors Differentiate Short Tandem Repeats from Microsatellites in the Human Genome
    Ananda, Guruprasad
    Walsh, Erin
    Jacob, Kimberly D.
    Krasilnikova, Maria
    Eckert, Kristin A.
    Chiaromonte, Francesca
    Makova, Kateryna D.
    GENOME BIOLOGY AND EVOLUTION, 2013, 5 (03): : 606 - 620
  • [7] Short tandem repeats of human genome are intrinsically unstable in cultured cells in vivo
    Liu, Yuzhe
    Li, Jinhuan
    Wu, Qiang
    GENE, 2023, 877
  • [8] G-MAPSEQ - A NEW METHOD FOR MAPPING READS TO A REFERENCE GENOME
    Wojciechowski, Pawel
    Frohmberg, Wojciech
    Kierzynka, Michal
    Zurkowski, Piotr
    Blazewicz, Jacek
    FOUNDATIONS OF COMPUTING AND DECISION SCIENCES, 2016, 41 (02) : 123 - 142
  • [9] An enrichment method for mapping ambiguous reads to the reference genome for NGS analysis
    Liu, Yuan
    Ma, Yongchao
    Salsman, Evan
    Manthey, Frank A.
    Elias, Elias M.
    Li, Xuehui
    Yan, Changhui
    JOURNAL OF BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2019, 17 (06)
  • [10] ReviSTER: an automated pipeline to revise misaligned reads to simple tandem repeats
    Tae, Hongseok
    McMahon, Kevin W.
    Settlage, Robert E.
    Bavarva, Jasmin H.
    Garner, Harold R.
    BIOINFORMATICS, 2013, 29 (14) : 1734 - 1741