RF: A method for filtering short reads with tandem repeats for genome mapping

被引:7
|
作者
Misawa, Kazuharu [1 ]
机构
[1] RIKEN, Res Program Computat Sci, Res & Dev Grp Next Generat Integrated Living Matt, Fus Data & Anal Res & Dev Team, Yokohama, Kanagawa 2300045, Japan
关键词
Tandem repeats; Human genome; Mapping; Next-generation sequencing; REPETITIVE DNA; ALIGNMENT; PARAMETERS; ELEMENTS;
D O I
10.1016/j.ygeno.2013.03.002
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Next-generation sequencing platforms generate short (50-150 bp) reads that can be mapped onto the reference genome. Repetitive sequences in the genome, because of the presence of similar or identical sequences, cause mapping errors in the case of the short reads. By filtering short reads with repeats, mapping will be improved. I developed RF. RF is a new method that filters short reads with tandem repeats. A scoring scheme was developed that assigned higher scores to regions with tandem repeats and lower scores to regions without tandem repeats. In this study, IF was applied to filter out short reads with repeats, before short reads were mapped onto the same genomic contig by using a short read-mapping program. The result suggests RF improved the proportion of correctly mapped short reads on filtering the repeats. RF is a useful tool for reducing mapping errors of short reads onto reference genomes. (C) 2013 Elsevier Inc. All rights reserved.
引用
收藏
页码:35 / 37
页数:3
相关论文
共 50 条
  • [31] Fast and memory efficient approach for mapping NGS reads to a reference genome
    Kumar, Sanjeev
    Agarwal, Suneeta
    Ranvijay
    JOURNAL OF BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2019, 17 (02)
  • [32] The efficient algorithm for mapping next generation sequencing reads to reference genome
    Pankiewicz, Patryk
    Kusmirek, Wiktor
    Nowak, Robert M.
    PHOTONICS APPLICATIONS IN ASTRONOMY, COMMUNICATIONS, INDUSTRY, AND HIGH-ENERGY PHYSICS EXPERIMENTS 2019, 2019, 11176
  • [33] Genome-wide analyses of tandem repeats and transposable elements in patchouli
    Liu, Linqiu
    Li, Junjun
    Wen, Jiawei
    He, Yang
    GENES & GENETIC SYSTEMS, 2021, 96 (02) : 81 - 87
  • [34] A Crowdsourced Gameplay for Whole-Genome Assembly via Short Reads
    Gamage, G.
    Perera, I.
    Meedeniya, D.
    Welivita, Anuradha
    INTERNATIONAL JOURNAL OF ONLINE AND BIOMEDICAL ENGINEERING, 2020, 16 (08) : 68 - 84
  • [35] An Efficient Algorithm for Mapping of Reads to a Genome Graph Using an Index Based on Hash Tables and Dynamic Programming
    Petrov S.N.
    Uroshlev L.A.
    Kasyanov A.S.
    Makeev V.Y.
    Biophysics, 2018, 63 (3) : 311 - 317
  • [36] Genome-wide analysis of tandem repeats in Tribolium castaneum genome reveals abundant and highly dynamic tandem repeat families with satellite DNA features in euchromatic chromosomal arms
    Pavlek, Martina
    Gelfand, Yevgeniy
    Plohl, Miroslav
    Mestrovic, Nevenka
    DNA RESEARCH, 2015, 22 (06) : 387 - 401
  • [37] A Bayesian Assignment Method for Ambiguous Bisulfite Short Reads
    Tran, Hong
    Wu, Xiaowei
    Tithi, Saima
    Sun, Ming-An
    Xie, Hehuang
    Zhang, Liqing
    PLOS ONE, 2016, 11 (03):
  • [38] Short Reads, Circular Genome: Skimming SOLiD Sequence to Construct the Bighorn Sheep Mitochondrial Genome
    Miller, Joshua M.
    Malenfant, Rene M.
    Moore, Stephen S.
    Coltman, David W.
    JOURNAL OF HEREDITY, 2012, 103 (01) : 140 - 146
  • [39] Tandem Repeats in the Genome of Sus scrofa, Their Localization on Chromosomes and in the Spermatogenic Cell Nuclei
    N. G. Ivanova
    V. N. Stefanova
    D. I. Ostromyshenskii
    O. I. Podgornaya
    Russian Journal of Genetics, 2019, 55 : 835 - 846
  • [40] Tandem and interspersed repeats contribute to the mosaic structure of segmental duplications in the human genome
    Oparina, NY
    Lacroix, MH
    Rychkov, AA
    Mashkova, TD
    MOLECULAR BIOLOGY, 2003, 37 (02) : 200 - 204