RF: A method for filtering short reads with tandem repeats for genome mapping

被引:7
|
作者
Misawa, Kazuharu [1 ]
机构
[1] RIKEN, Res Program Computat Sci, Res & Dev Grp Next Generat Integrated Living Matt, Fus Data & Anal Res & Dev Team, Yokohama, Kanagawa 2300045, Japan
关键词
Tandem repeats; Human genome; Mapping; Next-generation sequencing; REPETITIVE DNA; ALIGNMENT; PARAMETERS; ELEMENTS;
D O I
10.1016/j.ygeno.2013.03.002
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Next-generation sequencing platforms generate short (50-150 bp) reads that can be mapped onto the reference genome. Repetitive sequences in the genome, because of the presence of similar or identical sequences, cause mapping errors in the case of the short reads. By filtering short reads with repeats, mapping will be improved. I developed RF. RF is a new method that filters short reads with tandem repeats. A scoring scheme was developed that assigned higher scores to regions with tandem repeats and lower scores to regions without tandem repeats. In this study, IF was applied to filter out short reads with repeats, before short reads were mapped onto the same genomic contig by using a short read-mapping program. The result suggests RF improved the proportion of correctly mapped short reads on filtering the repeats. RF is a useful tool for reducing mapping errors of short reads onto reference genomes. (C) 2013 Elsevier Inc. All rights reserved.
引用
收藏
页码:35 / 37
页数:3
相关论文
共 50 条
  • [21] DINTD: Detection and Inference of Tandem Duplications From Short Sequencing Reads
    Dong, Jinxin
    Qi, Minyong
    Wang, Shaoqiang
    Yuan, Xiguo
    FRONTIERS IN GENETICS, 2020, 11
  • [22] Automated Reconstruction of Whole-Genome Phylogenies from Short-Sequence Reads
    Bertels, Frederic
    Silander, Olin K.
    Pachkov, Mikhail
    Rainey, Paul B.
    van Nimwegen, Erik
    MOLECULAR BIOLOGY AND EVOLUTION, 2014, 31 (05) : 1077 - 1088
  • [23] Genome-wide analysis of transposable elements and tandem repeats in the compact placozoan genome
    Wang, Shi
    Zhang, Lingling
    Meyer, Eli
    Bao, Zhenmin
    BIOLOGY DIRECT, 2010, 5
  • [24] The effects of transcription and recombination on mutational dynamics of short tandem repeats
    Zavodna, Monika
    Bagshaw, Andrew
    Brauning, Rudiger
    Gemmell, Neil J.
    NUCLEIC ACIDS RESEARCH, 2018, 46 (03) : 1321 - 1330
  • [25] Short Tandem Repeats in plants: Genomic distribution and function prediction
    Zhu, Lin
    Wu, Hui
    Li, Huayang
    Tang, Heng
    Zhang, Li
    Xu, Houjuan
    Jiao, Fangchan
    Wang, Nian
    Yang, Long
    ELECTRONIC JOURNAL OF BIOTECHNOLOGY, 2021, 50 : 37 - 44
  • [26] Mapping short reads to a genome without using hash look-up table algorithm and Burrows Wheeler Transformation
    Lin, Chun Yuan
    Huang, Ming-Yuan
    Chu, Chia-Han
    Tang, Petrus
    Tang, Chuan Yi
    BIBMW: 2009 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE WORKSHOP, 2009, : 228 - +
  • [27] An improved genome assembly uncovers prolific tandem repeats in Atlantic cod
    Ole K. Tørresen
    Bastiaan Star
    Sissel Jentoft
    William B. Reinar
    Harald Grove
    Jason R. Miller
    Brian P. Walenz
    James Knight
    Jenny M. Ekholm
    Paul Peluso
    Rolf B. Edvardsen
    Ave Tooming-Klunderud
    Morten Skage
    Sigbjørn Lien
    Kjetill S. Jakobsen
    Alexander J. Nederbragt
    BMC Genomics, 18
  • [28] Physical location of tandem repeats in the wheat genome and application for chromosome identification
    Tao Lang
    Guangrong Li
    Hongjin Wang
    Zhihui Yu
    Qiheng Chen
    Ennian Yang
    Shulan Fu
    Zongxiang Tang
    Zujun Yang
    Planta, 2019, 249 : 663 - 675
  • [29] Toward perfect reads: self-correction of short reads via mapping on de Bruijn graphs
    Limasset, Antoine
    Flot, Jean-Francois
    Peterlongo, Pierre
    BIOINFORMATICS, 2020, 36 (05) : 1374 - 1381
  • [30] Genome-Wide Analysis of Tandem Repeats in Plants and Green Algae
    Zhao, Zhixin
    Guo, Cheng
    Sutharzan, Sreeskandarajan
    Li, Pei
    Echt, Craig S.
    Zhang, Jie
    Liang, Chun
    G3-GENES GENOMES GENETICS, 2014, 4 (01): : 67 - 78