Evaluation and assessment of read-mapping by multiple next-generation sequencing aligners based on genome-wide characteristics

被引:51
|
作者
Thankaswamy-Kosalai, Subazini [1 ]
Sen, Partho [1 ]
Nookaew, Intawat [1 ,2 ]
机构
[1] Chalmers Univ Technol, Dept Biol & Biol Engn, Kemivagen 10, SE-41296 Gothenburg, Sweden
[2] Univ Arkansas Med Sci, Dept Biomed Informat, Coll Med, Little Rock, AR 72205 USA
关键词
Next-generation sequencing; NGS; Aligners; Alignments; Mapping; Algorithm; Reads; Genome; TANDEM REPEATS;
D O I
10.1016/j.ygeno.2017.03.001
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Massive data produced due to the advent of next-generation sequencing (NGS) technology is widely used for biological researches and medical diagnosis. The crucial step in NGS analysis is read alignment or mapping which is computationally intensive and complex. The mapping bias tends to affect the downstream analysis, including detection of polymorphisms. In order to provide guidelines to the biologist for suitable selection of aligners; we have evaluated and benchmarked 5 different aligners (BWA, Bowtie2, NovoAlign, Smalt and Stampy) and their mapping bias based on characteristics of 5 microbial genomes. Two million simulated read pairs of various sizes (36 bp, 50 bp, 72 bp, 100 bp, 125 bp, 150 bp, 200 bp, 250 bp and 300 bp) were aligned. Specific alignment features such as sensitivity of mapping, percentage of properly paired reads, alignment time and effect of tandem repeats on incorrectly mapped reads were evaluated. BWA showed faster alignment followed by Bowtie2 and Smalt. NovoAlign and Stampy were comparatively slower. Most of the aligners showed high sensitivity towards long reads (> 100 bp) mapping. On the other hand NovoAlign showed higher sensitivity towards both short reads (36 bp, 50 bp, 72 bp) and long reads (> 100 bp) mappings; It also showed higher sensitivity towards mapping a complex genome like Plasmodium falciparum. The percentage of properly paired reads aligned by NovoAlign, BWA and Stampy were markedly higher. None of the aligners outperforms the others in the benchmark, however the aligners perform differently with genome characteristics. We expect that the results from this study will be useful for the end user to choose aligner, thus enhance the accuracy of read mapping. (C) 2017 Elsevier Inc. All rights reserved.
引用
收藏
页码:186 / 191
页数:6
相关论文
共 50 条
  • [41] A genome-wide analysis of simple sequence repeats in maize and the development of polymorphism markers from next-generation sequence data
    Qu J.
    Liu J.
    BMC Research Notes, 6 (1)
  • [42] Next-generation sequencing based Cyperus niveus (Cyperaceae) complete chloroplast genome: A comparative analysis and phylogeny
    Shahbaz, Muhammad
    Hayat, Muhammad Qasim
    Ahmad, Ibrar
    Fakhar, Hafiz Imran
    Shah, Iqra
    KOREAN JOURNAL OF PLANT TAXONOMY, 2024, 54 (02): : 99 - 109
  • [43] Assembly-free genome comparison based on next-generation sequencing reads and variable length patterns
    Matteo Comin
    Michele Schimd
    BMC Bioinformatics, 15
  • [44] ANALYSIS OF STRUCTURAL AND FUNCTIONAL ORGANIZATION OF THE CURLY BIRCH CHLOROPLAST GENOME BASED ON THE NEXT-GENERATION SEQUENCING DATA
    Baranov, Oleg Yu.
    Kiryanov, Pavel S.
    Pantelev, Stanislav V.
    Mozharovskaya, Ludmila V.
    Padutov, Alexandr V.
    Razumova, Olga A.
    Padutov, Vladimir E.
    DOKLADY NATSIONALNOI AKADEMII NAUK BELARUSI, 2019, 63 (03): : 312 - 316
  • [45] Assembly-free genome comparison based on next-generation sequencing reads and variable length patterns
    Comin, Matteo
    Schimd, Michele
    BMC BIOINFORMATICS, 2014, 15
  • [46] Capture-based next-generation sequencing reveals multiple actionable mutations in cancer patients failed in traditional testing
    Xie, Jing
    Lu, Xiongxiong
    Wu, Xue
    Lin, Xiaoyi
    Zhang, Chao
    Huang, Xiaofang
    Chang, Zhili
    Wang, Xinjing
    Wen, Chenlei
    Tang, Xiaomei
    Shi, Minmin
    Zhan, Qian
    Chen, Hao
    Deng, Xiaxing
    Peng, Chenghong
    Li, Hongwei
    Fang, Yuan
    Shao, Yang
    Shen, Baiyong
    MOLECULAR GENETICS & GENOMIC MEDICINE, 2016, 4 (03): : 262 - 272
  • [47] Evaluation of targeted next-generation sequencing-based preimplantation genetic diagnosis of monogenic disease
    Treff, Nathan R.
    Fedick, Anastasia
    Tao, Xin
    Devkota, Batsal
    Taylor, Deanne
    Scott, Richard T., Jr.
    FERTILITY AND STERILITY, 2013, 99 (05) : 1377 - +
  • [48] Evaluation of preimplantation genetic testing based on next-generation sequencing for balanced reciprocal translocation carriers
    Cai, Yunni
    Ding, Min
    Lin, Fei
    Diao, Zhenyu
    Zhang, Ningyuan
    Sun, Haixiang
    Zhou, Jianjun
    REPRODUCTIVE BIOMEDICINE ONLINE, 2019, 38 (05) : 669 - 675
  • [49] Analysis of HBV X gene quasispecies characteristics by next-generation sequencing and cloning-based sequencing and its association with hepatocellular carcinoma progression
    Mei, Fanbiao
    Ren, Jingjing
    Long, Long
    Li, Jilin
    Li, Kezhi
    Liu, Haizhou
    Tang, Yanping
    Fang, Xiang
    Wu, Hanghang
    Xiao, Chanchan
    Huang, Tianren
    Deng, Wei
    JOURNAL OF MEDICAL VIROLOGY, 2019, 91 (06) : 1087 - 1096
  • [50] Screening and Evaluation of Amplicons of Maize Endogenous Reference Genes Based on Next-Generation Sequencing Technology
    Chen L.
    Zhou J.
    Liang J.
    Li T.
    Wang H.
    Fang Z.
    Chen H.
    Peng H.
    Shipin Kexue/Food Science, 2023, 44 (20): : 146 - 154