Evaluation and assessment of read-mapping by multiple next-generation sequencing aligners based on genome-wide characteristics

被引:51
|
作者
Thankaswamy-Kosalai, Subazini [1 ]
Sen, Partho [1 ]
Nookaew, Intawat [1 ,2 ]
机构
[1] Chalmers Univ Technol, Dept Biol & Biol Engn, Kemivagen 10, SE-41296 Gothenburg, Sweden
[2] Univ Arkansas Med Sci, Dept Biomed Informat, Coll Med, Little Rock, AR 72205 USA
关键词
Next-generation sequencing; NGS; Aligners; Alignments; Mapping; Algorithm; Reads; Genome; TANDEM REPEATS;
D O I
10.1016/j.ygeno.2017.03.001
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Massive data produced due to the advent of next-generation sequencing (NGS) technology is widely used for biological researches and medical diagnosis. The crucial step in NGS analysis is read alignment or mapping which is computationally intensive and complex. The mapping bias tends to affect the downstream analysis, including detection of polymorphisms. In order to provide guidelines to the biologist for suitable selection of aligners; we have evaluated and benchmarked 5 different aligners (BWA, Bowtie2, NovoAlign, Smalt and Stampy) and their mapping bias based on characteristics of 5 microbial genomes. Two million simulated read pairs of various sizes (36 bp, 50 bp, 72 bp, 100 bp, 125 bp, 150 bp, 200 bp, 250 bp and 300 bp) were aligned. Specific alignment features such as sensitivity of mapping, percentage of properly paired reads, alignment time and effect of tandem repeats on incorrectly mapped reads were evaluated. BWA showed faster alignment followed by Bowtie2 and Smalt. NovoAlign and Stampy were comparatively slower. Most of the aligners showed high sensitivity towards long reads (> 100 bp) mapping. On the other hand NovoAlign showed higher sensitivity towards both short reads (36 bp, 50 bp, 72 bp) and long reads (> 100 bp) mappings; It also showed higher sensitivity towards mapping a complex genome like Plasmodium falciparum. The percentage of properly paired reads aligned by NovoAlign, BWA and Stampy were markedly higher. None of the aligners outperforms the others in the benchmark, however the aligners perform differently with genome characteristics. We expect that the results from this study will be useful for the end user to choose aligner, thus enhance the accuracy of read mapping. (C) 2017 Elsevier Inc. All rights reserved.
引用
收藏
页码:186 / 191
页数:6
相关论文
共 50 条
  • [31] Hypotrichosis with juvenile macular dystrophy: Combination of whole-genome sequencing and genome-wide homozygosity mapping identifies a large deletion in CDH3 initially undetected by whole-exome sequencing-A lesson from next-generation sequencing
    Saeidian, Amir Hossein
    Vahidnezhad, Hassan
    Youssefian, Leila
    Sotudeh, Soheila
    Sargazi, Meisam
    Zeinali, Sirous
    Uitto, Jouni
    MOLECULAR GENETICS & GENOMIC MEDICINE, 2019, 7 (11):
  • [32] Evaluation of next-generation sequencing versus next-generation flow cytometry for minimal-residual-disease detection in Chinese patients with multiple myeloma
    Mo Zhou
    Yan Chen
    Yanlei Gong
    Mingqing Zhu
    Jiannong Cen
    Jinlan Pan
    Lingzhi Yan
    Jingjing Shang
    Song Jin
    Xiaolan Shi
    Weiqin Yao
    Shuang Yan
    Depei Wu
    Suning Chen
    Chengcheng Fu
    Li Yao
    Discover Oncology, 15
  • [33] Evaluation of next-generation sequencing versus next-generation flow cytometry for minimal-residual-disease detection in Chinese patients with multiple myeloma
    Zhou, Mo
    Chen, Yan
    Gong, Yanlei
    Zhu, Mingqing
    Cen, Jiannong
    Pan, Jinlan
    Yan, Lingzhi
    Shang, Jingjing
    Jin, Song
    Shi, Xiaolan
    Yao, Weiqin
    Yan, Shuang
    Wu, Depei
    Chen, Suning
    Fu, Chengcheng
    Yao, Li
    DISCOVER ONCOLOGY, 2024, 15 (01)
  • [34] Read the clonotype: Next-generation sequencing-based lymphocyte clonality analysis and perspectives for application in pathology
    Groenen, Patricia J. T. A.
    van den Brand, Michiel
    Kroeze, Leonie I.
    Amir, Avital L.
    Hebeda, Konnie M.
    FRONTIERS IN ONCOLOGY, 2023, 13
  • [35] Systematic comparison of variant calling pipelines of target genome sequencing cross multiple next-generation sequencers
    Feng, Baosheng
    Lai, Juan
    Fan, Xue
    Liu, Yongfeng
    Wang, Miao
    Wu, Ping
    Zhou, Zhiliang
    Yan, Qin
    Sun, Lei
    FRONTIERS IN GENETICS, 2024, 14
  • [36] Genome Survey Sequencing and Genetic Background Characterization of Ilex chinensis Sims (Aquifoliaceae) Based on Next-Generation Sequencing
    Zhou, Peng
    Li, Jiao
    Huang, Jing
    Li, Fei
    Zhang, Qiang
    Zhang, Min
    PLANTS-BASEL, 2022, 11 (23):
  • [37] Next-generation sequencing in residual liquid-based cytology specimens for cancer genome analysis
    Yamaguchi, Tomomi
    Akahane, Toshiaki
    Harada, Ohi
    Kato, Yasutaka
    Aimono, Eriko
    Takei, Hidehiro
    Tasaki, Takashi
    Noguchi, Hirotsugu
    Nishihara, Hiroshi
    Kamata, Hajime
    Tanimoto, Akihide
    DIAGNOSTIC CYTOPATHOLOGY, 2020, 48 (11) : 965 - 971
  • [38] A genome-wide analysis of lentivector integration sites using targeted sequence capture and next generation sequencing technology
    Ustek, Duran
    Sirma, Sema
    Gumus, Ergun
    Arikan, Muzaffer
    Cakiris, Aris
    Abaci, Neslihan
    Mathew, Jaicy
    Emrence, Zeliha
    Azakli, Hulya
    Cosan, Fulya
    Cakar, Atilla
    Parlak, Mahmut
    Kursun, Olcay
    INFECTION GENETICS AND EVOLUTION, 2012, 12 (07) : 1349 - 1354
  • [39] Genome-wide characterization of copy number variations in the host genome in genetic resistance to Marek’s disease using next generation sequencing
    Hao Bai
    Yanghua He
    Yi Ding
    Qin Chu
    Ling Lian
    Eliyahu M. Heifetz
    Ning Yang
    Hans H. Cheng
    Huanmin Zhang
    Jilan Chen
    Jiuzhou Song
    BMC Genetics, 21
  • [40] Genome-wide characterization of copy number variations in the host genome in genetic resistance to Marek's disease using next generation sequencing
    Bai, Hao
    He, Yanghua
    Ding, Yi
    Chu, Qin
    Lian, Ling
    Heifetz, Eliyahu M.
    Yang, Ning
    Cheng, Hans H.
    Zhang, Huanmin
    Chen, Jilan
    Song, Jiuzhou
    BMC GENETICS, 2020, 21 (01)