From Wet-Lab to Variations: Concordance and Speed of Bioinformatics Pipelines for Whole Genome and Whole Exome Sequencing

Cited by: 40
Authors
Laurie, Steve [1, 2]
Fernandez-Callejo, Marcos [1, 2]
Marco-Sola, Santiago [1, 2]
Trotta, Jean-Remi [1, 2]
Camps, Jordi [1, 2]
Chacon, Alejandro [3]
Espinosa, Antonio [3]
Gut, Marta [1, 2]
Gut, Ivo [1, 2]
Heath, Simon [1, 2]
Beltran, Sergi [1, 2]
Affiliations
[1] BIST, Ctr Genom Regulat CRG, CNAG CRG, Baldiri & Reixac 4, Barcelona 08028, Spain
[2] UPF, Barcelona, Spain
[3] Univ Autonoma Barcelona, Bellaterra, Spain
Keywords
whole genome sequencing; whole exome sequencing; NGS; NA12878; alignment; variant calling; bioinformatics; computing speed; benchmark; DISCOVERY; GENERATION; FRAMEWORK; SNP
DOI
10.1002/humu.23114
Chinese Library Classification number
Q3 [Genetics]
Discipline classification codes
071007; 090102
Abstract
As whole genome sequencing becomes cheaper and faster, it will progressively substitute targeted next-generation sequencing as standard practice in research and diagnostics. However, computing cost-performance ratio is not advancing at an equivalent rate. Therefore, it is essential to evaluate the robustness of the variant detection process taking into account the computing resources required. We have benchmarked six combinations of state-of-the-art read aligners (BWA-MEM and GEM3) and variant callers (FreeBayes, GATK HaplotypeCaller, SAMtools) on whole genome and whole exome sequencing data from the NA12878 human sample. Results have been compared between them and against the NIST Genome in a Bottle (GIAB) variants reference dataset. We report differences in speed of up to 20 times in some steps of the process and have observed that SNV, and to a lesser extent InDel, detection is highly consistent in 70% of the genome. SNV, and especially InDel, detection is less reliable in 20% of the genome, and almost unfeasible in the remaining 10%. These findings will aid in choosing the appropriate tools bearing in mind objectives, workload, and computing infrastructure available. Published 2016 Wiley Periodicals, Inc.
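To make the idea of comparing a call set against a reference dataset concrete, the following is a minimal sketch, not the authors' method. It assumes each variant can be reduced to a hypothetical `(chrom, pos, ref, alt)` tuple and that exact matching is sufficient; real benchmarks against GIAB typically also normalize variant representation and restrict the comparison to high-confidence regions. The function name `concordance` and the toy data are illustrative only.

```python
from typing import NamedTuple, Set, Tuple

# Hypothetical variant key: (chromosome, 1-based position, ref allele, alt allele)
Variant = Tuple[str, int, str, str]


class Concordance(NamedTuple):
    true_positives: int
    false_positives: int
    false_negatives: int
    precision: float
    recall: float


def concordance(calls: Set[Variant], truth: Set[Variant]) -> Concordance:
    """Compare a pipeline's call set with a truth set by exact key matching."""
    tp = len(calls & truth)   # called and present in the truth set
    fp = len(calls - truth)   # called but absent from the truth set
    fn = len(truth - calls)   # in the truth set but not called
    precision = tp / (tp + fp) if calls else 0.0
    recall = tp / (tp + fn) if truth else 0.0
    return Concordance(tp, fp, fn, precision, recall)


# Toy data (invented for illustration, not taken from the paper)
truth = {("chr1", 100, "A", "G"), ("chr1", 200, "C", "T"), ("chr2", 50, "G", "GA")}
calls = {("chr1", 100, "A", "G"), ("chr2", 50, "G", "GA"), ("chr3", 10, "T", "C")}

result = concordance(calls, truth)
print(result)
```

The same function can be applied pairwise to the call sets of two pipelines (treating one as the reference) to estimate how consistent they are with each other.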
Pages: 1263-1271
Page count: 9