Fast and accurate mutation detection in whole genome sequences of multiple isogenic samples with IsoMut

被引:22
作者
Pipek, O. [1 ]
Ribli, D. [1 ]
Molnar, J. [2 ]
Poti, A. [2 ]
Krzystanek, M. [3 ]
Bodor, A. [1 ]
Tusnady, G. E. [2 ]
Szallasi, Z. [3 ,4 ,5 ,6 ]
Csabai, I. [1 ]
Szuts, D. [2 ]
机构
[1] Eotvos Lorand Univ, Dept Phys & Complex Syst, H-1117 Budapest, Hungary
[2] Hungarian Acad Sci, Inst Enzymol, Res Ctr Nat Sci, H-1117 Budapest, Hungary
[3] Tech Univ Denmark, Ctr Biol Sequence Anal, Dept Syst Biol, DK-2800 Lyngby, Denmark
[4] Boston Childrens Hosp, CHIP, Boston, MA 02115 USA
[5] Harvard Med Sch, Boston, MA 02215 USA
[6] Semmelweis Univ, Dept Pathol, Brain Metastasis Res Grp, MTA SE NAP, H-1091 Budapest, Hungary
基金
匈牙利科学研究基金会;
关键词
Next generation sequencing; Mutagenesis; Somatic mutation detection; Multiple isogenic samples; Low false positive rate; Demonstrative algorithm; POINT MUTATIONS; READ ALIGNMENT; CANCER; NUMBER; ERROR;
D O I
10.1186/s12859-017-1492-4
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Detection of somatic mutations is one of the main goals of next generation DNA sequencing. A wide range of experimental systems are available for the study of spontaneous or environmentally induced mutagenic processes. However, most of the routinely used mutation calling algorithms are not optimised for the simultaneous analysis of multiple samples, or for non-human experimental model systems with no reliable databases of common genetic variations. Most standard tools either require numerous in-house post filtering steps with scarce documentation or take an unpractically long time to run. To overcome these problems, we designed the streamlined IsoMut tool which can be readily adapted to experimental scenarios where the goal is the identification of experimentally induced mutations in multiple isogenic samples. Methods: Using 30 isogenic samples, reliable cohorts of validated mutations were created for testing purposes. Optimal values of the filtering parameters of IsoMut were determined in a thorough and strict optimization procedure based on these test sets. Results: We show that IsoMut, when tuned correctly, decreases the false positive rate compared to conventional tools in a 30 sample experimental setup; and detects not only single nucleotide variations, but short insertions and deletions as well. IsoMut can also be run more than a hundred times faster than the most precise state of art tool, due its straightforward and easily understandable filtering algorithm. Conclusions: IsoMut has already been successfully applied in multiple recent studies to find unique, treatment induced mutations in sets of isogenic samples with very low false positive rates. These types of studies provide an important contribution to determining the mutagenic effect of environmental agents or genetic defects, and IsoMut turned out to be an invaluable tool in the analysis of such data.
引用
收藏
页数:11
相关论文
共 30 条
[1]   An integrated map of genetic variation from 1,092 human genomes [J].
Altshuler, David M. ;
Durbin, Richard M. ;
Abecasis, Goncalo R. ;
Bentley, David R. ;
Chakravarti, Aravinda ;
Clark, Andrew G. ;
Donnelly, Peter ;
Eichler, Evan E. ;
Flicek, Paul ;
Gabriel, Stacey B. ;
Gibbs, Richard A. ;
Green, Eric D. ;
Hurles, Matthew E. ;
Knoppers, Bartha M. ;
Korbel, Jan O. ;
Lander, Eric S. ;
Lee, Charles ;
Lehrach, Hans ;
Mardis, Elaine R. ;
Marth, Gabor T. ;
McVean, Gil A. ;
Nickerson, Deborah A. ;
Schmidt, Jeanette P. ;
Sherry, Stephen T. ;
Wang, Jun ;
Wilson, Richard K. ;
Gibbs, Richard A. ;
Dinh, Huyen ;
Kovar, Christie ;
Lee, Sandra ;
Lewis, Lora ;
Muzny, Donna ;
Reid, Jeff ;
Wang, Min ;
Wang, Jun ;
Fang, Xiaodong ;
Guo, Xiaosen ;
Jian, Min ;
Jiang, Hui ;
Jin, Xin ;
Li, Guoqing ;
Li, Jingxiang ;
Li, Yingrui ;
Li, Zhuo ;
Liu, Xiao ;
Lu, Yao ;
Ma, Xuedi ;
Su, Zhe ;
Tai, Shuaishuai ;
Tang, Meifang .
NATURE, 2012, 491 (7422) :56-65
[2]   Subclonal phylogenetic structures in cancer revealed by ultra-deep sequencing [J].
Campbell, Peter J. ;
Pleasance, Erin D. ;
Stephens, Philip J. ;
Dicks, Ed ;
Rance, Richard ;
Goodhead, Ian ;
Follows, George A. ;
Green, Anthony R. ;
Futreal, P. Andy ;
Stratton, Michael R. .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2008, 105 (35) :13081-13086
[3]   Sensitive detection of somatic point mutations in impure and heterogeneous cancer samples [J].
Cibulskis, Kristian ;
Lawrence, Michael S. ;
Carter, Scott L. ;
Sivachenko, Andrey ;
Jaffe, David ;
Sougnez, Carrie ;
Gabriel, Stacey ;
Meyerson, Matthew ;
Lander, Eric S. ;
Getz, Gad .
NATURE BIOTECHNOLOGY, 2013, 31 (03) :213-219
[4]   BRAFL597 Mutations in Melanoma Are Associated with Sensitivity to MEK Inhibitors [J].
Dahlman, Kimberly Brown ;
Xia, Junfeng ;
Hutchinson, Katherine ;
Ng, Charles ;
Hucks, Donald ;
Jia, Peilin ;
Atefi, Mohammad ;
Su, Zengliu ;
Branch, Suzanne ;
Lyle, Pamela L. ;
Hicks, Donna J. ;
Bozon, Viviana ;
Glaspy, John A. ;
Rosen, Neal ;
Solit, David B. ;
Netterville, James L. ;
Vnencak-Jones, Cindy L. ;
Sosman, Jeffrey A. ;
Ribas, Antoni ;
Zhao, Zhongming ;
Pao, William .
CANCER DISCOVERY, 2012, 2 (09) :791-797
[5]   Targeted next generation sequencing of clinically significant gene mutations and translocations in leukemia [J].
Duncavage, Eric J. ;
Abel, Haley J. ;
Szankasi, Philippe ;
Kelley, Todd W. ;
Pfeifer, John D. .
MODERN PATHOLOGY, 2012, 25 (06) :795-804
[6]   SAMBLASTER: fast duplicate marking and structural variant read extraction [J].
Faust, Gregory G. ;
Hall, Ira M. .
BIOINFORMATICS, 2014, 30 (17) :2503-2505
[7]   Ensembl 2014 [J].
Flicek, Paul ;
Amode, M. Ridwan ;
Barrell, Daniel ;
Beal, Kathryn ;
Billis, Konstantinos ;
Brent, Simon ;
Carvalho-Silva, Denise ;
Clapham, Peter ;
Coates, Guy ;
Fitzgerald, Stephen ;
Gil, Laurent ;
Giron, Carlos Garcia ;
Gordon, Leo ;
Hourlier, Thibaut ;
Hunt, Sarah ;
Johnson, Nathan ;
Juettemann, Thomas ;
Kaehaeri, Andreas K. ;
Keenan, Stephen ;
Kulesha, Eugene ;
Martin, Fergal J. ;
Maurel, Thomas ;
McLaren, William M. ;
Murphy, Daniel N. ;
Nag, Rishi ;
Overduin, Bert ;
Pignatelli, Miguel ;
Pritchard, Bethan ;
Pritchard, Emily ;
Riat, Harpreet S. ;
Ruffier, Magali ;
Sheppard, Daniel ;
Taylor, Kieron ;
Thormann, Anja ;
Trevanion, Stephen J. ;
Vullo, Alessandro ;
Wilder, Steven P. ;
Wilson, Mark ;
Zadissa, Amonida ;
Aken, Bronwen L. ;
Birney, Ewan ;
Cunningham, Fiona ;
Harrow, Jennifer ;
Herrero, Javier ;
Hubbard, Tim J. P. ;
Kinsella, Rhoda ;
Muffato, Matthieu ;
Parker, Anne ;
Spudich, Giulietta ;
Yates, Andy .
NUCLEIC ACIDS RESEARCH, 2014, 42 (D1) :D749-D755
[8]   From next-generation sequencing alignments to accurate comparison and validation of single-nucleotide variants: the pibase software [J].
Forster, Michael ;
Forster, Peter ;
Elsharawy, Abdou ;
Hemmrich, Georg ;
Kreck, Benjamin ;
Wittig, Michael ;
Thomsen, Ingo ;
Stade, Bjoern ;
Barann, Matthias ;
Ellinghaus, David ;
Petersen, Britt-Sabina ;
May, Sandra ;
Melum, Espen ;
Schilhabel, Markus B. ;
Keller, Andreas ;
Schreiber, Stefan ;
Rosenstiel, Philip ;
Franke, Andre .
NUCLEIC ACIDS RESEARCH, 2013, 41 (01) :e16
[9]   Targeted next-generation sequencing detects point mutations, insertions, deletions and balanced chromosomal rearrangements as well as identifies novel leukemia-specific fusion genes in a single procedure [J].
Grossmann, V. ;
Kohlmann, A. ;
Klein, H-U ;
Schindela, S. ;
Schnittger, S. ;
Dicker, F. ;
Dugas, M. ;
Kern, W. ;
Haferlach, T. ;
Haferlach, C. .
LEUKEMIA, 2011, 25 (04) :671-680
[10]  
Johnson GE, 2012, METHODS MOL BIOL, V817, P55, DOI 10.1007/978-1-61779-421-6_4