Dinosaur: A Refined Open-Source Peptide MS Feature Detector

被引:48
作者
Teleman, Johan [1 ,2 ]
Chawade, Aakash [1 ]
Sandin, Marianne [1 ]
Levander, Fredrik [1 ,3 ]
Malmstrom, Johan [2 ]
机构
[1] Lund Univ, Dept Immunotechnol, S-22383 Lund, Sweden
[2] Lund Univ, Dept Clin Sci, S-22100 Lund, Sweden
[3] Lund Univ, BILS, S-22383 Lund, Sweden
基金
欧洲研究理事会; 瑞典研究理事会;
关键词
proteomics; mass spectrometry; electrospray ionization; feature detection; chimeric spectra; algorithm; software; MASS-SPECTROMETRIC DATA; OPEN-SOURCE SOFTWARE; LIQUID-CHROMATOGRAPHY; MONOISOTOPIC MASSES; IDENTIFICATION; PROTEOME; FRAMEWORK; ALIGNMENT; WORKFLOW; TOOLS;
D O I
10.1021/acs.jproteome.6b00016
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
In bottom-up mass spectrometry (MS)-based proteomics, peptide isotopic and chromatographic traces (features) are frequently used for label-free quantification in data-dependent acquisition MS but can also be used for the improved identification of chimeric spectra or sample complexity characterization. Feature detection is difficult because of the high complexity of MS proteomics data from biological samples, which frequently causes features to intermingle. In addition, existing feature detection algorithms commonly suffer from compatibility issues, long computation times, or poor performance on high-resolution data. Because of these limitations, we developed a new tool, Dinosaur, with increased speed and versatility. Dinosaur has the functionality to sample algorithm computations through quality-control plots, which we call a plot trail. From the evaluation of this plot trail, we introduce several algorithmic improvements to further improve the robustness and performance of Dinosaur, with the detection of features for 98% of MS/MS identifications in a benchmark data set, and no other algorithm tested in this study passed 96% feature detection. We finally used Dinosaur to reimplement a published workflow for peptide identification in chimeric spectra, increasing chimeric identification from 26% to 32% over the standard workflow. Dinosaur is operating-system-independent and is freely available as open source on https://github.com/fickludd/dinosaur.
引用
收藏
页码:2143 / 2151
页数:9
相关论文
共 45 条
  • [1] Feature detection and alignment of hyphenated chromatographic-mass spectrometric data -: Extraction of pure ion chromatograms using Kalman tracking
    Aberg, K. Magnus
    Torgrip, Ralf J. O.
    Kolmert, Johan
    Schuppe-Koistinen, Ina
    Lindberg, Johan
    [J]. JOURNAL OF CHROMATOGRAPHY A, 2008, 1192 (01) : 139 - 146
  • [2] A suite of algorithms for the comprehensive analysis of complex protein mixtures using high-resolution LC-MS
    Bellew, Matthew
    Coram, Marc
    Fitzgibbon, Matthew
    Igra, Mark
    Randolph, Tim
    Wang, Pei
    May, Damon
    Eng, Jimmy
    Fang, Ruihua
    Lin, ChenWei
    Chen, Jinzhi
    Goodlett, David
    Whiteaker, Jeffrey
    Paulovich, Amanda
    McIntosh, Martin
    [J]. BIOINFORMATICS, 2006, 22 (15) : 1902 - 1909
  • [3] Current challenges in software solutions for mass spectrometry-based quantitative proteomics
    Cappadona, Salvatore
    Baker, Peter R.
    Cutillas, Pedro R.
    Heck, Albert J. R.
    van Breukelen, Bas
    [J]. AMINO ACIDS, 2012, 43 (03) : 1087 - 1108
  • [4] Data Processing Has Major Impact on the Outcome of Quantitative Label-Free LC-MS Analysis
    Chawade, Aakash
    Sandin, Marianne
    Teleman, Johan
    Malmstrom, Johan
    Levander, Fredrik
    [J]. JOURNAL OF PROTEOME RESEARCH, 2015, 14 (02) : 676 - 687
  • [5] Differential dynamics of the mammalian mRNA and protein expression response to misfolding stress
    Cheng, Zhe
    Teo, Guoshou
    Krueger, Sabrina
    Rock, Tara M.
    Koh, Hiromi W. L.
    Choi, Hyungwon
    Vogel, Christine
    [J]. MOLECULAR SYSTEMS BIOLOGY, 2016, 12 (01)
  • [6] Massifquant: open-source Kalman filter-based XC-MS isotope trace feature detection
    Conley, Christopher J.
    Smith, Rob
    Torgrip, Ralf J. O.
    Taylor, Ryan M.
    Tautenhahn, Ralf
    Prince, John T.
    [J]. BIOINFORMATICS, 2014, 30 (18) : 2636 - 2643
  • [7] MaxQuant enables high peptide identification rates, individualized p.p.b.-range mass accuracies and proteome-wide protein quantification
    Cox, Juergen
    Mann, Matthias
    [J]. NATURE BIOTECHNOLOGY, 2008, 26 (12) : 1367 - 1372
  • [8] Targeted Data Extraction of the MS/MS Spectra Generated by Data-independent Acquisition: A New Concept for Consistent and Accurate Proteome Analysis
    Gillet, Ludovic C.
    Navarro, Pedro
    Tate, Stephen
    Roest, Hannes
    Selevsek, Nathalie
    Reiter, Lukas
    Bonner, Ron
    Aebersold, Ruedi
    [J]. MOLECULAR & CELLULAR PROTEOMICS, 2012, 11 (06)
  • [9] SuperQuant: A Data Processing Approach to Increase Quantitative Proteome Coverage
    Gorshkov, Vladimir
    Verano-Braga, Thiago
    Kjeldsen, Frank
    [J]. ANALYTICAL CHEMISTRY, 2015, 87 (12) : 6319 - 6327
  • [10] The complete structure of the 55S mammalian mitochondrial ribosome
    Greber, Basil J.
    Bieri, Philipp
    Leibundgut, Marc
    Leitner, Alexander
    Aebersold, Ruedi
    Boehringer, Daniel
    Ban, Nenad
    [J]. SCIENCE, 2015, 348 (6232) : 303 - 308