Dinosaur: A Refined Open-Source Peptide MS Feature Detector

被引:48
作者
Teleman, Johan [1 ,2 ]
Chawade, Aakash [1 ]
Sandin, Marianne [1 ]
Levander, Fredrik [1 ,3 ]
Malmstrom, Johan [2 ]
机构
[1] Lund Univ, Dept Immunotechnol, S-22383 Lund, Sweden
[2] Lund Univ, Dept Clin Sci, S-22100 Lund, Sweden
[3] Lund Univ, BILS, S-22383 Lund, Sweden
基金
欧洲研究理事会; 瑞典研究理事会;
关键词
proteomics; mass spectrometry; electrospray ionization; feature detection; chimeric spectra; algorithm; software; MASS-SPECTROMETRIC DATA; OPEN-SOURCE SOFTWARE; LIQUID-CHROMATOGRAPHY; MONOISOTOPIC MASSES; IDENTIFICATION; PROTEOME; FRAMEWORK; ALIGNMENT; WORKFLOW; TOOLS;
D O I
10.1021/acs.jproteome.6b00016
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
In bottom-up mass spectrometry (MS)-based proteomics, peptide isotopic and chromatographic traces (features) are frequently used for label-free quantification in data-dependent acquisition MS but can also be used for the improved identification of chimeric spectra or sample complexity characterization. Feature detection is difficult because of the high complexity of MS proteomics data from biological samples, which frequently causes features to intermingle. In addition, existing feature detection algorithms commonly suffer from compatibility issues, long computation times, or poor performance on high-resolution data. Because of these limitations, we developed a new tool, Dinosaur, with increased speed and versatility. Dinosaur has the functionality to sample algorithm computations through quality-control plots, which we call a plot trail. From the evaluation of this plot trail, we introduce several algorithmic improvements to further improve the robustness and performance of Dinosaur, with the detection of features for 98% of MS/MS identifications in a benchmark data set, and no other algorithm tested in this study passed 96% feature detection. We finally used Dinosaur to reimplement a published workflow for peptide identification in chimeric spectra, increasing chimeric identification from 26% to 32% over the standard workflow. Dinosaur is operating-system-independent and is freely available as open source on https://github.com/fickludd/dinosaur.
引用
收藏
页码:2143 / 2151
页数:9
相关论文
共 45 条
  • [11] The Proteios Software Environment: An Extensible Multiuser Platform for Management and Analysis of Proteomics Data
    Hakkinen, Jari
    Vincic, Gregory
    Mansson, Olle
    Warell, Kristofer
    Levander, Fredrik
    [J]. JOURNAL OF PROTEOME RESEARCH, 2009, 8 (06) : 3037 - 3043
  • [12] Interaction proteome of human Hippo signaling: modular control of the co-activator YAP1
    Hauri, Simon
    Wepf, Alexander
    van Drogen, Audrey
    Varjosalo, Markku
    Tapon, Nic
    Aebersold, Ruedi
    Gstaiger, Matthias
    [J]. MOLECULAR SYSTEMS BIOLOGY, 2013, 9
  • [13] A Human Interactome in Three Quantitative Dimensions Organized by Stoichiometries and Abundances
    Hein, Marco Y.
    Hubner, Nina C.
    Poser, Ina
    Cox, Juergen
    Nagaraj, Nagarjuna
    Toyoda, Yusuke
    Gak, Igor A.
    Weisswange, Ina
    Mansfeld, Joerg
    Buchholz, Frank
    Hyman, Anthony A.
    Mann, Matthias
    [J]. CELL, 2015, 163 (03) : 712 - 723
  • [14] High-speed data reduction, feature detection and MS/MS spectrum quality assessment of shotgun proteomics data sets using high-resolution mass Spectrometry
    Hoopmann, Michael R.
    Finney, Gregory L.
    MacCoss, Michael J.
    [J]. ANALYTICAL CHEMISTRY, 2007, 79 (15) : 5620 - 5632
  • [15] TOPPAS: A Graphical Workflow Editor for the Analysis of High-Throughput Proteomics Data
    Junker, Johannes
    Bielow, Chris
    Bertsch, Andreas
    Sturm, Marc
    Reinert, Knut
    Kohlbachert, Oliver
    [J]. JOURNAL OF PROTEOME RESEARCH, 2012, 11 (07) : 3914 - 3920
  • [16] Proteome-wide selected reaction monitoring assays for the human pathogen Streptococcus pyogenes
    Karlsson, Christofer
    Malmstroem, Lars
    Aebersold, Ruedi
    Malmstroem, Johan
    [J]. NATURE COMMUNICATIONS, 2012, 3
  • [17] Data processing for mass spectrometry-based metabolomics
    Katajamaa, Mikko
    Oresic, Matej
    [J]. JOURNAL OF CHROMATOGRAPHY A, 2007, 1158 (1-2) : 318 - 328
  • [18] ProteoWizard: open source software for rapid proteomics tools development
    Kessner, Darren
    Chambers, Matt
    Burke, Robert
    Agusand, David
    Mallick, Parag
    [J]. BIOINFORMATICS, 2008, 24 (21) : 2534 - 2536
  • [19] MS-GF plus makes progress towards a universal database search tool for proteomics
    Kim, Sangtae
    Pevzner, Pavel A.
    [J]. NATURE COMMUNICATIONS, 2014, 5
  • [20] Statistical and computational methods for comparative proteomic profiling using liquid chromatography-tandem mass spectrometry
    Listgarten, J
    Emili, A
    [J]. MOLECULAR & CELLULAR PROTEOMICS, 2005, 4 (04) : 419 - 434