Improved Protein Inference from Multiple Protease Bottom-Up Mass Spectrometry Data

被引:33
作者
Miller, Rachel M. [1 ]
Millikin, Robert J. [1 ]
Hoffmann, Connor V. [1 ]
Solntsev, Stefan K. [1 ]
Sheynkman, Gloria M. [1 ]
Shortreed, Michael R. [1 ]
Smith, Lloyd M. [1 ]
机构
[1] Univ Wisconsin, Dept Chem, 1101 Univ Ave, Madison, WI 53706 USA
关键词
mass spectrometry; bottom-up; multiple proteases; data-dependent acquisition; protein inference; STATISTICAL-MODEL; MS/MS; IDENTIFICATIONS; SENSITIVITY; PROTEOMICS;
D O I
10.1021/acs.jproteome.9b00330
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Peptides detected by tandem mass spectrometry (MS/MS) in bottom-up proteomics serve as proxies for the proteins expressed in the sample. Protein inference is a process routinely applied to these peptides to generate a plausible list of candidate protein identifications. The use of multiple proteases for parallel protein digestions expands sequence coverage, provides additional peptide identifications, and increases the probability of identifying peptides that are unique to a single protein, which are all valuable for protein inference. We have developed and implemented a multi-protease protein inference algorithm in MetaMorpheus, a bottom-up search software program, which incorporates the calculation of protease-specific q-values and preserves the association of peptide sequences and their protease of origin. This integrated multi-protease protein inference algorithm provides more accurate results than either the aggregation of results from the separate analysis of the peptide identifications produced by each protease (separate approach) in MetaMorpheus, or results that are obtained using Fido, ProteinProphet, or DTASelect2. MetaMorpheus' integrated multi-protease data analysis decreases the ambiguity of the protein group list, reduces the frequency of erroneous identifications, and increases the number of post-translational modifications identified, while combining multi-protease search and protein inference into a single software program.
引用
收藏
页码:3429 / 3438
页数:10
相关论文
共 23 条
[1]  
Edwards N., 2009, CLIN PROTEOM, V5, P23, DOI [DOI 10.1007/s12014-009-9024-5, 10.1007/s12014-009-9024-5, DOI 10.1007/S12014-009-9024-5]
[2]   Comet: An open-source MS/MS sequence database search tool [J].
Eng, Jimmy K. ;
Jahan, Tahmina A. ;
Hoopmann, Michael R. .
PROTEOMICS, 2013, 13 (01) :22-24
[3]   Using the entrapment sequence method as a standard to evaluate key steps of proteomics data analysis process [J].
Feng, Xiao-dong ;
Li, Li-wei ;
Zhang, Jian-hong ;
Zhu, Yun-ping ;
Chang, Cheng ;
Shu, Kun-xian ;
Ma, Jie .
BMC GENOMICS, 2017, 18
[4]   Confetti: A Multiprotease Map of the HeLa Proteome for Comprehensive Proteomics [J].
Guo, Xiaofeng ;
Trudgian, David C. ;
Lemoff, Andrew ;
Yadavalli, Sivaramakrishna ;
Mirzaei, Hamid .
MOLECULAR & CELLULAR PROTEOMICS, 2014, 13 (06) :1573-1584
[5]   Protein inference: a review [J].
Huang, Ting ;
Wang, Jingjing ;
Yu, Weichuan ;
He, Zengyou .
BRIEFINGS IN BIOINFORMATICS, 2012, 13 (05) :586-614
[6]   Empirical statistical model to estimate the accuracy of peptide identifications made by MS/MS and database search [J].
Keller, A ;
Nesvizhskii, AI ;
Kolker, E ;
Aebersold, R .
ANALYTICAL CHEMISTRY, 2002, 74 (20) :5383-5392
[7]   Global Post-Translational Modification Discovery [J].
Li, Qiyao ;
Shortreed, Michael R. ;
Wenger, Craig D. ;
Frey, Brian L. ;
Schaffer, Leah V. ;
Scalf, Mark ;
Smith, Lloyd M. .
JOURNAL OF PROTEOME RESEARCH, 2017, 16 (04) :1383-1390
[8]   Interpretation of shotgun proteomic data - The protein inference problem [J].
Nesvizhskii, AI ;
Aebersold, R .
MOLECULAR & CELLULAR PROTEOMICS, 2005, 4 (10) :1419-1440
[9]   A statistical model for identifying proteins by tandem mass spectrometry [J].
Nesvizhskii, AI ;
Keller, A ;
Kolker, E ;
Aebersold, R .
ANALYTICAL CHEMISTRY, 2003, 75 (17) :4646-4658
[10]   Improving sensitivity by probabilistically combining results from multiple MS/MS search methodologies [J].
Searle, Brian C. ;
Turner, Mark ;
Nesvizhskii, Alexey I. .
JOURNAL OF PROTEOME RESEARCH, 2008, 7 (01) :245-253