PTMiner: Localization and Quality Control of Protein Modifications Detected in an Open Search and Its Application to Comprehensive Post-translational Modification Characterization in Human Proteome

被引:30
作者
An, Zhiwu [1 ,5 ]
Zhai, Linhui [2 ]
Ying, Wantao [3 ,4 ]
Qian, Xiaohong [3 ,4 ]
Gong, Fuzhou [1 ,5 ]
Tan, Minjia [2 ]
Fu, Yan [1 ,5 ]
机构
[1] Chinese Acad Sci, Acad Math & Syst Sci, Natl Ctr Math & Interdisciplinary Sci, Key Lab Random Complex Struct & Data Sci, Beijing 100190, Peoples R China
[2] Chinese Acad Sci, Shanghai Inst Mat Med, State Key Lab Drug Res, Shanghai 201203, Peoples R China
[3] Natl Engn Res Ctr Prot Drugs, Beijing Proteome Res Ctr, Natl Ctr Prot Sci Beijing, State Key Lab Prote, Beijing 102206, Peoples R China
[4] Beijing Inst Life, Beijing 100850, Peoples R China
[5] Univ Chinese Acad Sci, Sch Math Sci, Beijing 100049, Peoples R China
关键词
PHOSPHORYLATION SITE LOCALIZATION; MASS-SPECTROMETRY; PEPTIDE IDENTIFICATION; SHOTGUN PROTEOMICS; TANDEM; MS/MS; ALGORITHM; DISCOVERY; SOFTWARE; ACCURATE;
D O I
10.1074/mcp.RA118.000812
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The open (mass tolerant) search of tandem mass spectra of peptides shows great potential in the comprehensive detection of post-translational modifications (PTMs) in shotgun proteomics. However, this search strategy has not been widely used by the community, and one bottleneck of it is the lack of appropriate algorithms for automated and reliable post-processing of the coarse and error-prone search results. Here we present PTMiner, a software tool for confident filtering and localization of modifications (mass shifts) detected in an open search. After mass-shift-grouped false discovery rate (FDR) control of peptide-spectrum matches (PSMs), PTMiner uses an empirical Bayesian method to localize modifications through iterative learning of the prior probabilities of each type of modification occurring on different amino acids. The performance of PTMiner was evaluated on three data sets, including simulated data, chemically synthesized peptide library data and modified-peptide spiked-in proteome data. The results showed that PTMiner can effectively control the PSM FDR and accurately localize the modification sites. At 1% real false localization rate (FLR), PTMiner localized 93%, 84 and 83% of the modification sites in the three data sets, respectively, far higher than two open search engines we used and an extended version of the Ascore localization algorithm. We then used PTMiner to analyze a draft map of human proteome containing 25 million spectra from 30 tissues, and confidently identified over 1.7 million modified PSMs at 1% FDR and 1% FLR, which provided a system-wide view of both known and unknown PTMs in the human proteome.
引用
收藏
页码:391 / 405
页数:15
相关论文
共 73 条
[1]   QuickMod: A Tool for Open Modification Spectrum Library Searches [J].
Ahrne, Erik ;
Nikitin, Frederic ;
Lisacek, Frederique ;
Mueller, Markus .
JOURNAL OF PROTEOME RESEARCH, 2011, 10 (07) :2913-2921
[2]   A multidimensional chromatography technology for in-depth phosphoproteome analysis [J].
Albuquerque, Claudio P. ;
Smolka, Marcus B. ;
Payne, Samuel H. ;
Bafna, Vineet ;
Eng, Jimmy ;
Zhou, Huilin .
MOLECULAR & CELLULAR PROTEOMICS, 2008, 7 (07) :1389-1396
[3]   SLoMo: Automated Site Localization of Modifications from ETD/ECD Mass Spectra [J].
Bailey, Christopher M. ;
Sweet, Steve M. M. ;
Cunningham, Debbie L. ;
Zeller, Martin ;
Heath, John K. ;
Cooper, Helen J. .
JOURNAL OF PROTEOME RESEARCH, 2009, 8 (04) :1965-1971
[4]   MS-Viewer: A Web-based Spectral Viewer for Proteomics Results [J].
Baker, Peter R. ;
Chalkley, Robert J. .
MOLECULAR & CELLULAR PROTEOMICS, 2014, 13 (05) :1392-1396
[5]   Modification Site Localization Scoring Integrated into a Search Engine [J].
Baker, Peter R. ;
Trinidad, Jonathan C. ;
Chalkley, Robert J. .
MOLECULAR & CELLULAR PROTEOMICS, 2011, 10 (07)
[6]   Protein identification by spectral networks analysis [J].
Bandeira, Nuno ;
Tsur, Dekel ;
Frank, Ari ;
Pevzner, Pavel A. .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2007, 104 (15) :6140-6145
[7]   SeMoP: A new computational strategy for the unrestricted search for modified peptides using LC-MS/MS data [J].
Baumgartner, Christian ;
Rejtar, Tomas ;
Kullolli, Majlinda ;
Akella, Lakshmi Manohar ;
Karger, Barry L. .
JOURNAL OF PROTEOME RESEARCH, 2008, 7 (09) :4199-4208
[8]   A probability-based approach for high-throughput protein phosphorylation analysis and site localization [J].
Beausoleil, Sean A. ;
Villen, Judit ;
Gerber, Scott A. ;
Rush, John ;
Gygi, Steven P. .
NATURE BIOTECHNOLOGY, 2006, 24 (10) :1285-1292
[9]   Comprehensive analysis of a multidimensional liquid chromatography mass spectrometry dataset acquired on a quadrupole selecting, quadrupole collision cell, time-of-flight mass spectrometer - I. How much of the data is theoretically interpretable by search engines? [J].
Chalkley, RJ ;
Baker, PR ;
Hansen, KC ;
Medzihradszky, KF ;
Allen, NP ;
Rexach, M ;
Burlingame, AL .
MOLECULAR & CELLULAR PROTEOMICS, 2005, 4 (08) :1189-1193
[10]   Modification Site Localization Scoring: Strategies and Performance [J].
Chalkley, Robert J. ;
Clauser, Karl R. .
MOLECULAR & CELLULAR PROTEOMICS, 2012, 11 (05) :3-14