Rapid and accurate peptide identification from tandem mass spectra

Cited by: 142
Authors
Park, Christopher Y. [1 ]
Klammer, Aaron A. [1 ]
Kaell, Lukas [1 ]
MacCoss, Michael J. [1 ]
Noble, William S. [1 ,2 ]
Affiliations
[1] Univ Washington, Dept Genome Sci, Seattle, WA 98195 USA
[2] Univ Washington, Dept Comp Sci & Engn, Seattle, WA 98195 USA
Keywords
mass spectrometry; peptide identification; proteomics; bioinformatics
DOI
10.1021/pr800127y
Chinese Library Classification (CLC) number
Q5 [Biochemistry]
Discipline classification code
071010; 081704
Abstract
Mass spectrometry, the core technology in the field of proteomics, promises to enable scientists to identify and quantify the entire complement of proteins in a complex biological sample. Currently, the primary bottleneck in this type of experiment is computational. Existing algorithms for interpreting mass spectra are slow and fail to identify a large proportion of the given spectra. We describe a database search program called Crux that reimplements and extends the widely used database search program SEQUEST. For speed, Crux uses a peptide indexing scheme to rapidly retrieve candidate peptides for a given spectrum. For each peptide in the target database, Crux generates shuffled decoy peptides on the fly, providing a good null model and, hence, accurate false discovery rate estimates. Crux also implements two recently described postprocessing methods: a p value calculation based upon fitting a Weibull distribution to the observed scores, and a semisupervised method that learns to discriminate between target and decoy matches. Both methods significantly improve the overall rate of peptide identification. Crux is implemented in C and is distributed with source code freely to noncommercial users.
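The decoy-based false discovery rate idea mentioned in the abstract can be illustrated with a minimal Python sketch. This is not Crux's implementation (Crux is written in C, and the abstract does not give its shuffling rules). The function names `shuffle_decoy` and `estimate_fdr`, the choice to keep the first and last residues fixed, and the simple decoy/target ratio are all assumptions made for illustration.

```python
import random


def shuffle_decoy(peptide, rng):
    """Return a shuffled decoy of `peptide`.

    Assumption: the first and last residues stay in place and only the
    interior residues are permuted. The abstract does not specify this.
    """
    if len(peptide) <= 3:
        # Too short to shuffle meaningfully; return unchanged.
        return peptide
    middle = list(peptide[1:-1])
    rng.shuffle(middle)
    return peptide[0] + "".join(middle) + peptide[-1]


def estimate_fdr(target_scores, decoy_scores, threshold):
    """Estimate FDR at a score threshold as (#decoys passing) / (#targets passing).

    Assumption: one decoy per target, so decoy matches above the threshold
    approximate the number of incorrect target matches.
    """
    n_target = sum(s >= threshold for s in target_scores)
    n_decoy = sum(s >= threshold for s in decoy_scores)
    if n_target == 0:
        return 0.0
    return min(1.0, n_decoy / n_target)


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed for reproducibility
    print(shuffle_decoy("PEPTIDEK", rng))
    print(estimate_fdr([5, 4, 3, 2], [3.5, 1], threshold=3))
```

Because the decoy is a permutation of the target, it has the same amino acid composition and precursor mass, which is what makes it usable as a null model for scoring.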
Pages: 3022-3027
Page count: 6
Related papers
50 in total
  • [1] Peptide Identification from Mixture Tandem Mass Spectra
    Wang, Jian
    Perez-Santiago, Josue
    Katz, Jonathan E.
    Mallick, Parag
    Bandeira, Nuno
    MOLECULAR & CELLULAR PROTEOMICS, 2010, 9 (07) : 1476 - 1485
  • [2] Faster SEQUEST Searching for Peptide Identification from Tandem Mass Spectra
    Diament, Benjamin J.
    Noble, William Stafford
    JOURNAL OF PROTEOME RESEARCH, 2011, 10 (09) : 3871 - 3879
  • [3] Sequence database compression for peptide identification from tandem mass spectra
    Edwards, N
    Lippert, R
    ALGORITHMS IN BIOINFORMATICS, PROCEEDINGS, 2004, 3240 : 230 - 241
  • [4] Accurate Identification of Mass Peaks for Tandem Mass Spectra Using MCMC Model
    Hui Li
    Chunmei Liu
    Mugizi Robert Rwebangira
    Legand Burge
    Tsinghua Science and Technology, 2015, 20 (05) : 453 - 459
  • [5] Accurate Identification of Mass Peaks for Tandem Mass Spectra Using MCMC Model
    Li, Hui
    Liu, Chunmei
    Rwebangira, Mugizi Robert
    Burge, Legand
    TSINGHUA SCIENCE AND TECHNOLOGY, 2015, 20 (05) : 453 - 459
  • [6] Peptide Identification by Database Search of Mixture Tandem Mass Spectra
    Wang, Jian
    Bourne, Philip E.
    Bandeira, Nuno
    MOLECULAR & CELLULAR PROTEOMICS, 2011, 10 (12)
  • [7] PeakSelect: preprocessing tandem mass spectra for better peptide identification
    Zhang, Jingfen
    He, Simin
    Ling, Charles X.
    Ca, Xingjun
    Zeng, Rong
    Gao, Wen
    RAPID COMMUNICATIONS IN MASS SPECTROMETRY, 2008, 22 (08) : 1203 - 1212
  • [8] Peptide identification by tandem mass spectra: An efficient parallel searching
    Oh, JH
    Gao, J
    BIBE 2005: 5TH IEEE SYMPOSIUM ON BIOINFORMATICS AND BIOENGINEERING, 2005, : 161 - 168
  • [9] An accurate and efficient algorithm for peptide and ptm identification by tandem mass spectrometry
    Ning, Kang
    Ng, Hoong Kee
    Leong, Hon Wai
    GENOME INFORMATICS 2007, VOL 19, 2007, 19 : 119 - 130
  • [10] Model of ion intensity from tandem mass spectra for improved peptide identification and simulation
    Fazal, Z.
    Southey, B. R.
    Sadeque, A.
    Sweedler, J. V.
    Rodriguez-Zas, S. L.
    2011 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE WORKSHOPS, 2011, : 994 - 996