SAMPI:: Protein identification with mass spectra alignments

被引:7
作者
Kaltenbach, Hans-Michael
Wilke, Andreas
Boecker, Sebastian
机构
[1] Univ Bielefeld, AG Genominformat, Tech Fak, D-33501 Bielefeld, Germany
[2] Univ Chicago, Computat Inst, Chicago, IL 60637 USA
[3] Univ Jena, Lehrstuhl Bioinformat, D-07743 Jena, Germany
关键词
SPECTROMETRY DATA; ALGORITHMS; PROTEOMICS; SEARCH; TOOL;
D O I
10.1186/1471-2105-8-102
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Mass spectrometry based peptide mass fingerprints (PMFs) offer a fast, efficient, and robust method for protein identification. A protein is digested (usually by trypsin) and its mass spectrum is compared to simulated spectra for protein sequences in a database. However, existing tools for analyzing PMFs often suffer from missing or heuristic analysis of the significance of search results and insufficient handling of missing and additional peaks. Results: We present an unified framework for analyzing Peptide Mass Fingerprints that offers a number of advantages over existing methods: First, comparison of mass spectra is based on a scoring function that can be custom-designed for certain applications and explicitly takes missing and additional peaks into account. The method is able to simulate almost every additive scoring scheme. Second, we present an efficient deterministic method for assessing the significance of a protein hit, independent of the underlying scoring function and sequence database. We prove the applicability of our approach using biological mass spectrometry data and compare our results to the standard software Mascot. Conclusion: The proposed framework for analyzing Peptide Mass Fingerprints shows performance comparable to Mascot on small peak lists. Introducing more noise peaks, we are able to keep identification rates at a similar level by using the flexibility introduced by scoring schemes.
引用
收藏
页数:11
相关论文
共 24 条
[1]   Mass spectrometry-based proteomics [J].
Aebersold, R ;
Mann, M .
NATURE, 2003, 422 (6928) :198-207
[2]  
ALTSCHUL SF, 1990, J MOL BIOL, V215, P403, DOI 10.1006/jmbi.1990.9999
[3]  
[Anonymous], 1997, ALGORITHMS STRINGS T
[4]  
Bafna V., 2001, BIOINFORMATICS, V17, P13
[5]   THE SWISS-PROT PROTEIN-SEQUENCE DATA-BANK [J].
BAIROCH, A ;
BOECKMANN, B .
NUCLEIC ACIDS RESEARCH, 1992, 20 :2019-2022
[6]  
Böcker S, 2005, LECT NOTES COMPUT SC, V3537, P429
[7]   Chromatographic alignment by warping and dynamic programming as a pre-processing tool for PARAFAC modelling of liquid chromatography-mass spectrometry data [J].
Bylund, D ;
Danielsson, R ;
Malmquist, G ;
Markides, KE .
JOURNAL OF CHROMATOGRAPHY A, 2002, 961 (02) :237-244
[8]   Evaluation of algorithms for protein identification from sequence databases using mass spectrometry data [J].
Chamrad, DC ;
Körting, G ;
Stühler, K ;
Meyer, HE ;
Klose, J ;
Blüggel, M .
PROTEOMICS, 2004, 4 (03) :619-628
[9]  
Gay S, 2002, PROTEOMICS, V2, P1374, DOI 10.1002/1615-9861(200210)2:10<1374::AID-PROT1374>3.0.CO
[10]  
2-D