A mass accuracy sensitive probability based scoring algorithm for database searching of tandem mass spectrometry data

被引:117
作者
Xu, Hua
Freitas, Michael A. [1 ]
机构
[1] Ohio State Univ, Dept Mol Immunol Virol & Med Genet, Columbus, OH 43210 USA
[2] Ohio State Univ, Dept Chem, Columbus, OH 43210 USA
来源
BMC BIOINFORMATICS | 2007年 / 8卷
关键词
SEQUENCE DATABASES; SPECTRAL DATA; PROTEIN IDENTIFICATION; MS-MS; PEPTIDE; DISSOCIATION; VALIDATION; MS/MS; MODEL;
D O I
10.1186/1471-2105-8-133
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Liquid chromatography coupled with tandem mass spectrometry ( LC-MS/MS) has become one of the most used tools in mass spectrometry based proteomics. Various algorithms have since been developed to automate the process for modern high-throughput LC-MS/MS experiments. Results: A probability based statistical scoring model for assessing peptide and protein matches in tandem MS database search was derived. The statistical scores in the model represent the probability that a peptide match is a random occurrence based on the number or the total abundance of matched product ions in the experimental spectrum. The model also calculates probability based scores to assess protein matches. Thus the protein scores in the model reflect the significance of protein matches and can be used to differentiate true from random protein matches. Conclusion: The model is sensitive to high mass accuracy and implicitly takes mass accuracy into account during scoring. High mass accuracy will not only reduce false positives, but also improves the scores of true positive matches. The algorithm is incorporated in an automated database search program MassMatrix.
引用
收藏
页数:10
相关论文
共 32 条
[1]  
Bafna V., 2001, BIOINFORMATICS, V17, P13
[2]   Electron capture dissociation mass spectrometry in characterization of peptides and proteins [J].
Bakhtiar, Ray ;
Guan, Ziqiang .
BIOTECHNOLOGY LETTERS, 2006, 28 (14) :1047-1059
[3]   CONTRIBUTIONS OF MASS-SPECTROMETRY TO PEPTIDE AND PROTEIN-STRUCTURE [J].
BIEMANN, K .
BIOMEDICAL AND ENVIRONMENTAL MASS SPECTROMETRY, 1988, 16 (1-12) :99-111
[4]   Role of accurate mass measurement (±10 ppm) in protein identification strategies employing MS or MS MS and database searching [J].
Clauser, KR ;
Baker, P ;
Burlingame, AL .
ANALYTICAL CHEMISTRY, 1999, 71 (14) :2871-2882
[5]   OLAV: Towards high-throughput tandem mass spectrometry data identification [J].
Colinge, J ;
Masselot, A ;
Giron, M ;
Dessingy, T ;
Magnin, J .
PROTEOMICS, 2003, 3 (08) :1454-1463
[6]   De novo peptide sequencing via tandem mass spectrometry [J].
Dancík, V ;
Addona, TA ;
Clauser, KR ;
Vath, JE ;
Pevzner, PA .
JOURNAL OF COMPUTATIONAL BIOLOGY, 1999, 6 (3-4) :327-342
[7]   AN APPROACH TO CORRELATE TANDEM MASS-SPECTRAL DATA OF PEPTIDES WITH AMINO-ACID-SEQUENCES IN A PROTEIN DATABASE [J].
ENG, JK ;
MCCORMACK, AL ;
YATES, JR .
JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY, 1994, 5 (11) :976-989
[8]   Open mass spectrometry search algorithm [J].
Geer, LY ;
Markey, SP ;
Kowalak, JA ;
Wagner, L ;
Xu, M ;
Maynard, DM ;
Yang, XY ;
Shi, WY ;
Bryant, SH .
JOURNAL OF PROTEOME RESEARCH, 2004, 3 (05) :958-964
[9]   SALSA: A pattern recognition algorithm to detect electrophile-adducted peptides by automated evaluation of CID spectra in LC-MS-MS analyses [J].
Hansen, BT ;
Jones, JA ;
Mason, DE ;
Liebler, DC .
ANALYTICAL CHEMISTRY, 2001, 73 (08) :1676-1683
[10]   Intensity-based statistical scorer for tandem mass spectrometry [J].
Havilio, M ;
Haddad, Y ;
Smilansky, Z .
ANALYTICAL CHEMISTRY, 2003, 75 (03) :435-444