Coherent pipeline for biomarker discovery using mass spectrometry and bioinformatics

被引：10

作者：

Al-Shahib, Ali ^{[1
]}

Misra, Raju ^{[1
]}

Ahmod, Nadia ^{[1
]}

Fang, Min ^{[1
]}

Shah, Haroun ^{[1
]}

Gharbia, Saheer ^{[1
]}

机构：

[1] Hlth Protect Agcy, Ctr Infect, London NW9 5EQ, England

来源：

BMC BIOINFORMATICS | 2010年 / 11卷

关键词：

SEARCH ALGORITHMS; TANDEM; IDENTIFICATION;

D O I：

10.1186/1471-2105-11-437

中图分类号：

Q5 [生物化学];

学科分类号：

071010 ; 081704 ;

摘要：

Background: Robust biomarkers are needed to improve microbial identification and diagnostics. Proteomics methods based on mass spectrometry can be used for the discovery of novel biomarkers through their high sensitivity and specificity. However, there has been a lack of a coherent pipeline connecting biomarker discovery with established approaches for evaluation and validation. We propose such a pipeline that uses in silico methods for refined biomarker discovery and confirmation. Results: The pipeline has four main stages: Sample preparation, mass spectrometry analysis, database searching and biomarker validation. Using the pathogen Clostridium botulinum as a model, we show that the robustness of candidate biomarkers increases with each stage of the pipeline. This is enhanced by the concordance shown between various database search algorithms for peptide identification. Further validation was done by focusing on the peptides that are unique to C. botulinum strains and absent in phylogenetically related Clostridium species. From a list of 143 peptides, 8 candidate biomarkers were reliably identified as conserved across C. botulinum strains. To avoid discarding other unique peptides, a confidence scale has been implemented in the pipeline giving priority to unique peptides that are identified by a union of algorithms. Conclusions: This study demonstrates that implementing a coherent pipeline which includes intensive bioinformatics validation steps is vital for discovery of robust biomarkers. It also emphasises the importance of proteomics based methods in biomarker discovery.

引用

页数：9

共 22 条

[1]

Altschul SF, 1996, METHOD ENZYMOL, V266, P460

[2] BASIC LOCAL ALIGNMENT SEARCH TOOL [J].

ALTSCHUL, SF ;

GISH, W ;

MILLER, W ;

MYERS, EW ;

LIPMAN, DJ .

JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) :403-410

[3]

[Anonymous], SWISS-PROT protein database

[4] Comparative evaluation of tandem MS search algorithms using a target-decoy search strategy [J].

Balgley, Brian M. ;

Laudeman, Tom ;

Yang, Li ;

Song, Tao ;

Lee, Cheng S. .

MOLECULAR & CELLULAR PROTEOMICS, 2007, 6 (09) :1599-1608

[5] Comparison of different search engines using validated MS/MS test datasets [J].

Boutilier, K ;

Ross, M ;

Podtelejnikov, AV ;

Orsi, C ;

Taylor, R ;

Taylor, P ;

Figeys, D .

ANALYTICA CHIMICA ACTA, 2005, 534 (01) :11-20

[6] Comparison of mascot and X!Tandem performance for low and high accuracy mass spectrometry and the development of an adjusted Mascot threshold [J].

Brosch, Markus ;

Swamy, Sajani ;

Hubbard, Tim ;

Choudhary, Jyoti .

MOLECULAR & CELLULAR PROTEOMICS, 2008, 7 (05) :962-970

[7] Potential for false positive identifications from large databases through tandem mass spectrometry [J].

Cargile, BJ ;

Bundy, JL ;

Stephenson, JL .

JOURNAL OF PROTEOME RESEARCH, 2004, 3 (05) :1082-1085

[8] TANDEM: matching proteins with tandem mass spectra [J].

Craig, R ;

Beavis, RC .

BIOINFORMATICS, 2004, 20 (09) :1466-1467

[9]

ELIAS J, 2007, NATURE METHODS, V4, P2107

[10] AN APPROACH TO CORRELATE TANDEM MASS-SPECTRAL DATA OF PEPTIDES WITH AMINO-ACID-SEQUENCES IN A PROTEIN DATABASE [J].

ENG, JK ;

MCCORMACK, AL ;

YATES, JR .

JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY, 1994, 5 (11) :976-989

← 1 2 3 →