Coherent pipeline for biomarker discovery using mass spectrometry and bioinformatics

被引:10
作者
Al-Shahib, Ali [1 ]
Misra, Raju [1 ]
Ahmod, Nadia [1 ]
Fang, Min [1 ]
Shah, Haroun [1 ]
Gharbia, Saheer [1 ]
机构
[1] Hlth Protect Agcy, Ctr Infect, London NW9 5EQ, England
来源
BMC BIOINFORMATICS | 2010年 / 11卷
关键词
SEARCH ALGORITHMS; TANDEM; IDENTIFICATION;
D O I
10.1186/1471-2105-11-437
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Robust biomarkers are needed to improve microbial identification and diagnostics. Proteomics methods based on mass spectrometry can be used for the discovery of novel biomarkers through their high sensitivity and specificity. However, there has been a lack of a coherent pipeline connecting biomarker discovery with established approaches for evaluation and validation. We propose such a pipeline that uses in silico methods for refined biomarker discovery and confirmation. Results: The pipeline has four main stages: Sample preparation, mass spectrometry analysis, database searching and biomarker validation. Using the pathogen Clostridium botulinum as a model, we show that the robustness of candidate biomarkers increases with each stage of the pipeline. This is enhanced by the concordance shown between various database search algorithms for peptide identification. Further validation was done by focusing on the peptides that are unique to C. botulinum strains and absent in phylogenetically related Clostridium species. From a list of 143 peptides, 8 candidate biomarkers were reliably identified as conserved across C. botulinum strains. To avoid discarding other unique peptides, a confidence scale has been implemented in the pipeline giving priority to unique peptides that are identified by a union of algorithms. Conclusions: This study demonstrates that implementing a coherent pipeline which includes intensive bioinformatics validation steps is vital for discovery of robust biomarkers. It also emphasises the importance of proteomics based methods in biomarker discovery.
引用
收藏
页数:9
相关论文
共 22 条
  • [1] Altschul SF, 1996, METHOD ENZYMOL, V266, P460
  • [2] BASIC LOCAL ALIGNMENT SEARCH TOOL
    ALTSCHUL, SF
    GISH, W
    MILLER, W
    MYERS, EW
    LIPMAN, DJ
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) : 403 - 410
  • [3] [Anonymous], SWISS-PROT protein database
  • [4] Comparative evaluation of tandem MS search algorithms using a target-decoy search strategy
    Balgley, Brian M.
    Laudeman, Tom
    Yang, Li
    Song, Tao
    Lee, Cheng S.
    [J]. MOLECULAR & CELLULAR PROTEOMICS, 2007, 6 (09) : 1599 - 1608
  • [5] Comparison of different search engines using validated MS/MS test datasets
    Boutilier, K
    Ross, M
    Podtelejnikov, AV
    Orsi, C
    Taylor, R
    Taylor, P
    Figeys, D
    [J]. ANALYTICA CHIMICA ACTA, 2005, 534 (01) : 11 - 20
  • [6] Comparison of mascot and X!Tandem performance for low and high accuracy mass spectrometry and the development of an adjusted Mascot threshold
    Brosch, Markus
    Swamy, Sajani
    Hubbard, Tim
    Choudhary, Jyoti
    [J]. MOLECULAR & CELLULAR PROTEOMICS, 2008, 7 (05) : 962 - 970
  • [7] Potential for false positive identifications from large databases through tandem mass spectrometry
    Cargile, BJ
    Bundy, JL
    Stephenson, JL
    [J]. JOURNAL OF PROTEOME RESEARCH, 2004, 3 (05) : 1082 - 1085
  • [8] TANDEM: matching proteins with tandem mass spectra
    Craig, R
    Beavis, RC
    [J]. BIOINFORMATICS, 2004, 20 (09) : 1466 - 1467
  • [9] ELIAS J, 2007, NATURE METHODS, V4, P2107
  • [10] AN APPROACH TO CORRELATE TANDEM MASS-SPECTRAL DATA OF PEPTIDES WITH AMINO-ACID-SEQUENCES IN A PROTEIN DATABASE
    ENG, JK
    MCCORMACK, AL
    YATES, JR
    [J]. JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY, 1994, 5 (11) : 976 - 989