The bacterial proteogenomic pipeline

被引:15
作者
Uszkoreit, Julian [1 ]
Plohnke, Nicole [2 ]
Rexroth, Sascha [2 ]
Marcus, Katrin [1 ]
Eisenacher, Martin [1 ]
机构
[1] Ruhr Univ Bochum, Med Proteom Ctr, Bochum, Germany
[2] Ruhr Univ Bochum, Bochum, Germany
关键词
TANDEM MASS-SPECTRA; PROTEIN IDENTIFICATION; DECOY DATABASES; SPECTROMETRY; PROTEOMICS; PEPTIDES; GENOME;
D O I
10.1186/1471-2164-15-S9-S19
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background: Proteogenomics combines the cutting-edge methods from genomics and proteomics. While it has become cheap to sequence whole genomes, the correct annotation of protein coding regions in the genome is still tedious and error prone. Mass spectrometry on the other hand relies on good characterizations of proteins derived from the genome, but can also be used to help improving the annotation of genomes or find species specific peptides. Additionally, proteomics is widely used to find evidence for differential expression of proteins under different conditions, e.g. growth conditions for bacteria. The concept of proteogenomics is not altogether new, in-house scripts are used by different labs and some special tools for eukaryotic and human analyses are available. Results: The Bacterial Proteogenomic Pipeline, which is completely written in Java, alleviates the conducting of proteogenomic analyses of bacteria. From a given genome sequence, a naive six frame translation is performed and, if desired, a decoy database generated. This database is used to identify MS/MS spectra by common peptide identification algorithms. After combination of the search results and optional flagging for different experimental conditions, the results can be browsed and further inspected. In particular, for each peptide the number of identifications for each condition and the positions in the corresponding protein sequences are shown. Intermediate and final results can be exported into GFF3 format for visualization in common genome browsers. Conclusions: To facilitate proteogenomics analyses the Bacterial Proteogenomic Pipeline is a set of comprehensive tools running on common desktop computers, written in Java and thus platform independent. The pipeline allows integrating peptide identifications from various algorithms and emphasizes the visualization of spectral counts from different experimental conditions.
引用
收藏
页数:6
相关论文
共 19 条
[1]   An Integrated Mass-Spectrometry Pipeline Identifies Novel Protein Coding-Regions in the Human Genome [J].
Bitton, Danny A. ;
Smith, Duncan L. ;
Connolly, Yvonne ;
Scutt, Paul J. ;
Miller, Crispin J. .
PLOS ONE, 2010, 5 (01)
[2]   TANDEM: matching proteins with tandem mass spectra [J].
Craig, R ;
Beavis, RC .
BIOINFORMATICS, 2004, 20 (09) :1466-1467
[3]   Target-decoy search strategy for increased confidence in large-scale protein identifications by mass spectrometry [J].
Elias, Joshua E. ;
Gygi, Steven P. .
NATURE METHODS, 2007, 4 (03) :207-214
[4]   AN APPROACH TO CORRELATE TANDEM MASS-SPECTRAL DATA OF PEPTIDES WITH AMINO-ACID-SEQUENCES IN A PROTEIN DATABASE [J].
ENG, JK ;
MCCORMACK, AL ;
YATES, JR .
JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY, 1994, 5 (11) :976-989
[5]   Novel gene and gene model detection using a whole genome open reading frame analysis in proteomics [J].
Fermin, Damian ;
Allen, Baxter B. ;
Blackwell, Thomas W. ;
Menon, Rajasree ;
Adamski, Marcin ;
Xu, Yin ;
Ulintz, Peter ;
Omenn, Gilbert S. ;
States, David J. .
GENOME BIOLOGY, 2006, 7 (04)
[6]   Improving sensitivity in proteome studies by analysis of false discovery rates for multiple search engines [J].
Jones, Andrew R. ;
Siepen, Jennifer A. ;
Hubbard, Simon J. ;
Paton, Norman W. .
PROTEOMICS, 2009, 9 (05) :1220-1229
[7]   Assigning significance to peptides identified by tandem mass spectrometry using decoy databases [J].
Kaell, Lukas ;
Storey, John D. ;
MacCoss, Michael J. ;
Noble, William Stafford .
JOURNAL OF PROTEOME RESEARCH, 2008, 7 (01) :29-34
[8]   Spectral probabilities and generating functions of tandem mass spectra: A strike against decoy databases [J].
Kim, Sangtae ;
Gupta, Nitin ;
Pevzner, Pavel A. .
JOURNAL OF PROTEOME RESEARCH, 2008, 7 (08) :3354-3363
[9]   Rhizobial Adaptation to Hosts, a New Facet in the Legume Root-Nodule Symbiosis [J].
Koch, Marion ;
Delmotte, Nathanael ;
Rehrauer, Hubert ;
Vorholt, Julia A. ;
Pessi, Gabriella ;
Hennecke, Hauke .
MOLECULAR PLANT-MICROBE INTERACTIONS, 2010, 23 (06) :784-790
[10]   Proteogenomic Analysis of Bradyrhizobium japonicum USDA110 Using Genosuite, an Automated Multi-algorithmic Pipeline [J].
Kumar, Dhirendra ;
Yadav, Amit Kumar ;
Kadimi, Puneet Kumar ;
Nagaraj, Shivashankar H. ;
Grimmond, Sean M. ;
Dash, Debasis .
MOLECULAR & CELLULAR PROTEOMICS, 2013, 12 (11) :3388-3397