Metaproteomic analysis using the Galaxy framework

被引:60
作者
Jagtap, Pratik D. [1 ,2 ]
Blakely, Alan [3 ]
Murray, Kevin [2 ]
Stewart, Shaun [4 ]
Kooren, Joel [2 ]
Johnson, James E. [5 ]
Rhodus, Nelson L. [6 ]
Rudney, Joel [6 ]
Griffin, Timothy J. [1 ,2 ]
机构
[1] Univ Minnesota, Ctr Mass Spectrometry & Prote, Minneapolis, MN 55455 USA
[2] Univ Minnesota, Dept Biochem Mol Biol & Biophys, Minneapolis, MN 55455 USA
[3] Hamline Univ, St Paul, MN USA
[4] Carleton Coll, Northfield, MN 55057 USA
[5] Minnesota Supercomp Inst, Minneapolis, MN USA
[6] Univ Minnesota, Sch Dent, Minneapolis, MN 55455 USA
基金
美国国家科学基金会;
关键词
Bioinformatics; Customized database generation; Mass spectrometry; Metaproteomics; Peptide sequence match; Sequence database search; PROTEIN EXTRACTION; REVEALS; MICROBIOTA; COMMUNITY; IDENTIFICATION; LOCALIZATION; ADAPTATION; EXPRESSION; TISSUES; SOILS;
D O I
10.1002/pmic.201500074
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Metaproteomics characterizes proteins expressed by microorganism communities (microbiome) present in environmental samples or a host organism (e.g. human), revealing insights into the molecular functions conferred by these communities. Compared to conventional proteomics, metaproteomics presents unique data analysis challenges, including the use of large protein databases derived from hundreds or thousands of organisms, as well as numerous processing steps to ensure high data quality. These challenges limit the use of metaproteomics for many researchers. In response, we have developed an accessible and flexible metaproteomics workflow within the Galaxy bioinformatics framework. Via analysis of human oral tissue exudate samples, we have established a modular Galaxy-based workflow that automates a reduction method for searching large sequence databases, enabling comprehensive identification of host proteins (human) as well as meta-proteins from the nonhost organisms. Downstream, automated processing steps enable basic local alignment search tool analysis and evaluation/visualization of peptide sequence match quality, maximizing confidence in results. Outputted results are compatible with tools for taxonomic and functional characterization (e.g. Unipept, MEGAN5). Galaxy also allows for the sharing of complete workflows with others, promoting reproducibility and also providing a template for further modification and enhancement. Our results provide a blueprint for establishing Galaxy as a solution for metaproteomic data analysis. All MS data have been deposited in the ProteomeXchange with identifier PXD001655 ().
引用
收藏
页码:3553 / 3565
页数:13
相关论文
共 66 条
  • [1] A metaproteomic approach gives functional insights into anaerobic digestion
    Abram, F.
    Enright, A. -M.
    O'Reilly, J.
    Botting, C. H.
    Collins, G.
    O'Flaherty, V.
    [J]. JOURNAL OF APPLIED MICROBIOLOGY, 2011, 110 (06) : 1550 - 1560
  • [2] Optimisation of protein extraction and 2-DE for metaproteomics of microbial communities from anaerobic wastewater treatment biofilms
    Abram, Florence
    Gunnigle, Eoin
    O'Flaherty, Vincent
    [J]. ELECTROPHORESIS, 2009, 30 (23) : 4149 - 4151
  • [3] Digging into the extracellular matrix of a complex microbial community using a combined metagenomic and metaproteomic approach
    Albertsen, M.
    Stensballe, A.
    Nielsen, K. L.
    Nielsen, P. H.
    [J]. WATER SCIENCE AND TECHNOLOGY, 2013, 67 (07) : 1650 - 1656
  • [4] A Dynamic Range Compression and Three-Dimensional Peptide Fractionation Analysis Platform Expands Proteome Coverage and the Diagnostic Potential of Whole Saliva
    Bandhakavi, Sricharan
    Stone, Matthew D.
    Onsongo, Getiria
    Van Riper, Susan K.
    Griffin, Timothy J.
    [J]. JOURNAL OF PROTEOME RESEARCH, 2009, 8 (12) : 5590 - 5600
  • [5] Metaproteomic Identification of Diazotrophic Methanotrophs and Their Localization in Root Tissues of Field-Grown Rice Plants
    Bao, Zhihua
    Okubo, Takashi
    Kubota, Kengo
    Kasahara, Yasuhiro
    Tsurumaru, Hirohito
    Anda, Mizue
    Ikeda, Seishi
    Minamisawa, Kiwamu
    [J]. APPLIED AND ENVIRONMENTAL MICROBIOLOGY, 2014, 80 (16) : 5043 - 5052
  • [6] Metaproteomics of soils from semiarid environment: Functional and phylogenetic information obtained with different protein extraction methods
    Bastida, F.
    Hernandez, T.
    Garcia, C.
    [J]. JOURNAL OF PROTEOMICS, 2014, 101 : 31 - 42
  • [7] Addressing Statistical Biases in Nucleotide-Derived Protein Databases for Proteogenomic Search Strategies
    Blakeley, Paul
    Overton, Ian M.
    Hubbard, Simon J.
    [J]. JOURNAL OF PROTEOME RESEARCH, 2012, 11 (11) : 5221 - 5234
  • [8] Multi-omic data analysis using Galaxy
    Boekel, Jorrit
    Chilton, John M.
    Cooke, Ira R.
    Horvatovich, Peter L.
    Jagtap, Pratik D.
    Kall, Lukas
    Lehtio, Janne
    Lukasse, Pieter
    Moerland, Perry D.
    Griffin, Timothy J.
    [J]. NATURE BIOTECHNOLOGY, 2015, 33 (02) : 137 - 139
  • [9] Bacterial colonization of microbial biofilms in oral squamous cell carcinoma
    Bolz, J.
    Dosa, E.
    Schubert, J.
    Eckert, A. W.
    [J]. CLINICAL ORAL INVESTIGATIONS, 2014, 18 (02) : 409 - 414
  • [10] Metaproteogenomic analysis of a sulfate-reducing enrichment culture reveals genomic organization of key enzymes in the m-xylene degradation pathway and metabolic activity of proteobacteria
    Bozinovski, Dragana
    Taubert, Martin
    Kleinsteuber, Sabine
    Richnow, Hans-Hermann
    von Bergen, Martin
    Vogt, Carsten
    Seifert, Jana
    [J]. SYSTEMATIC AND APPLIED MICROBIOLOGY, 2014, 37 (07) : 488 - 501