Identifying biologically relevant differences between metagenomic communities

被引:756
作者
Parks, Donovan H. [1 ]
Beiko, Robert G. [1 ]
机构
[1] Dalhousie Univ, Fac Comp Sci, Halifax, NS B3H 1W5, Canada
关键词
STATISTICAL SIGNIFICANCE; CONFIDENCE-INTERVALS; GUT MICROBIOME; ODDS RATIO; PROPORTIONS; RESOURCE; GENES; TOOLS;
D O I
10.1093/bioinformatics/btq041
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Metagenomics is the study of genetic material recovered directly from environmental samples. Taxonomic and functional differences between metagenomic samples can highlight the influence of ecological factors on patterns of microbial life in a wide range of habitats. Statistical hypothesis tests can help us distinguish ecological influences from sampling artifacts, but knowledge of only the P-value from a statistical hypothesis test is insufficient to make inferences about biological relevance. Current reporting practices for pairwise comparative metagenomics are inadequate, and better tools are needed for comparative metagenomic analysis. Results: We have developed a new software package, STAMP, for comparative metagenomics that supports best practices in analysis and reporting. Examination of a pair of iron mine metagenomes demonstrates that deeper biological insights can be gained using statistical techniques available in our software. An analysis of the functional potential of 'Candidatus Accumulibacter phosphatis' in two enhanced biological phosphorus removal metagenomes identified several subsystems that differ between the A. phosphatis stains in these related communities, including phosphate metabolism, secretion and metal transport.
引用
收藏
页码:715 / 721
页数:7
相关论文
共 61 条
  • [1] Abdi H., 2007, Encyclopedia of Measurement and Statistics, P651, DOI DOI 10.4135/9781412952644.N299
  • [2] Inappropriate interpretation of the odds ratio: Oddly not that uncommon
    Agrawal, D
    [J]. PEDIATRICS, 2005, 116 (06) : 1612 - 1613
  • [3] On logit confidence intervals for the odds ratio with small samples
    Agresti, A
    [J]. BIOMETRICS, 1999, 55 (02) : 597 - 602
  • [4] Agresti A., 1992, STAT SCI, V7, P131, DOI [10.1214/ss/1177011454, DOI 10.1214/SS/1177011454]
  • [5] Agresti A., 1990, CATEGORICAL DATA ANA
  • [6] The genome sequence of the psychrophilic archaeon, Methanococcoides burtonii: the role of genome evolution in cold adaptation
    Allen, Michelle A.
    Lauro, Federico M.
    Williams, Timothy J.
    Burg, Dominic
    Siddiqui, Khawar S.
    De Francisci, Davide
    Chong, Kevin W. Y.
    Pilak, Oliver
    Chew, Hwee H.
    De Maere, Matthew Z.
    Ting, Lily
    Katrib, Marilyn
    Ng, Charmaine
    Sowers, Kevin R.
    Galperin, Michael Y.
    Anderson, Iain J.
    Ivanova, Natalia
    Dalin, Eileen
    Martinez, Michele
    Lapidus, Alla
    Hauser, Loren
    Land, Miriam
    Thomas, Torsten
    Cavicchioli, Ricardo
    [J]. ISME JOURNAL, 2009, 3 (09) : 1012 - 1035
  • [7] BARNARD GA, 1947, BIOMETRIKA, V34, P123, DOI 10.1093/biomet/34.1-2.123
  • [8] ON ALLEGED GAINS IN POWER FROM LOWER P-VALUES
    BARNARD, GA
    [J]. STATISTICS IN MEDICINE, 1989, 8 (12) : 1469 - 1477
  • [9] Bacterial rhodopsin:: Evidence for a new type of phototrophy in the sea
    Béjà, O
    Aravind, L
    Koonin, EV
    Suzuki, MT
    Hadd, A
    Nguyen, LP
    Jovanovich, S
    Gates, CM
    Feldman, RA
    Spudich, JL
    Spudich, EN
    DeLong, EF
    [J]. SCIENCE, 2000, 289 (5486) : 1902 - 1906
  • [10] CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING
    BENJAMINI, Y
    HOCHBERG, Y
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) : 289 - 300