Machine Learning Predicts Biogeochemistry from Microbial Community Structure in a Complex Model System

被引:18
作者
Dutta, Avishek [1 ]
Goldman, Thomas [2 ]
Keating, Jeffrey [2 ]
Burke, Ellen [2 ]
Williamson, Nicole [2 ]
Dirmeier, Reinhard [2 ]
Bowman, Jeff S. [1 ,3 ]
机构
[1] Univ Calif San Diego, Integrat Oceanog Div, Scripps Inst Oceanog, La Jolla, CA 92093 USA
[2] BP Biosci Ctr, San Diego, CA USA
[3] Univ Calif San Diego, Ctr Microbiome Innovat, La Jolla, CA USA
关键词
biogeochemical state; machine learning; random forest; sulfidogenesis potential; microbial community analysis; NITRATE; SULFIDE; STRAIN;
D O I
10.1128/spectrum.01909-21
中图分类号
Q93 [微生物学];
学科分类号
071005 ; 100705 ;
摘要
Microbial community structure is influenced by the environment and in turn exerts control on many environmental parameters. We applied this concept in a bioreactor study to test whether microbial community structure contains information sufficient to predict the concentration of H2S as the product of sulfate reduction. Microbial sulfate reduction is a major source of H2S in many industrial and environmental systems and is often influenced by the existing physicochemical conditions. Production of H2S in industrial systems leads to occupational hazards and adversely affects the quality of products. A long-term (148 days) experiment was conducted in upflow bioreactors to mimic sulfidogenesis, followed by inhibition with nitrate salts and a resumption of H2S generation when inhibition was released. We determined microbial community structure in 731 samples across 20 bioreactors using 16S rRNA gene sequencing and applied a random forest algorithm to successfully predict different phases of sulfidogenesis and mitigation (accuracy = 93.17%) and sessile and effluent microbial communities (accuracy = 100%). Similarly derived regression models that also included cell abundances were able to predict H2S concentration with remarkably high fidelity (R-2 > 0.82). Metabolic profiles based on microbial community structure were also found to be reliable predictors for H2S concentration (R-2 = 0.78). These results suggest that microbial community structure contains information sufficient to predict sulfidogenesis in a closed system, with anticipated applications to microbially driven processes in open environments. IMPORTANCE Microbial communities control many biogeochemical processes. Many of these processes are impractical or expensive to measure directly. Because the taxonomic structure of the microbial community is indicative of its function, it encodes information that can be used to predict biogeochemistry. Here, we demonstrate how a machine learning technique can be used to predict sulfidogenesis, a key biogeochemical process in a model system. A distinction of this research was the ability to predict H2S production in a bioreactor from the effluent bacterial community structure without direct observations of the sessile community or other environmental conditions. This study establishes the ability to use machine learning approaches in predicting sulfide concentrations in a closed system, which can be further developed as a valuable tool for predicting biogeochemical processes in open environments. As machine learning algorithms continue to improve, we anticipate increased applications of microbial community structure to predict key environmental and industrial processes.
引用
收藏
页数:17
相关论文
共 48 条
[1]   Testing the quality of forest variable estimation using dense image matching: a comparison with airborne laser scanning in a Mediterranean pine forest [J].
Antonio Navarro, Jose ;
Fernandez-Landa, Alfredo ;
Luis Tome, Jose ;
Luz Guillen-Climent, Maria ;
Carlos Ojeda, Juan .
INTERNATIONAL JOURNAL OF REMOTE SENSING, 2018, 39 (14) :4744-4760
[2]   EPA-ng: Massively Parallel Evolutionary Placement of Genetic Sequences [J].
Barbera, Pierre ;
Kozlov, Alexey M. ;
Czech, Lucas ;
Morel, Benoit ;
Darriba, Diego ;
Flouri, Tomas ;
Stamatakis, Alexandros .
SYSTEMATIC BIOLOGY, 2019, 68 (02) :365-369
[3]   Making Sense of a Scent-Sensing Metaphor for Microbes and Environmental Predictions [J].
Bowman, Jeff S. .
MSYSTEMS, 2021, 6 (04)
[4]   Bacterial community segmentation facilitates the prediction of ecosystem function along the coast of the western Antarctic Peninsula [J].
Bowman, Jeff S. ;
Amaral-Zettler, Linda A. ;
Rich, Jeremy J. ;
Luria, Catherine M. ;
Ducklow, Hugh W. .
ISME JOURNAL, 2017, 11 (06) :1460-1471
[5]   Microbial Communities Can Be Described by Metabolic Structure: A General Framework and Application to a Seasonally Variable, Depth-Stratified Microbial Community from the Coastal West Antarctic Peninsula [J].
Bowman, Jeff S. ;
Ducklow, Hugh W. .
PLOS ONE, 2015, 10 (08)
[6]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[7]  
Callahan BJ, 2016, NAT METHODS, V13, P581, DOI [10.1038/NMETH.3869, 10.1038/nmeth.3869]
[8]   Ultra-high-throughput microbial community analysis on the Illumina HiSeq and MiSeq platforms [J].
Caporaso, J. Gregory ;
Lauber, Christian L. ;
Walters, William A. ;
Berg-Lyons, Donna ;
Huntley, James ;
Fierer, Noah ;
Owens, Sarah M. ;
Betley, Jason ;
Fraser, Louise ;
Bauer, Markus ;
Gormley, Niall ;
Gilbert, Jack A. ;
Smith, Geoff ;
Knight, Rob .
ISME JOURNAL, 2012, 6 (08) :1621-1624
[9]   SPECTROPHOTOMETRIC DETERMINATION OF HYDROGEN SULFIDE IN NATURAL WATERS [J].
CLINE, JD .
LIMNOLOGY AND OCEANOGRAPHY, 1969, 14 (03) :454-&
[10]   Genesis and Gappa: processing, analyzing and visualizing phylogenetic (placement) data [J].
Czech, Lucas ;
Barbera, Pierre ;
Stamatakis, Alexandros .
BIOINFORMATICS, 2020, 36 (10) :3263-3265