Contamination source modeling with SCRuB improves cancer phenotype prediction from microbiome data

被引:30
作者
Austin, George I. I. [1 ,2 ]
Park, Heekuk [3 ]
Meydan, Yoli [2 ]
Seeram, Dwayne [3 ]
Sezin, Tanya [4 ]
Lou, Yue Clare [5 ]
Firek, Brian A. A. [6 ]
Morowitz, Michael J. J. [6 ]
Banfield, Jillian F. F. [7 ,8 ,9 ,10 ]
Christiano, Angela M. M. [4 ,11 ]
Pe'er, Itsik [1 ,2 ,12 ]
Uhlemann, Anne-Catrin [3 ]
Shenhav, Liat [13 ]
Korem, Tal [2 ,14 ,15 ]
机构
[1] Columbia Univ, Dept Comp Sci, New York, NY USA
[2] Columbia Univ, Irving Med Ctr, Dept Syst Biol, Program Math Genom, New York, NY 10027 USA
[3] Columbia Univ, Irving Med Ctr, Div Infect Dis, New York, NY USA
[4] Columbia Univ, Irving Med Ctr, Dept Dermatol, New York, NY USA
[5] Univ Calif, Dept Plant & Microbial Biol, Berkeley, CA USA
[6] Univ Pittsburgh, Sch Med, Dept Surg, Pittsburgh, PA USA
[7] Univ Calif Berkeley, Dept Earth & Planetary Sci, Berkeley, CA USA
[8] Univ Calif Berkeley, Dept Environm Sci Policy & Management, Berkeley, CA USA
[9] Univ Calif Berkeley, Innovat Genom Inst, Berkeley, CA USA
[10] Chan Zuckerberg Biohub, San Francisco, CA USA
[11] Columbia Univ, Irving Med Ctr, Dept Genet & Dev, New York, NY USA
[12] Columbia Univ, Data Sci Inst, New York, NY USA
[13] Rockefeller Univ, Ctr Studies Phys & Biol, New York, NY 10065 USA
[14] Columbia Univ, Irving Med Ctr, Dept Obstet & Gynecol, New York, NY 10027 USA
[15] CIFAR Azrieli Global Scholars program, Toronto, ON, Canada
关键词
TUMOR;
D O I
10.1038/s41587-023-01696-w
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Sequencing-based approaches for the analysis of microbial communities are susceptible to contamination, which could mask biological signals or generate artifactual ones. Methods for in silico decontamination using controls are routinely used, but do not make optimal use of information shared across samples and cannot handle taxa that only partially originate in contamination or leakage of biological material into controls. Here we present Source tracking for Contamination Removal in microBiomes (SCRuB), a probabilistic in silico decontamination method that incorporates shared information across multiple samples and controls to precisely identify and remove contamination. We validate the accuracy of SCRuB in multiple data-driven simulations and experiments, including induced contamination, and demonstrate that it outperforms state-of-the-art methods by an average of 15-20 times. We showcase the robustness of SCRuB across multiple ecosystems, data types and sequencing depths. Demonstrating its applicability to microbiome research, SCRuB facilitates improved predictions of host phenotypes, most notably the prediction of treatment response in melanoma patients using decontaminated tumor microbiome data. Modeling contamination sources over multiple samples enhances the analysis of microbiome data.
引用
收藏
页码:1820 / +
页数:30
相关论文
共 60 条
[1]   The Placenta Harbors a Unique Microbiome [J].
Aagaard, Kjersti ;
Ma, Jun ;
Antony, Kathleen M. ;
Ganu, Radhika ;
Petrosino, Joseph ;
Versalovic, James .
SCIENCE TRANSLATIONAL MEDICINE, 2014, 6 (237)
[2]   Microbiota of the indoor environment: a meta-analysis [J].
Adams, Rachel I. ;
Bateman, Ashley C. ;
Bik, Holly M. ;
Meadow, James F. .
MICROBIOME, 2015, 3 :49
[3]   Gut microbiome compositional and functional differences between tumor and non-tumor adjacent tissues from cohorts from the US and Spain [J].
Allali, Imane ;
Delgado, Susana ;
Marron, Pablo Isidro ;
Astudillo, Aurora ;
Yeh, Jen Jen ;
Ghazal, Hassan ;
Amzazi, Saaid ;
Keku, Temitope ;
Azcarate-Peril, M. Andrea .
GUT MICROBES, 2015, 6 (03) :161-172
[4]   STENSL: Microbial Source Tracking with ENvironment SeLection [J].
An, Ulzee ;
Shenhav, Liat ;
Olson, Christine A. ;
Hsiao, Elaine Y. ;
Halperin, Eran ;
Sankararaman, Sriram .
MSYSTEMS, 2022, 7 (05)
[5]   Oral and Gut Microbial Diversity and Immune Regulation in Patients with HIV on Antiretroviral Therapy [J].
Annavajhala, Medini K. ;
Khan, Sabrina D. ;
Sullivan, Sean B. ;
Shah, Jayesh ;
Pass, Lauren ;
Kister, Karolina ;
Kunen, Heather ;
Chiang, Victor ;
Monnot, Gwennaelle C. ;
Ricupero, Christopher L. ;
Mazur, Rebecca A. ;
Gordon, Peter ;
de Jong, Annemieke ;
Wadhwa, Sunil ;
Yin, Michael T. ;
Demmer, Ryan T. ;
Uhlemann, Anne-Catrin .
MSPHERE, 2020, 5 (01)
[6]  
Austin, 2023, SCRUB
[7]  
Austin, 2022, CONTAMINATION BENCHM
[8]   Reproducible, interactive, scalable and extensible microbiome data science using QIIME 2 [J].
Bolyen, Evan ;
Rideout, Jai Ram ;
Dillon, Matthew R. ;
Bokulich, NicholasA. ;
Abnet, Christian C. ;
Al-Ghalith, Gabriel A. ;
Alexander, Harriet ;
Alm, Eric J. ;
Arumugam, Manimozhiyan ;
Asnicar, Francesco ;
Bai, Yang ;
Bisanz, Jordan E. ;
Bittinger, Kyle ;
Brejnrod, Asker ;
Brislawn, Colin J. ;
Brown, C. Titus ;
Callahan, Benjamin J. ;
Caraballo-Rodriguez, Andres Mauricio ;
Chase, John ;
Cope, Emily K. ;
Da Silva, Ricardo ;
Diener, Christian ;
Dorrestein, Pieter C. ;
Douglas, Gavin M. ;
Durall, Daniel M. ;
Duvallet, Claire ;
Edwardson, Christian F. ;
Ernst, Madeleine ;
Estaki, Mehrbod ;
Fouquier, Jennifer ;
Gauglitz, Julia M. ;
Gibbons, Sean M. ;
Gibson, Deanna L. ;
Gonzalez, Antonio ;
Gorlick, Kestrel ;
Guo, Jiarong ;
Hillmann, Benjamin ;
Holmes, Susan ;
Holste, Hannes ;
Huttenhower, Curtis ;
Huttley, Gavin A. ;
Janssen, Stefan ;
Jarmusch, Alan K. ;
Jiang, Lingjing ;
Kaehler, Benjamin D. ;
Bin Kang, Kyo ;
Keefe, Christopher R. ;
Keim, Paul ;
Kelley, Scott T. ;
Knights, Dan .
NATURE BIOTECHNOLOGY, 2019, 37 (08) :852-857
[9]  
Callahan BJ, 2016, NAT METHODS, V13, P581, DOI [10.1038/NMETH.3869, 10.1038/nmeth.3869]
[10]   Geography and Location Are the Primary Drivers of Office Microbiome Composition [J].
Chase, John ;
Fouquier, Jennifer ;
Zare, Mahnaz ;
Sonderegger, Derek L. ;
Knight, Rob ;
Kelley, Scott T. ;
Siegel, Jeffrey ;
Caporaso, J. Gregory .
MSYSTEMS, 2016, 1 (02)