Caution regarding the specificities of pan- cancer microbial structure

被引:29
作者
Gihawi, Abraham [1 ]
Cooper, Colin S. [1 ]
Brewer, Daniel S. [1 ,2 ]
机构
[1] Univ East Anglia, Norwich Med Sch, Bob Champ Res & Educ Bldg, Norwich NR4 7UQ, England
[2] Norwich Res Pk, Earlham Inst, Colney Lane, Norwich NR4 7UG, England
关键词
cancer; bacteria; viruses; microbiome; machine learning; contamination; SP-NOV; DIVERSITY; VIRUSES; BACTERIAL;
D O I
10.1099/mgen.0.001088
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Results published in an article by Poore et al. (Nature. 2020;579:567-574) suggested that machine learning models can almost perfectly distinguish between tumour types based on their microbial composition using machine learning models. Whilst we believe that there is the potential for microbial composition to be used in this manner, we have concerns with the paper that make us question the certainty of the conclusions drawn. We believe there are issues in the areas of the contribution of contamination, handling of batch effects, false positive classifications and limitations in the machine learning approaches used. This makes it difficult to identify whether the authors have identified true biological signal and how robust these models would be in use as clinical biomarkers. We commend Poore et al. on their approach to open data and reproducibility that has enabled this analysis. We hope that this discourse assists the future development of machine learning models and hypothesis generation in microbiome research.
引用
收藏
页数:6
相关论文
共 38 条
[31]   Diversity of Viruses Infecting Eukaryotic Algae [J].
Short, Steven M. ;
Staniewski, Michael A. ;
Chaban, Yuri V. ;
Long, Andrew M. ;
Wang, Donglin .
CURRENT ISSUES IN MOLECULAR BIOLOGY, 2020, 39 :29-61
[32]   An improved polyclonal antiserum for detecting Ryegrass mosaic rymovirus [J].
Webster, DE ;
Beck, DL ;
Rabenstein, F ;
Forster, RLS ;
Guy, PL .
ARCHIVES OF VIROLOGY, 2005, 150 (09) :1921-1926
[33]   Navigating the pitfalls of applying machine learning in genomics [J].
Whalen, Sean ;
Schreiber, Jacob ;
Noble, William S. ;
Pollard, Katherine S. .
NATURE REVIEWS GENETICS, 2022, 23 (03) :169-181
[34]  
Wickham H., 2019, J. Open. Source Softw, V4, P1686, DOI [10.21105/joss.01686, DOI 10.21105/JOSS.01686]
[35]  
Wilke C. O., 2020, COWPLOT STREAMLINED
[36]   Taxonomic reassessment of N4-like viruses using comparative genomics and proteomics suggests a new subfamily - "Enquartavirinae'' [J].
Wittmann, Johannes ;
Klumpp, Jochen ;
Switt, Andrea I. Moreno ;
Yagubi, Abdelbaset ;
Ackermann, Hans-Wolfgang ;
Wiedmann, Martin ;
Svircev, Antonet ;
Nash, John H. E. ;
Kropinski, Andrew M. .
ARCHIVES OF VIROLOGY, 2015, 160 (12) :3053-3062
[37]   Complete genome sequence of a novel velarivirus infecting areca palm in China [J].
Yu, Hongmei ;
Qi, Shuishui ;
Chang, Zhaoxia ;
Rong, Qiqi ;
Akinyemi, Ibukun A. ;
Wu, Qingfa .
ARCHIVES OF VIROLOGY, 2015, 160 (09) :2367-2370
[38]   The landscape of viral associations in human cancers [J].
Zapatka, Marc ;
Borozan, Ivan ;
Brewer, Daniel S. ;
Iskar, Murat ;
Grundhoff, Adam ;
Alawi, Malik ;
Desai, Nikita ;
Sueltmann, Holger ;
Moch, Holger ;
Cooper, Colin S. ;
Eils, Roland ;
Ferretti, Vincent ;
Lichter, Peter .
NATURE GENETICS, 2020, 52 (03) :320-+