Using gut microbiota as a diagnostic tool for colorectal cancer: machine learning techniques reveal promising results

被引:4
作者
Lu, Fang [1 ,2 ,3 ]
Lei, Ting [2 ,3 ,4 ]
Zhou, Jie [1 ,2 ,3 ]
Liang, Hao [1 ,2 ,3 ,5 ]
Cui, Ping [2 ,3 ,5 ]
Zuo, Taiping [2 ,3 ,4 ]
Ye, Li [1 ,2 ,3 ]
Chen, Hui [2 ,3 ,4 ]
Huang, Jiegang [1 ,2 ,3 ]
机构
[1] Guangxi Med Univ, Sch Publ Hlth, Nanning 530021, Guangxi, Peoples R China
[2] Guangxi Key Lab AIDS Prevent & Treatment, Nanning 530021, Guangxi, Peoples R China
[3] Guangxi Univ, Key Lab Prevent & Control Highly Prevalent Dis, Nanning 530021, Guangxi, Peoples R China
[4] Guangxi Med Univ, Geriatr Digest Dept Internal Med, Affiliated Hosp 1, Nanning, Peoples R China
[5] Guangxi Med Univ, Life Sci Inst, Nanning 530021, Guangxi, Peoples R China
基金
中国国家自然科学基金;
关键词
colorectal cancer; gut microbiome; 16S rRNA gene sequencing; diagnosis; machine learning; biomarker; PREDICTION; MODEL;
D O I
10.1099/jmm.0.001699
中图分类号
Q93 [微生物学];
学科分类号
071005 ; 100705 ;
摘要
Introduction. Increasing evidence suggests a correlation between gut microbiota and colorectal cancer (CRC).Hypothesis/Gap Statement. However, few studies have used gut microbiota as a diagnostic biomarker for CRC.Aim. The objective of this study was to explore whether a machine learning (ML) model based on gut microbiota could be used to diagnose CRC and identify key biomarkers in the model. Methodology. We sequenced the 16S rRNA gene from faecal samples of 38 participants, including 17 healthy subjects and 21 CRC patients. Eight supervised ML algorithms were used to diagnose CRC based on faecal microbiota operational taxonomic units (OTUs), and the models were evaluated in terms of identification, calibration and clinical practicality for optimal modelling parameters. Finally, the key gut microbiota was identified using the random forest (RF) algorithm.Results. We found that CRC was associated with the dysregulation of gut microbiota. Through a comprehensive evaluation of supervised ML algorithms, we found that different algorithms had significantly different prediction performance using faecal microbiomes. Different data screening methods played an important role in optimization of the prediction models. We found that naive Bayes algorithms [NB, accuracy=0.917, area under the curve (AUC)=0.926], RF (accuracy=0.750, AUC=0.926) and logistic regression (LR, accuracy=0.750, AUC=0.889) had high predictive potential for CRC. Furthermore, important features in the model, namely s__metagenome_g__Lachnospiraceae_ND3007_group (AUC=0.814), s__Escherichia_coli_g__Escherichia-Shigella (AUC=0.784) and s__unclassified_g__Prevotella (AUC=0.750), could each be used as diagnostic biomarkers of CRC.Conclusions. Our results suggested an association between gut microbiota dysregulation and CRC, and demonstrated the fea-sibility of the gut microbiota to diagnose cancer. The bacteria s__metagenome_g__Lachnospiraceae_ND3007_group, s__Escheri-chia_coli_g__Escherichia-Shigella and s__unclassified_g__Prevotella were key biomarkers for CRC.
引用
收藏
页数:12
相关论文
共 44 条
  • [1] Human Gut Microbiome and Risk for Colorectal Cancer
    Ahn, Jiyoung
    Sinha, Rashmi
    Pei, Zhiheng
    Dominianni, Christine
    Wu, Jing
    Shi, Jianxin
    Goedert, James J.
    Hayes, Richard B.
    Yang, Liying
    [J]. JNCI-JOURNAL OF THE NATIONAL CANCER INSTITUTE, 2013, 105 (24): : 1907 - 1911
  • [2] Systematic evaluation of supervised classifiers for fecal microbiota-based prediction of colorectal cancer
    Ai, Luoyan
    Tian, Haiying
    Chen, Zhaofei
    Chen, Huimin
    Xu, Jie
    Fang, Jing-Yuan
    [J]. ONCOTARGET, 2017, 8 (06) : 9546 - 9556
  • [3] Microbiota-based model improves the sensitivity of fecal immunochemical test for detecting colonic lesions
    Baxter, Nielson T.
    Ruffin, Mack T.
    Rogers, Mary A. M.
    Schloss, Patrick D.
    [J]. GENOME MEDICINE, 2016, 8
  • [4] Artificial intelligence in digital pathology - new tools for diagnosis and precision oncology
    Bera, Kaustav
    Schalper, Kurt A.
    Rimm, David L.
    Velcheti, Vamsidhar
    Madabhushi, Anant
    [J]. NATURE REVIEWS CLINICAL ONCOLOGY, 2019, 16 (11) : 703 - 715
  • [5] Cao Y., 2021, Gastroenterology, V161, DOI 10.1053
  • [6] Accuracy of screening for fecal occult blood on a single stool sample obtained by digital rectal examination: A comparison with recommended sampling practice
    Collins, JF
    Lieberman, DA
    Durbin, TE
    Weiss, DG
    [J]. ANNALS OF INTERNAL MEDICINE, 2005, 142 (02) : 81 - 85
  • [7] Butyrate enemas in experimental colitis and protection against large bowel cancer in a rat model
    DArgenio, G
    Cosenza, V
    DelleCave, M
    Iovino, P
    DellaValle, N
    Lombardi, G
    Mazzacca, G
    [J]. GASTROENTEROLOGY, 1996, 110 (06) : 1727 - 1734
  • [8] DeStefano Shields CE., 2018, Cell Host Microbe, V23
  • [9] Dynamics and associations of microbial community types across the human body
    Ding, Tao
    Schloss, Patrick D.
    [J]. NATURE, 2014, 509 (7500) : 357 - +
  • [10] Gut microbiome development along the colorectal adenoma-carcinoma sequence
    Feng, Qiang
    Liang, Suisha
    Jia, Huijue
    Stadlmayr, Andreas
    Tang, Longqing
    Lan, Zhou
    Zhang, Dongya
    Xia, Huihua
    Xu, Xiaoying
    Jie, Zhuye
    Su, Lili
    Li, Xiaoping
    Li, Xin
    Li, Junhua
    Xiao, Liang
    Huber-Schoenauer, Ursula
    Niederseer, David
    Xu, Xun
    Al-Aama, Jumana Yousuf
    Yang, Huanming
    Wang, Jian
    Kristiansen, Karsten
    Arumugam, Manimozhiyan
    Tilg, Herbert
    Datz, Christian
    Wang, Jun
    [J]. NATURE COMMUNICATIONS, 2015, 6