MicroHDF: predicting host phenotypes with metagenomic data using a deep forest-based framework

Times cited: 1
Authors
Shi, Kai [1, 2]
Liu, Qiaohui [1]
Ji, Qingrong [1]
He, Qisheng [1]
Zhao, Xing-Ming [3, 4]
Affiliations
[1] Guilin Univ Technol, Coll Comp Sci & Engn, Guilin 541004, Guangxi, Peoples R China
[2] Guilin Univ Technol, Guangxi Key Lab Embedded Technol & Intelligent Sys, Guilin 541004, Guangxi, Peoples R China
[3] Huzhou Univ, Affiliated Cent Hosp, Huzhou Cent Hosp, Huzhou 313000, Zhejiang, Peoples R China
[4] Fudan Univ, Inst Sci & Technol Brain Inspired Intelligence, Shanghai 200433, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
disease prediction; gut microbiome; machine learning; metagenomics; phylogenetic tree; HUMAN GUT MICROBIOME; ASSOCIATION; COLITIS;
DOI
10.1093/bib/bbae530
Chinese Library Classification (CLC) number
Q5 [Biochemistry];
Discipline classification codes
071010; 081704;
Abstract
The gut microbiota plays a vital role in human health, and significant effort has been made to predict human phenotypes, especially diseases, using machine learning (ML) methods with the microbiota as a promising indicator or predictor. However, when predicting host phenotypes from metagenomic data, accuracy is affected by many factors, e.g. small sample size, class imbalance, and high-dimensional features. To address these challenges, we propose MicroHDF, an interpretable deep learning framework for predicting host phenotypes, in which cascaded layers of deep forest units are designed to handle sample class imbalance and high-dimensional features. The experimental results show that the performance of MicroHDF is competitive with that of existing state-of-the-art methods on 13 publicly available datasets covering six different diseases. In particular, it performs best, with an area under the receiver operating characteristic curve of 0.9182 +/- 0.0098 and 0.9469 +/- 0.0076 for inflammatory bowel disease (IBD) and liver cirrhosis, respectively. MicroHDF also shows better performance and robustness in cross-study validation. Furthermore, MicroHDF is applied to two high-risk diseases, IBD and autism spectrum disorder, as case studies to identify potential biomarkers. In conclusion, our method provides effective and reliable prediction of the host phenotype and discovers informative features with biological insights.
Pages: 13