Bayesian compositional generalized linear models for analyzing microbiome data

Times cited: 2
Authors
Zhang, Li [1 ]
Zhang, Xinyan [2 ]
Yi, Nengjun [1 ]
Affiliations
[1] Univ Alabama Birmingham, Dept Biostat, Birmingham, AL 35294 USA
[2] Kennesaw State Univ, Sch Data Sci & Analyt, Kennesaw, GA USA
Keywords
Bayesian models; compositional data; MCMC; microbiome; sum-to-zero restriction; statistical analysis; gut microbiota; regression
DOI
10.1002/sim.9946
Chinese Library Classification number
Q [Biological sciences]
Discipline classification code
07; 0710; 09
Abstract
The impact of the microbiome on human health and disease has gained significant scientific attention. Researchers seek to connect microbiome features with health conditions, aiming to predict diseases and develop personalized medicine strategies. However, several characteristics of microbiome data limit the usefulness of conventional models. First, the observed data are compositional, because the counts within each sample are bound by a fixed-sum constraint. Second, microbiome data are often high-dimensional, with the number of variables exceeding the number of available samples. Third, microbiome features exhibiting phenotypical similarity usually have similar influence on the response variable. To address these challenges, we proposed Bayesian compositional generalized linear models (BCGLM) for analyzing microbiome data, with a structured regularized horseshoe prior for the compositional coefficients and a soft sum-to-zero restriction on the coefficients imposed through the prior distribution. We fitted the proposed models using Markov chain Monte Carlo (MCMC) algorithms with the R package rstan. The performance of the proposed method was assessed in extensive simulation studies. The simulation results show that our approach outperforms existing methods, with more accurate coefficient estimates and lower prediction error. We also applied the proposed method to a microbiome study to find microorganisms linked to inflammatory bowel disease (IBD). To make this work reproducible, the code and data used in this article are available at .
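The role of the sum-to-zero restriction can be illustrated with a small sketch. It assumes a log-contrast linear predictor of the form intercept + sum_j beta_j * log(x_j), which is a common formulation for compositional covariates; the abstract does not spell out the exact model. The function names, the toy numbers, and the Normal(0, sd) form of the soft restriction are all hypothetical and are not taken from the article or from rstan. The sketch shows that coefficients summing to zero give the same predictor for raw counts and for relative abundances, while unconstrained coefficients do not.

```python
import math

def linear_predictor(x, beta, intercept=0.0):
    """Log-contrast predictor: intercept + sum_j beta_j * log(x_j).

    Assumes every entry of x is strictly positive (zero counts would need
    a pseudo-count or another zero-handling step, not shown here).
    """
    return intercept + sum(b * math.log(v) for b, v in zip(beta, x))

def soft_sum_to_zero_logprior(beta, sd=0.001):
    """Hypothetical soft restriction: Normal(0, sd) log-density on sum(beta).

    A small sd pulls sum(beta) toward zero without enforcing it exactly.
    This is an illustrative assumption, not the article's specification.
    """
    s = sum(beta)
    return -0.5 * (s / sd) ** 2 - math.log(sd * math.sqrt(2.0 * math.pi))

# Toy sample: three taxa, raw counts and the matching relative abundances.
counts = [120.0, 30.0, 50.0]
total = sum(counts)
proportions = [c / total for c in counts]

beta_zero_sum = [0.8, -0.5, -0.3]  # coefficients sum to zero
beta_free = [0.8, -0.5, 0.2]       # coefficients sum to 0.5

# Rescaling x by the sample total shifts the predictor by sum(beta) * log(total).
diff_zero_sum = (linear_predictor(counts, beta_zero_sum)
                 - linear_predictor(proportions, beta_zero_sum))
diff_free = (linear_predictor(counts, beta_free)
             - linear_predictor(proportions, beta_free))

print("shift with zero-sum coefficients:", diff_zero_sum)
print("shift with unconstrained coefficients:", diff_free)
print("log-prior, zero-sum:", soft_sum_to_zero_logprior(beta_zero_sum))
print("log-prior, unconstrained:", soft_sum_to_zero_logprior(beta_free))
```

With zero-sum coefficients the shift is zero up to floating-point error, so the fitted relationship does not depend on sequencing depth. A soft version of the restriction, as sketched in the second function, rewards coefficient vectors whose sum is close to zero instead of requiring it exactly.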
Pages: 141-155
Number of pages: 15
Related papers
50 in total
  • [41] Bayesian compositional regression with microbiome features via variational inference
    Darren A. V. Scott
    Ernest Benavente
    Julian Libiseller-Egger
    Dmitry Fedorov
    Jody Phelan
    Elena Ilina
    Polina Tikhonova
    Alexander Kudryavstev
    Julia Galeeva
    Taane Clark
    Alex Lewin
    BMC Bioinformatics, 24
  • [42] Bayesian compositional regression with microbiome features via variational inference
    Scott, Darren A. V.
    Benavente, Ernest
    Libiseller-Egger, Julian
    Fedorov, Dmitry
    Phelan, Jody
    Ilina, Elena
    Tikhonova, Polina
    Kudryavstev, Alexander
    Galeeva, Julia
    Clark, Taane
    Lewin, Alex
    BMC BIOINFORMATICS, 2023, 24 (01)
  • [43] Total least squares solution for compositional data using linear models
    Fiserova, Eva
    Hron, Karel
    JOURNAL OF APPLIED STATISTICS, 2010, 37 (07) : 1137 - 1152
  • [44] Multiple linear regression modeling for compositional data
    Wang, Huiwen
    Shangguan, Liying
    Wu, Junjie
    Guan, Rong
    NEUROCOMPUTING, 2013, 122 : 490 - 500
  • [45] When relative and absolute information matter: Compositional predictor with a total in generalized linear models
    Coenders, Germa
    Martin-Fernandez, Josep A.
    Ferrer-Rosell, Berta
    STATISTICAL MODELLING, 2017, 17 (06) : 494 - 512
  • [46] Variable selection in microbiome compositional data analysis
    Susin, Antoni
    Wang, Yiwen
    Le Cao, Kim-Anh
    Calle, M. Luz
    NAR GENOMICS AND BIOINFORMATICS, 2020, 2 (02)
  • [47] Microbiome compositional data analysis for survival studies
    Pujolassos, Meritxell
    Susin, Antoni
    Calle, M. Luz
    NAR GENOMICS AND BIOINFORMATICS, 2024, 6 (02)
  • [48] Bayesian model selection in linear mixed models for longitudinal data
    Ariyo, Oludare
    Quintero, Adrian
    Munoz, Johanna
    Verbeke, Geert
    Lesaffre, Emmanuel
    JOURNAL OF APPLIED STATISTICS, 2020, 47 (05) : 890 - 913
  • [49] Mediation effect selection in high-dimensional and compositional microbiome data
    Zhang, Haixiang
    Chen, Jun
    Feng, Yang
    Wang, Chan
    Li, Huilin
    Liu, Lei
    STATISTICS IN MEDICINE, 2021, 40 (04) : 885 - 896
  • [50] Principal microbial groups: compositional alternative to phylogenetic grouping of microbiome data
    Boyraz, Asli
    Pawlowsky-Glahn, Vera
    Egozcue, Juan Jose
    Acar, Aybar Can
    BRIEFINGS IN BIOINFORMATICS, 2022, 23 (05)