Bayesian compositional generalized linear models for analyzing microbiome data

被引:2
|
作者
Zhang, Li [1 ]
Zhang, Xinyan [2 ]
Yi, Nengjun [1 ]
机构
[1] Univ Alabama Birmingham, Dept Biostat, Birmingham, AL 35294 USA
[2] Kennesaw State Univ, Sch Data Sci & Analyt, Kennesaw, GA USA
关键词
Bayesian models; compositional data; MCMC; microbiome; sum-to-zero restriction; STATISTICAL-ANALYSIS; GUT MICROBIOTA; REGRESSION;
D O I
10.1002/sim.9946
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
The crucial impact of the microbiome on human health and disease has gained significant scientific attention. Researchers seek to connect microbiome features with health conditions, aiming to predict diseases and develop personalized medicine strategies. However, the practicality of conventional models is restricted due to important aspects of microbiome data. Specifically, the data observed is compositional, as the counts within each sample are bound by a fixed-sum constraint. Moreover, microbiome data often exhibits high dimensionality, wherein the number of variables surpasses the available samples. In addition, microbiome features exhibiting phenotypical similarity usually have similar influence on the response variable. To address the challenges posed by these aspects of the data structure, we proposed Bayesian compositional generalized linear models for analyzing microbiome data (BCGLM) with a structured regularized horseshoe prior for the compositional coefficients and a soft sum-to-zero restriction on coefficients through the prior distribution. We fitted the proposed models using Markov Chain Monte Carlo (MCMC) algorithms with R package rstan. The performance of the proposed method was assessed by extensive simulation studies. The simulation results show that our approach outperforms existing methods with higher accuracy of coefficient estimates and lower prediction error. We also applied the proposed method to microbiome study to find microorganisms linked to inflammatory bowel disease (IBD). To make this work reproducible, the code and data used in this article are available at .
引用
收藏
页码:141 / 155
页数:15
相关论文
共 50 条
  • [21] Bayesian semiparametric models for nonignorable missing mechanisms in generalized linear models
    Kalaylioglu, Z. I.
    Ozturk, O.
    JOURNAL OF APPLIED STATISTICS, 2013, 40 (08) : 1746 - 1763
  • [22] Regression Models for Compositional Data: General Log-Contrast Formulations, Proximal Optimization, and Microbiome Data Applications
    Patrick L. Combettes
    Christian L. Müller
    Statistics in Biosciences, 2021, 13 : 217 - 242
  • [23] Extended Bayesian Model Averaging in Generalized Linear Mixed Models Applied to Schizophrenia Family Data
    Tsai, Miao-Yu
    Hsiao, Chuhsing K.
    Chen, Wei J.
    ANNALS OF HUMAN GENETICS, 2011, 75 : 62 - 77
  • [24] A comparison of generalised linear models and compositional models for ordered categorical data
    Vencalek, Ondrej
    Hron, Karel
    Filzmoser, Peter
    STATISTICAL MODELLING, 2020, 20 (03) : 249 - 273
  • [25] Regression Models for Compositional Data: General Log-Contrast Formulations, Proximal Optimization, and Microbiome Data Applications
    Combettes, Patrick L.
    Mueller, Christian L.
    STATISTICS IN BIOSCIENCES, 2021, 13 (02) : 217 - 242
  • [26] Compositional Linear Regression on Interval-valued Data
    Pekaslan, Direnc
    Wagner, Christian
    2021 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2021), 2021,
  • [27] A Bayesian-model for compositional data analysis
    Brewer, MJ
    Soulsby, C
    Dunn, SM
    COMPSTAT 2002: PROCEEDINGS IN COMPUTATIONAL STATISTICS, 2002, : 105 - 110
  • [28] Bayesian projection approaches to variable selection in generalized linear models
    Nott, David J.
    Leng, Chenlei
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2010, 54 (12) : 3227 - 3241
  • [29] Bayesian Inference on Hierarchical Nonlocal Priors in Generalized Linear Models
    Cao, Xuan
    Lee, Kyoungjae
    BAYESIAN ANALYSIS, 2024, 19 (01): : 99 - 122
  • [30] A GLM-based zero-inflated generalized Poisson factor model for analyzing microbiome data
    Chi, Jinling
    Ye, Jimin
    Zhou, Ying
    FRONTIERS IN MICROBIOLOGY, 2024, 15