A two-part mixed-effects model for analyzing longitudinal microbiome compositional data

被引:140
|
作者
Chen, Eric Z. [1 ,2 ]
Li, Hongzhe [1 ,2 ]
机构
[1] Univ Penn, Perelman Sch Med, Genom & Computat Biol Grad Grp, Philadelphia, PA 19104 USA
[2] Univ Penn, Perelman Sch Med, Dept Biostat & Epidemiol, Philadelphia, PA 19104 USA
关键词
GUT MICROBIOME; METAGENOMICS; PROGRESSION; DYNAMICS; DIET;
D O I
10.1093/bioinformatics/btw308
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: The human microbial communities are associated with many human diseases such as obesity, diabetes and inflammatory bowel disease. High-throughput sequencing technology has been widely used to quantify the microbial composition in order to understand its impacts on human health. Longitudinal measurements of microbial communities are commonly obtained in many microbiome studies. A key question in such microbiome studies is to identify the microbes that are associated with clinical outcomes or environmental factors. However, microbiome compositional data are highly skewed, bounded in [0,1), and often sparse with many zeros. In addition, the observations from repeated measures in longitudinal studies are correlated. A method that takes into account these features is needed for association analysis in longitudinal microbiome data. Results: In this paper, we propose a two-part zero-inflated Beta regression model with random effects (ZIBR) for testing the association between microbial abundance and clinical covariates for longitudinal microbiome data. The model includes a logistic regression component to model presence/absence of a microbe in the samples and a Beta regression component to model non-zero microbial abundance, where each component includes a random effect to account for the correlations among the repeated measurements on the same subject. Both simulation studies and the application to real microbiome data have shown that ZIBR model outperformed the previously used methods. The method provides a useful tool for identifying the relevant taxa based on longitudinal or repeated measures in microbiome research.
引用
收藏
页码:2611 / 2617
页数:7
相关论文
共 50 条
  • [41] A likelihood-based two-part marginal model for longitudinal semicontinuous data
    Su, Li
    Tom, Brian D. M.
    Farewell, Vernon T.
    STATISTICAL METHODS IN MEDICAL RESEARCH, 2015, 24 (02) : 194 - 205
  • [42] Two-Part and Related Regression Models for Longitudinal Data
    Farewell, V. T.
    Long, D. L.
    Tom, B. D. M.
    Yiu, S.
    Su, L.
    ANNUAL REVIEW OF STATISTICS AND ITS APPLICATION, VOL 4, 2017, 4 : 283 - 315
  • [43] Inference on Two-Part Latent Variable Analysis Model With Multivariate Longitudinal Data
    Xia, Ye-Mao
    Lu, Bin
    Tang, Nian-Sheng
    STRUCTURAL EQUATION MODELING-A MULTIDISCIPLINARY JOURNAL, 2019, 26 (05) : 685 - 709
  • [44] Negative Binomial Mixed Models for Analyzing Longitudinal Microbiome Data
    Zhang, Xinyan
    Pei, Yu-Fang
    Zhang, Lei
    Guo, Boyi
    Pendegraft, Amanda H.
    Zhuang, Wenzhuo
    Yi, Nengjun
    FRONTIERS IN MICROBIOLOGY, 2018, 9
  • [45] A marginalized two-part model for semicontinuous data
    Smith, Valerie A.
    Preisser, John S.
    Neelon, Brian
    Maciejewski, Matthew L.
    STATISTICS IN MEDICINE, 2014, 33 (28) : 4891 - 4903
  • [46] A two-part joint model for the analysis of survival and longitudinal binary data with excess zeros
    Rizopoulos, Dimitris
    Verbeke, Geert
    Lesaffre, Emmanuel
    Vanrenterghem, Yves
    BIOMETRICS, 2008, 64 (02) : 611 - 619
  • [47] Mixed-Effects Models for Conditional Quantiles with Longitudinal Data
    Liu, Yuan
    Bottai, Matteo
    INTERNATIONAL JOURNAL OF BIOSTATISTICS, 2009, 5 (01):
  • [48] Simplex Mixed-Effects Models for Longitudinal Proportional Data
    Qiu, Zhenguo
    Song, Peter X. -K.
    Tan, Ming
    SCANDINAVIAN JOURNAL OF STATISTICS, 2008, 35 (04) : 577 - 596
  • [49] Local polynomial mixed-effects models for longitudinal data
    Wu, HL
    Zhang, JT
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2002, 97 (459) : 883 - 897
  • [50] Functional mixed-effects model for periodic data
    Qin, L
    Guo, WS
    BIOSTATISTICS, 2006, 7 (02) : 225 - 234