Bayesian compositional models for ordinal response

被引:2
作者
Zhang, Li [1 ]
Zhang, Xinyan [2 ]
Leach, Justin M. [1 ]
Rahman, A. K. M. F. [1 ]
Yi, Nengjun [1 ]
机构
[1] Univ Alabama Birmingham, Dept Biostat, Birmingham, AL 35294 USA
[2] Kennesaw State Univ, Sch Data Sci & Analyt, Kennesaw, GA USA
关键词
Compositional data; MCMC; Hamiltonian Monte Carlo; sum-to-zero restriction; microbiome; VARIABLE SELECTION; REGRESSION-MODELS;
D O I
10.1177/09622802241247730
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Ordinal response is commonly found in medicine, biology, and other fields. In many situations, the predictors for this ordinal response are compositional, which means that the sum of predictors for each sample is fixed. Examples of compositional data include the relative abundance of species in microbiome data and the relative frequency of nutrition concentrations. Moreover, the predictors that are strongly correlated tend to have similar influence on the response outcome. Conventional cumulative logistic regression models for ordinal responses ignore the fixed-sum constraint on predictors and their associated interrelationships, and thus are not appropriate for analyzing compositional predictors.To solve this problem, we proposed Bayesian Compositional Models for Ordinal Response to analyze the relationship between compositional data and an ordinal response with a structured regularized horseshoe prior for the compositional coefficients and a soft sum-to-zero restriction on coefficients through the prior distribution. The method was implemented with R package rstan using efficient Hamiltonian Monte Carlo algorithm. We performed simulations to compare the proposed approach and existing methods for ordinal responses. Results revealed that our proposed method outperformed the existing methods in terms of parameter estimation and prediction. We also applied the proposed method to a microbiome study HMP2Data, to find microorganisms linked to ordinal inflammatory bowel disease levels. To make this work reproducible, the code and data used in this paper are available at https://github.com/Li-Zhang28/BCO.
引用
收藏
页码:1043 / 1054
页数:12
相关论文
共 39 条
[1]  
Agresti A., 2013, Categorical Data Analysis, V3rd
[2]  
AITCHISON J, 1984, BIOMETRIKA, V71, P323
[3]  
AITCHISON J, 1982, J ROY STAT SOC B, V44, P139
[4]   RELATIONSHIP BETWEEN VARIABLE SELECTION AND DATA AUGMENTATION AND A METHOD FOR PREDICTION [J].
ALLEN, DM .
TECHNOMETRICS, 1974, 16 (01) :125-127
[5]  
Banerjee S., 2003, Hierarchical modeling and analysis for spatial data
[7]   On conditional and intrinsic autoregressions [J].
Besag, J ;
Kooperberg, C .
BIOMETRIKA, 1995, 82 (04) :733-746
[8]   Ordinal Regression Models in Psychology: A Tutorial [J].
Buerkner, Paul-Christian ;
Vuorre, Matti .
ADVANCES IN METHODS AND PRACTICES IN PSYCHOLOGICAL SCIENCE, 2019, 2 (01) :77-101
[9]   brms: An R Package for Bayesian Multilevel Models Using Stan [J].
Buerkner, Paul-Christian .
JOURNAL OF STATISTICAL SOFTWARE, 2017, 80 (01) :1-28
[10]   Regression Models for Compositional Data: General Log-Contrast Formulations, Proximal Optimization, and Microbiome Data Applications [J].
Combettes, Patrick L. ;
Mueller, Christian L. .
STATISTICS IN BIOSCIENCES, 2021, 13 (02) :217-242