High-Dimensional Bayesian Semiparametric Models for Small Samples: A Principled Approach to the Analysis of Cytokine Expression Data

被引:0
|
作者
Poli, Giovanni [1 ]
Argiento, Raffaele [2 ,3 ]
Amedei, Amedeo [4 ]
Stingo, Francesco C. [1 ]
机构
[1] Univ Firenze, Dept Stat, Comp Sci, Applicat G Parenti, Florence, Italy
[2] Univ Bergamo, Dept Econ, Bergamo, Italy
[3] Univ Cattolica Sacro Cuore, Dept Stat Sci, Milan, Italy
[4] Univ Firenze, Dept Expt & Clin Med, Florence, Italy
关键词
Crohn's disease; cytokines; Dirichlet Process; semiparametric Bayesian modeling; VARIABLE SELECTION; EMPIRICAL BAYES; DISTRIBUTIONS; SPIKE;
D O I
10.1002/bimj.70000
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
In laboratory medicine, due to the lack of sample availability and resources, measurements of many quantities of interest are commonly collected over a few samples, making statistical inference particularly challenging. In this context, several hypotheses can be tested, and studies are not often powered accordingly. We present a semiparametric Bayesian approach to effectively test multiple hypotheses applied to an experiment that aims to identify cytokines involved in Crohn's disease (CD) infection that may be ongoing in multiple tissues. We assume that the positive correlation commonly observed between cytokines is caused by latent groups of effects, which in turn result from a common cause. These clusters are effectively modeled through a Dirichlet Process (DP) that is one of the most popular choices as nonparametric prior in Bayesian statistics and has been proven to be a powerful tool for model-based clustering. We use a spike-slab distribution as the base measure of the DP. The nonparametric part has been included in an additive model whose parametric component is a Bayesian hierarchical model. We include simulations that empirically demonstrate the effectiveness of the proposed testing procedure in settings that mimic our application's sample size and data structure. Our CD data analysis shows strong evidence of a cytokine gradient in the external intestinal tissue.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] COMPLEXITY ANALYSIS OF BAYESIAN LEARNING OF HIGH-DIMENSIONAL DAG MODELS AND THEIR EQUIVALENCE CLASSES
    Zhou, Quan
    Chang, Hyunwoong
    ANNALS OF STATISTICS, 2023, 51 (03): : 1058 - 1085
  • [32] Some improved estimation strategies in high-dimensional semiparametric regression models with application to riboflavin production data
    M. Arashi
    Mahdi Roozbeh
    Statistical Papers, 2019, 60 : 667 - 686
  • [33] Some improved estimation strategies in high-dimensional semiparametric regression models with application to riboflavin production data
    Arashi, M.
    Roozbeh, Mahdi
    STATISTICAL PAPERS, 2019, 60 (03) : 317 - 336
  • [34] ordinalbayes: Fitting Ordinal Bayesian Regression Models to High-Dimensional Data Using R
    Archer, Kellie J.
    Seffernick, Anna Eames
    Sun, Shuai
    Zhang, Yiran
    STATS, 2022, 5 (02): : 371 - 384
  • [35] Bayesian variable selection in clustering high-dimensional data
    Tadesse, MG
    Sha, N
    Vannucci, M
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2005, 100 (470) : 602 - 617
  • [36] New approach to Bayesian high-dimensional linear regression
    Jalali, Shirin
    Maleki, Arian
    INFORMATION AND INFERENCE-A JOURNAL OF THE IMA, 2018, 7 (04) : 605 - 655
  • [37] Efficient quadratures for high-dimensional Bayesian data assimilation
    Cheng, Ming
    Wang, Peng
    Tartakovsky, Daniel M.
    JOURNAL OF COMPUTATIONAL PHYSICS, 2024, 506
  • [38] Bayesian variable selection for high-dimensional rank data
    Cui, Can
    Singh, Susheela P.
    Staicu, Ana-Maria
    Reich, Brian J.
    ENVIRONMETRICS, 2021, 32 (07)
  • [39] Adaptive Bayesian density regression for high-dimensional data
    Shen, Weining
    Ghosal, Subhashis
    BERNOULLI, 2016, 22 (01) : 396 - 420
  • [40] Robust structured heterogeneity analysis approach for high-dimensional data
    Sun, Yifan
    Luo, Ziye
    Fan, Xinyan
    STATISTICS IN MEDICINE, 2022, 41 (17) : 3229 - 3259