High-Dimensional Bayesian Semiparametric Models for Small Samples: A Principled Approach to the Analysis of Cytokine Expression Data

被引:0
|
作者
Poli, Giovanni [1 ]
Argiento, Raffaele [2 ,3 ]
Amedei, Amedeo [4 ]
Stingo, Francesco C. [1 ]
机构
[1] Univ Firenze, Dept Stat, Comp Sci, Applicat G Parenti, Florence, Italy
[2] Univ Bergamo, Dept Econ, Bergamo, Italy
[3] Univ Cattolica Sacro Cuore, Dept Stat Sci, Milan, Italy
[4] Univ Firenze, Dept Expt & Clin Med, Florence, Italy
关键词
Crohn's disease; cytokines; Dirichlet Process; semiparametric Bayesian modeling; VARIABLE SELECTION; EMPIRICAL BAYES; DISTRIBUTIONS; SPIKE;
D O I
10.1002/bimj.70000
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
In laboratory medicine, due to the lack of sample availability and resources, measurements of many quantities of interest are commonly collected over a few samples, making statistical inference particularly challenging. In this context, several hypotheses can be tested, and studies are not often powered accordingly. We present a semiparametric Bayesian approach to effectively test multiple hypotheses applied to an experiment that aims to identify cytokines involved in Crohn's disease (CD) infection that may be ongoing in multiple tissues. We assume that the positive correlation commonly observed between cytokines is caused by latent groups of effects, which in turn result from a common cause. These clusters are effectively modeled through a Dirichlet Process (DP) that is one of the most popular choices as nonparametric prior in Bayesian statistics and has been proven to be a powerful tool for model-based clustering. We use a spike-slab distribution as the base measure of the DP. The nonparametric part has been included in an additive model whose parametric component is a Bayesian hierarchical model. We include simulations that empirically demonstrate the effectiveness of the proposed testing procedure in settings that mimic our application's sample size and data structure. Our CD data analysis shows strong evidence of a cytokine gradient in the external intestinal tissue.
引用
收藏
页数:13
相关论文
共 50 条
  • [41] Saving behaviour and health: A high-dimensional Bayesian analysis of British panel data
    Brown, Sarah
    Ghosh, Pulak
    Gray, Daniel
    Pareek, Bhuvanesh
    Roberts, Jennifer
    EUROPEAN JOURNAL OF FINANCE, 2021, 27 (16): : 1581 - 1603
  • [42] Bayesian variable selection with sparse and correlation priors for high-dimensional data analysis
    Aijun Yang
    Xuejun Jiang
    Lianjie Shu
    Jinguan Lin
    Computational Statistics, 2017, 32 : 127 - 143
  • [43] Bayesian variable selection with sparse and correlation priors for high-dimensional data analysis
    Yang, Aijun
    Jiang, Xuejun
    Shu, Lianjie
    Lin, Jinguan
    COMPUTATIONAL STATISTICS, 2017, 32 (01) : 127 - 143
  • [44] Small sample sizes: A big data problem in high-dimensional data analysis
    Konietschke, Frank
    Schwab, Karima
    Pauly, Markus
    STATISTICAL METHODS IN MEDICAL RESEARCH, 2021, 30 (03) : 687 - 701
  • [45] Scalable spatio-temporal Bayesian analysis of high-dimensional electroencephalography data
    Mohammed, Shariq
    Dey, Dipak K.
    CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2021, 49 (01): : 107 - 128
  • [46] A Bayesian approach for semiparametric regression analysis of panel count data
    Jianhong Wang
    Xiaoyan Lin
    Lifetime Data Analysis, 2020, 26 : 402 - 420
  • [47] A Bayesian semiparametric approach for the differential analysis of sequence counts data
    Guindani, Michele
    Sepulveda, Nuno
    Paulino, Carlos Daniel
    Mueller, Peter
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS, 2014, 63 (03) : 385 - 404
  • [48] An Iteratively Reweighted Importance Kernel Bayesian Filtering Approach for High-Dimensional Data Processing
    Liu, Xin
    MATHEMATICS, 2024, 12 (19)
  • [49] A Bayesian approach for semiparametric regression analysis of panel count data
    Wang, Jianhong
    Lin, Xiaoyan
    LIFETIME DATA ANALYSIS, 2020, 26 (02) : 402 - 420
  • [50] In Nonparametric and High-Dimensional Models, Bayesian Ignorability is an Informative Prior
    Linero, Antonio R.
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2024, 119 (548) : 2785 - 2798