Bayesian modeling via discrete nonparametric priors

被引:0
作者
Marta Catalano
Antonio Lijoi
Igor Prünster
Tommaso Rigon
机构
[1] University of Warwick,Department of Statistics
[2] Bocconi University,Bocconi Institute for Data Science and Analytics
[3] University of Milano-Bicocca,Department of Economics, Management and Statistics
来源
Japanese Journal of Statistics and Data Science | 2023年 / 6卷
关键词
Clustering; Density estimation; Dependence; Dirichlet process; Exchangeability; Mixture model; Partial exchangeability; Pitman–Yor process; Species discovery;
D O I
暂无
中图分类号
学科分类号
摘要
The availability of complex-structured data has sparked new research directions in statistics and machine learning. Bayesian nonparametrics is at the forefront of this trend thanks to two crucial features: its coherent probabilistic framework, which naturally leads to principled prediction and uncertainty quantification, and its infinite-dimensionality, which exempts from parametric restrictions and ensures full modeling flexibility. In this paper, we provide a concise overview of Bayesian nonparametrics starting from its foundations and the Dirichlet process, the most popular nonparametric prior. We describe the use of the Dirichlet process in species discovery, density estimation, and clustering problems. Among the many generalizations of the Dirichlet process proposed in the literature, we single out the Pitman–Yor process, and compare it to the Dirichlet process. Their different features are showcased with real-data illustrations. Finally, we consider more complex data structures, which require dependent versions of these models. One of the most effective strategies to achieve this goal is represented by hierarchical constructions. We highlight the role of the dependence structure in the borrowing of information and illustrate its effectiveness on unbalanced datasets.
引用
收藏
页码:607 / 624
页数:17
相关论文
共 107 条
[1]  
Antoniak CE(1974)Mixtures of Dirichlet processes with applications to Bayesian nonparametric problems The Annals of Statistics 2 1152-1174
[2]  
Barrios E(2013)Modeling with normalized random measure mixture models Statistical Science 28 313-334
[3]  
Lijoi A(1973)Ferguson distributions via Pólya urn schemes The Annals of Statistics 1 353-355
[4]  
Nieto-Barajas LE(2019)Latent nested nonparametric priors. (With discussion) Bayesian Analysis 15 1303-1356
[5]  
Prünster I(2019)Distribution theory for hierarchical processes The Annals of Statistics 47 67-92
[6]  
Blackwell D(2017)Bayesian prediction with multiple-samples information Journal of Multivariate Analysis 156 18-28
[7]  
MacQueen JB(2021)Survival analysis via hierarchically dependent mixture hazards Annals of Statistics 49 863-884
[8]  
Camerlenghi F(2017)Sparse graphs using exchangeable random measures Journal of the Royal Statistical Society Series B: Statistical Methodology 79 1295-1366
[9]  
Dunson D(2022)Posterior asymptotics for boosted hierarchical Dirichlet process mixtures Journal of Machine Learning Research 23 1-23
[10]  
Lijoi A(1978)Nonparametric statistical problems under partial exchangeability: The role of associative means Quaderni Istituto Matematica Finanziaria dell’Università di Torino Serie III 1-36