Continuous-Time Birth-Death MCMC for Bayesian Regression Tree Models

被引:0
作者
Mohammadi, Reza [1 ]
Pratola, Matthew [2 ]
Kaptein, Maurits [3 ]
机构
[1] Univ Amsterdam, Amsterdam Business Sch, Amsterdam, Netherlands
[2] Ohio State Univ, Dept Stat, Columbus, OH 43210 USA
[3] Univ Tilburg, Stat & Res Methods, Tilburg, Netherlands
关键词
Bayesian Regression trees; Decision trees; Continuous-time MCMC; Bayesian structure learning; Birth-death process; Bayesian model averaging; Bayesian model selection; RANDOM FORESTS; REVERSIBLE JUMP; CART; CLASSIFICATION; MIXTURE;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Decision trees are flexible models that are well suited for many statistical regression problems. In the Bayesian framework for regression trees, Markov Chain Monte Carlo (MCMC) search algorithms are required to generate samples of tree models according to their posterior probabilities. The critical component of such MCMC algorithms is to construct "good" Metropolis-Hastings steps to update the tree topology. Such algorithms frequently suffer from poor mixing and local mode stickiness; therefore, the algorithms are slow to converge. Hitherto, authors have primarily used discrete-time birth/death mechanisms for Bayesian (sums of) regression tree models to explore the tree-model space. These algorithms are efficient, in terms of computation and convergence, only if the rejection rate is low which is not always the case. We overcome this issue by developing a novel search algorithm which is based on a continuous-time birth-death Markov process. The search algorithm explores the tree-model space by jumping between parameter spaces corresponding to different tree structures. The jumps occur in continuous time corresponding to the birth-death events which are modeled as independent Poisson processes. In the proposed algorithm, the moves between models are always accepted which can dramatically improve the convergence and mixing properties of the search algorithm. We provide theoretical support of the algorithm for Bayesian regression tree models and demonstrate its performance in a simulated example.
引用
收藏
页数:26
相关论文
共 41 条
[21]   Efficient sampling of Gaussian graphical models using conditional Bayes factors [J].
Hinne, Max ;
Lenkoski, Alex ;
Heskes, Tom ;
van Gerven, Marcel .
STAT, 2014, 3 (01) :326-336
[22]  
Lakshminarayanan B., 2013, Proc. Int. Conf. Machine Learning, V28, P280
[23]   Bayesian Regression Trees for High-Dimensional Prediction and Variable Selection [J].
Linero, Antonio R. .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2018, 113 (522) :626-636
[24]   Decision making and uncertainty quantification for individualized treatments using Bayesian Additive Regression Trees [J].
Logan, Brent R. ;
Sparapani, Rodney ;
McCulloch, Robert E. ;
Laud, Purushottam W. .
STATISTICAL METHODS IN MEDICAL RESEARCH, 2019, 28 (04) :1079-1093
[25]   Bayesian Structure Learning in Sparse Gaussian Graphical Models [J].
Mohammadi, A. ;
Wit, E. C. .
BAYESIAN ANALYSIS, 2015, 10 (01) :109-138
[26]   Using mixture of Gamma distributions for Bayesian analysis in an M/G/1 queue with optional second service [J].
Mohammadi, A. ;
Salehi-Rad, M. R. ;
Wit, E. C. .
COMPUTATIONAL STATISTICS, 2013, 28 (02) :683-700
[27]   Bayesian modelling of Dupuytren disease by using Gaussian copula graphical models [J].
Mohammadi, Abdolreza ;
Abegaz, Fentaw ;
van den Heuvel, Edwin ;
Wit, Ernst C. .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS, 2017, 66 (03) :629-645
[28]   BDgraph: An R Package for Bayesian Structure Learning in Graphical Models [J].
Mohammadi, Reza ;
Wit, Ernst C. .
JOURNAL OF STATISTICAL SOFTWARE, 2019, 89 (03) :1-30
[29]  
Mohammadi Reza, 2017, ARXIV170604416, V110, P116
[30]   Newer classification and regression tree techniques: Bagging and random forests for ecological prediction [J].
Prasad, AM ;
Iverson, LR ;
Liaw, A .
ECOSYSTEMS, 2006, 9 (02) :181-199