Random Forest estimation of the ordered choice model

被引:0
作者
Lechner, Michael [1 ,2 ,3 ,4 ,5 ]
Okasa, Gabriel [1 ,6 ]
机构
[1] Univ St Gallen SEW HSG, Swiss Inst Empir Econ Res, Rosenberg str 22, CH-9000 St Gallen, Switzerland
[2] CEPR, London, England
[3] CESifo, Munich, Germany
[4] IAB, Nurnberg, Germany
[5] IZA, Bonn, Germany
[6] Swiss Natl Sci Fdn SNSF, Wildhainweg 3, CH-3001 Bern, Switzerland
关键词
Ordered choice models; Random Forests; Probabilities; Marginal effects; Machine learning; C14; C25; C40; SEMIPARAMETRIC ESTIMATION; PREDICTION; HEALTH; CLASSIFICATION;
D O I
10.1007/s00181-024-02646-4
中图分类号
F [经济];
学科分类号
02 ;
摘要
In this paper we develop a new machine learning estimator for ordered choice models based on the Random Forest. The proposed Ordered Forest flexibly estimates the conditional choice probabilities while taking the ordering information explicitly into account. In addition to common machine learning estimators, it enables the estimation of marginal effects as well as conducting inference and thus provides the same output as classical econometric estimators. An extensive simulation study reveals a good predictive performance, particularly in settings with nonlinearities and high correlation among covariates. An empirical application contrasts the estimation of marginal effects and their standard errors with an Ordered Logit model. A software implementation of the Ordered Forest is provided both in R and Python in the package orf available on CRAN and PyPI, respectively.
引用
收藏
页码:1 / 106
页数:106
相关论文
共 95 条
[91]  
Van Rossum G., 2009, PYTHON 3 REFERENCE M
[92]  
Wager S., 2014, ARXIV
[93]  
Wager S, 2014, J MACH LEARN RES, V15, P1625
[94]   Understanding and interpreting generalized ordered logit models [J].
Williams, Richard .
JOURNAL OF MATHEMATICAL SOCIOLOGY, 2016, 40 (01) :7-20
[95]   ranger: A Fast Implementation of Random Forests for High Dimensional Data in C plus plus and R [J].
Wright, Marvin N. ;
Ziegler, Andreas .
JOURNAL OF STATISTICAL SOFTWARE, 2017, 77 (01) :1-17