Instance-Based Uncertainty Estimation for Gradient-Boosted Regression Trees

Cited by: 0
Authors
Brophy, Jonathan [1]
Lowd, Daniel [1]
Affiliations
[1] Univ Oregon, Eugene, OR 97403 USA
Source
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022 | 2022
Keywords
MACHINE; PERFORMANCE; PREDICTION; TUTORIAL; FORESTS;
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Gradient-boosted regression trees (GBRTs) are hugely popular for solving tabular regression problems, but provide no estimate of uncertainty. We propose Instance-Based Uncertainty estimation for Gradient-boosted regression trees (IBUG), a simple method for extending any GBRT point predictor to produce probabilistic predictions. IBUG computes a non-parametric distribution around a prediction using the k-nearest training instances, where distance is measured with a tree-ensemble kernel. The runtime of IBUG depends on the number of training examples at each leaf in the ensemble, and can be improved by sampling trees or training instances. Empirically, we find that IBUG achieves similar or better performance than the previous state-of-the-art across 22 benchmark regression datasets. We also find that IBUG can achieve improved probabilistic performance by using different base GBRT models, and can more flexibly model the posterior distribution of a prediction than competing methods. We also find that previous methods suffer from poor probabilistic calibration on some datasets, which can be mitigated using a scalar factor tuned on the validation data. Source code is available at https://github.com/jjbrophy47/ibug.
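The nearest-neighbor step described in the abstract can be illustrated with a minimal sketch. It assumes the tree-ensemble kernel counts the trees in which a training instance lands in the same leaf as the test instance; the abstract does not spell this out, so treat it as an assumption. It also assumes the spread around the prediction is summarized by the variance of the k nearest targets. The function names, the toy leaf indices, and the `min_var` floor are hypothetical and are not taken from the IBUG source code.

```python
from statistics import pvariance


def leaf_affinity(train_leaves, test_leaves):
    """For each training instance, count the trees in which it shares a leaf
    with the test instance (assumed form of the tree-ensemble kernel)."""
    return [
        sum(a == b for a, b in zip(row, test_leaves))
        for row in train_leaves
    ]


def knn_uncertainty(train_leaves, y_train, test_leaves, point_pred,
                    k=3, min_var=1e-8):
    """Return (mean, variance, neighbor targets) for one test instance.

    The mean is the GBRT point prediction, passed in unchanged; the variance
    comes from the targets of the k highest-affinity training instances.
    """
    affinity = leaf_affinity(train_leaves, test_leaves)
    # Highest affinity first; ties are broken by training index.
    nearest = sorted(range(len(y_train)), key=lambda i: (-affinity[i], i))[:k]
    neighbor_targets = [y_train[i] for i in nearest]
    # Hypothetical floor so the variance is never exactly zero.
    variance = max(pvariance(neighbor_targets), min_var)
    return point_pred, variance, neighbor_targets


# Toy data: 5 training instances, 3 trees. Each row lists the leaf index the
# instance reaches in each tree (made-up values for illustration only).
train_leaves = [
    [0, 1, 2],
    [0, 1, 3],
    [0, 2, 2],
    [1, 0, 0],
    [1, 0, 1],
]
y_train = [10.0, 12.0, 11.0, 30.0, 32.0]
test_leaves = [0, 1, 2]

mu, var, neighbors = knn_uncertainty(
    train_leaves, y_train, test_leaves, point_pred=11.2, k=3
)
print(mu, var, neighbors)
```

In practice the leaf indices would come from a trained GBRT model rather than being written by hand. Each affinity computation in this sketch loops over every tree, so evaluating only a sample of the trees, as the abstract suggests for reducing runtime, would shorten that loop.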
Pages: 15
Related papers
17 in total
  • [1] Adapting and Evaluating Influence-Estimation Methods for Gradient-Boosted Decision Trees
    Brophy, Jonathan
    Hammoudeh, Zayd
    Lowd, Daniel
    JOURNAL OF MACHINE LEARNING RESEARCH, 2023, 24
  • [2] Improving the prediction of an atmospheric chemistry transport model using gradient-boosted regression trees
    Ivatt, Peter D.
    Evans, Mathew J.
    ATMOSPHERIC CHEMISTRY AND PHYSICS, 2020, 20 (13) : 8063 - 8082
  • [3] Back-Analysis Method for Stope Displacements Using Gradient-Boosted Regression Tree and Firefly Algorithm
    Qi, Chongchong
    Fourie, Andy
    Zhao, Xu
    JOURNAL OF COMPUTING IN CIVIL ENGINEERING, 2018, 32 (05)
  • [4] Flexible loss functions for binary classification in gradient-boosted decision trees: An application to credit scoring
    Mushava, Jonah
    Murray, Michael
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 238
  • [5] Offshore application of landslide susceptibility mapping using gradient-boosted decision trees: a Gulf of Mexico case study
    Dyer, Alec S.
    Mark-Moser, MacKenzie
    Duran, Rodrigo
    Bauer, Jennifer R.
    NATURAL HAZARDS, 2024, 120 (07) : 6223 - 6244
  • [6] Instance-based regression with missing data applied to a photocatalytic oxidation process
    Leon, Florin
    Piuleac, Ciprian George
    Curteanu, Silvia
    Poulios, Ioannis
    CENTRAL EUROPEAN JOURNAL OF CHEMISTRY, 2012, 10 (04) : 1149 - 1156
  • [7] Cascading logistic regression onto gradient boosted decision trees for forecasting and trading stock indices
    Zhou, Feng
    Zhang, Qun
    Sornette, Didier
    Jiang, Liu
    APPLIED SOFT COMPUTING, 2019, 84
  • [8] Estimating punching performance in fiber-reinforced polymer concrete slabs utilizing machine learning and gradient-boosted regression techniques
    Sankarapandian, Krishnapriya
    Alshahrani, Haya Mesfer
    Alotaibi, Faiz Abdullah
    Alnfiai, Mrim M.
    MATERIA-RIO DE JANEIRO, 2025, 30
  • [9] Pressure drop modelling in sand filters in micro-irrigation using gradient boosted regression trees
    Garcia Nieto, Paulino J.
    Garcia-Gonzalo, Esperanza
    Arbat, Gerard
    Duran-Ros, Miquel
    Ramirez de Cartagena, Francisco
    Puig-Bargues, Jaume
    BIOSYSTEMS ENGINEERING, 2018, 171 : 41 - 51
  • [10] Remote Sensing Based Yield Estimation of Rice (Oryza Sativa L.) Using Gradient Boosted Regression in India
    Arumugam, Ponraj
    Chemura, Abel
    Schauberger, Bernhard
    Gornott, Christoph
    REMOTE SENSING, 2021, 13 (12)