Probabilistic Gradient Boosting Machines for Large-Scale Probabilistic Regression

被引:18
|
作者
Sprangers, Olivier [1 ,2 ]
Schelter, Sebastian [2 ]
de Rijke, Maarten [2 ]
机构
[1] AIRLab, Amsterdam, Netherlands
[2] Univ Amsterdam, Amsterdam, Netherlands
来源
KDD '21: PROCEEDINGS OF THE 27TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING | 2021年
关键词
Probabilistic Regression; Gradient Boosting Machines;
D O I
10.1145/3447548.3467278
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Gradient Boosting Machines (GBMs) are hugely popular for solving tabular data problems. However, practitioners are not only interested in point predictions, but also in probabilistic predictions in order to quantify the uncertainty of the predictions. Creating such probabilistic predictions is difficult with existing GBM-based solutions: they either require training multiple models or they become too computationally expensive to be useful for large-scale settings. We propose Probabilistic Gradient Boosting Machines (PGBMs), a method to create probabilistic predictions with a single ensemble of decision trees in a computationally efficient manner. PGBM approximates the leaf weights in a decision tree as a random variable, and approximates the mean and variance of each sample in a dataset via stochastic tree ensemble update equations. These learned moments allow us to subsequently sample from a specified distribution after training. We empirically demonstrate the advantages of PGBM compared to existing state-of-the-art methods: (i) PGBM enables probabilistic estimates without compromising on point performance in a single model, (ii) PGBM learns probabilistic estimates via a single model only (and without requiring multi-parameter boosting), and thereby offers a speedup of up to several orders of magnitude over existing state-of-the-art methods on large datasets, and (iii) PGBM achieves accurate probabilistic estimates in tasks with complex differentiable loss functions, such as hierarchical time series problems, where we observed up to 10% improvement in point forecasting performance and up to 300% improvement in probabilistic forecasting performance.
引用
收藏
页码:1510 / 1520
页数:11
相关论文
共 50 条
  • [31] SVMTorch: Support vector machines for large-scale regression problems
    Collobert, R
    Bengio, S
    JOURNAL OF MACHINE LEARNING RESEARCH, 2001, 1 (02) : 143 - 160
  • [32] Deep Reinforcement Learning for Stabilization of Large-Scale Probabilistic Boolean Networks
    Moschoyiannis, Sotiris
    Chatzaroulas, Evangelos
    Sliogeris, Vytenis
    Wu, Yuhu
    IEEE TRANSACTIONS ON CONTROL OF NETWORK SYSTEMS, 2023, 10 (03): : 1412 - 1423
  • [33] Using a Probabilistic Model to Assist Merging of Large-Scale Administrative Records
    Enamorado, Ted
    Fifield, Benjamin
    Imai, Kosuke
    AMERICAN POLITICAL SCIENCE REVIEW, 2019, 113 (02) : 353 - 371
  • [34] Modeling and Probabilistic Reasoning of Population Evacuation During Large-scale Disaster
    Song, Xuan
    Zhang, Quanshi
    Sekimoto, Yoshihide
    Horanont, Teerayut
    Ueyama, Satoshi
    Shibasaki, Ryosuke
    19TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING (KDD'13), 2013, : 1231 - 1239
  • [35] Large-scale linked data integration using probabilistic reasoning and crowdsourcing
    Demartini, Gianluca
    Difallah, Djellel Eddine
    Cudre-Mauroux, Philippe
    VLDB JOURNAL, 2013, 22 (05): : 665 - 687
  • [36] Probabilistic Temporal Fusion Transformers for Large-Scale KPI Anomaly Detection
    Luo, Haoran
    Zheng, Yongkun
    Chen, Kang
    Zhao, Shuo
    IEEE ACCESS, 2024, 12 : 9123 - 9137
  • [37] Probabilistic unsupervised classification for large-scale analysis of spectral imaging data
    Paradis, Emmanuel
    INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2022, 107
  • [38] Large-scale linked data integration using probabilistic reasoning and crowdsourcing
    Gianluca Demartini
    Djellel Eddine Difallah
    Philippe Cudré-Mauroux
    The VLDB Journal, 2013, 22 : 665 - 687
  • [39] Large-scale Probabilistic Functional Modes from resting state fMRI
    Harrison, Samuel J.
    Woolrich, Mark W.
    Robinson, Emma C.
    Glasser, Matthew F.
    Beckmann, Christian F.
    Jenkinson, Mark
    Smith, Stephen M.
    NEUROIMAGE, 2015, 109 : 217 - 231
  • [40] Probabilistic Reservation Services for Large-Scale Batch-Scheduled Systems
    Nurmi, Daniel
    Wolski, Rich
    Brevik, John
    IEEE SYSTEMS JOURNAL, 2009, 3 (01): : 6 - 24