Two-level quantile regression forests for bias correction in range prediction

被引:0
|
作者
Thanh-Tung Nguyen
Joshua Z. Huang
Thuy Thi Nguyen
机构
[1] Shenzhen Institutes of Advanced Technology,Shenzhen Key Laboratory of High Performance Data Mining
[2] Chinese Academy of Sciences,College of Computer Science and Software Engineering
[3] School of Computer Science and Engineering,undefined
[4] Water Resources University,undefined
[5] University of Chinese Academy of Sciences,undefined
[6] Shenzhen University,undefined
[7] Faculty of Information Technology,undefined
[8] Vietnam National University of Agriculture,undefined
来源
Machine Learning | 2015年 / 101卷
关键词
Bias correction; Random forests; Quantile regression forests; High dimensional data; Data mining;
D O I
暂无
中图分类号
学科分类号
摘要
Quantile regression forests (QRF), a tree-based ensemble method for estimation of conditional quantiles, has been proven to perform well in terms of prediction accuracy, especially for range prediction. However, the model may have bias and suffer from working with high dimensional data (thousands of features). In this paper, we propose a new bias correction method, called bcQRF that uses bias correction in QRF for range prediction. In bcQRF, a new feature weighting subspace sampling method is used to build the first level QRF model. The residual term of the first level QRF model is then used as the response feature to train the second level QRF model for bias correction. The two-level models are used to compute bias-corrected predictions. Extensive experiments on both synthetic and real world data sets have demonstrated that the bcQRF method significantly reduced prediction errors and outperformed most existing regression random forests. The new method performed especially well on high dimensional data.
引用
收藏
页码:325 / 343
页数:18
相关论文
共 50 条
  • [1] Two-level quantile regression forests for bias correction in range prediction
    Thanh-Tung Nguyen
    Huang, Joshua Z.
    Thuy Thi Nguyen
    MACHINE LEARNING, 2015, 101 (1-3) : 325 - 343
  • [2] Prediction of heat waves in Pakistan using quantile regression forests
    Khan, Najeebullah
    Shahid, Shamsuddin
    Juneng, Liew
    Ahmed, Kamal
    Ismail, Tarmizi
    Nawaz, Nadeem
    ATMOSPHERIC RESEARCH, 2019, 221 : 1 - 11
  • [3] Two-level preconditioning for Ridge Regression
    Tavernier, Joris
    Simm, Jaak
    Meerbergen, Karl
    Moreau, Yves
    NUMERICAL LINEAR ALGEBRA WITH APPLICATIONS, 2021, 28 (04)
  • [4] Covariance Estimation in Two-Level Regression
    Moehle, Nicholas
    Gorinevsky, Dimitry
    2013 2ND INTERNATIONAL CONFERENCE ON CONTROL AND FAULT-TOLERANT SYSTEMS (SYSTOL), 2013, : 288 - 293
  • [5] BIAS-CORRECTED QUANTILE REGRESSION FORESTS FOR HIGH-DIMENSIONAL DATA
    Nguyen Thanh Tung
    Huang, Joshua Zhexue
    Thuy Thi Nguyen
    Khan, Imran
    PROCEEDINGS OF 2014 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS (ICMLC), VOL 1, 2014, : 1 - 6
  • [6] STABILITY OF CERTAIN TWO-LEVEL ALGORITHMS FOR LONG-RANGE PREDICTION.
    Krotov, G.I.
    Soviet automatic control, 1981, 14 (05): : 72 - 76
  • [7] Better nonparametric confidence intervals via robust bias correction for quantile regression
    Guo, Shaojun
    Han, Yu
    Wang, Qingsong
    STAT, 2021, 10 (01):
  • [8] Prediction of extremal precipitation by quantile regression forests: from SNU Multiscale Team
    Park, Seoncheol
    Kwon, Junhyeon
    Kim, Joonpyo
    Oh, Hee-Seok
    EXTREMES, 2018, 21 (03) : 463 - 476
  • [9] Prediction of extremal precipitation by quantile regression forests: from SNU Multiscale Team
    Seoncheol Park
    Junhyeon Kwon
    Joonpyo Kim
    Hee-Seok Oh
    Extremes, 2018, 21 : 463 - 476
  • [10] THEORY OF TWO-LEVEL GMDH ALGORITHMS FOR LONG-RANGE QUANTITATIVE PREDICTION.
    Ivakhnenko, A.G.
    Kocherga, Yu.L.
    Soviet automatic control, 1983, 16 (06): : 7 - 12