BIAS-CORRECTED QUANTILE REGRESSION FORESTS FOR HIGH-DIMENSIONAL DATA

被引:0
|
作者
Nguyen Thanh Tung [1 ,4 ]
Huang, Joshua Zhexue [1 ,2 ]
Thuy Thi Nguyen [3 ]
Khan, Imran [1 ]
机构
[1] Chinese Acad Sci, SIAT, Shenzhen Key Lab High Performance Data Min, Shenzhen 518055, Peoples R China
[2] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen 518060, Peoples R China
[3] Hanoi Univ Agr Vietnam, Hanoi, Vietnam
[4] Water Resources Univ, Hanoi, Vietnam
关键词
Bias Correction; Quantile Regression Forests; High-Dimensional Data; Random Forests; Data mining;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The Quantile Regression Forest (QRF), a nonparametric regression method based on the random forests, has been proved to perform well in terms of prediction accuracy, especially for non-Gaussian conditional distributions. However, the method may have two kinds of bias when solving regression problems: bias in the feature selection stage and bias in solving the regression problem. In this paper, we propose a new bias-correction algorithm that uses bias correction based on the QRF. To correct the first kind of bias, we propose a new scheme for feature sampling that allows to select good features for growing trees. The first level QRF is built based on this. For the second kind of bias, the residual term of the first level QRF model is used as the response feature to train the second level QRF model for bias correction. The second level model is then used to compute bias-corrected predictions. In our experiments, the proposed algorithm dramatically reduces prediction errors and outperforms most of the existing regression random forests models for both synthetic and well-known real-world data sets.
引用
收藏
页码:1 / 6
页数:6
相关论文
共 50 条
  • [21] ADMM for High-Dimensional Sparse Penalized Quantile Regression
    Gu, Yuwen
    Fan, Jun
    Kong, Lingchen
    Ma, Shiqian
    Zou, Hui
    TECHNOMETRICS, 2018, 60 (03) : 319 - 331
  • [22] Jackknife model averaging for high-dimensional quantile regression
    Wang, Miaomiao
    Zhang, Xinyu
    Wan, Alan T. K.
    You, Kang
    Zou, Guohua
    BIOMETRICS, 2023, 79 (01) : 178 - 189
  • [23] Debiasing and Distributed Estimation for High-Dimensional Quantile Regression
    Zhao, Weihua
    Zhang, Fode
    Lian, Heng
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (07) : 2569 - 2577
  • [24] SCAD-penalized quantile regression for high-dimensional data analysis and variable selection
    Amin, Muhammad
    Song, Lixin
    Thorlie, Milton Abdul
    Wang, Xiaoguang
    STATISTICA NEERLANDICA, 2015, 69 (03) : 212 - 235
  • [25] Bias-Corrected AIC for Selecting Variables in Poisson Regression Models
    Kamo, Ken-Ichi
    Yanagihara, Hirokazu
    Satoh, Kenichi
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2013, 42 (11) : 1911 - 1921
  • [26] Weighted l1-Penalized Corrected Quantile Regression for High-Dimensional Temporally Dependent Measurement Errors
    Bhattacharjee, Monika
    Chakraborty, Nilanjan
    Koul, Hira L.
    JOURNAL OF TIME SERIES ANALYSIS, 2023, 44 (5-6) : 442 - 473
  • [27] Bias-Corrected Bootstrap Inference for Regression Models with Autocorrelated Errors
    Kim, Jae
    ECONOMICS BULLETIN, 2005, 3
  • [28] Bias-corrected estimation for GI0 regression with applications
    Sousa, M. F. S. S.
    Vasconcelos, J. M.
    Nascimento, A. D. C.
    ASTA-ADVANCES IN STATISTICAL ANALYSIS, 2025,
  • [29] Communication-efficient estimation of high-dimensional quantile regression
    Wang, Lei
    Lian, Heng
    ANALYSIS AND APPLICATIONS, 2020, 18 (06) : 1057 - 1075
  • [30] Distributed high-dimensional regression under a quantile loss function
    Chen, Xi
    Liu, Weidong
    Mao, Xiaojun
    Yang, Zhuoyi
    Liu, Weidong (weidongl@sjtu.edu.cn); Mao, Xiaojun (maoxj@fudan.edu.cn), 1600, Microtome Publishing (21):