BIAS-CORRECTED QUANTILE REGRESSION FORESTS FOR HIGH-DIMENSIONAL DATA

被引:0
|
作者
Nguyen Thanh Tung [1 ,4 ]
Huang, Joshua Zhexue [1 ,2 ]
Thuy Thi Nguyen [3 ]
Khan, Imran [1 ]
机构
[1] Chinese Acad Sci, SIAT, Shenzhen Key Lab High Performance Data Min, Shenzhen 518055, Peoples R China
[2] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen 518060, Peoples R China
[3] Hanoi Univ Agr Vietnam, Hanoi, Vietnam
[4] Water Resources Univ, Hanoi, Vietnam
关键词
Bias Correction; Quantile Regression Forests; High-Dimensional Data; Random Forests; Data mining;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The Quantile Regression Forest (QRF), a nonparametric regression method based on the random forests, has been proved to perform well in terms of prediction accuracy, especially for non-Gaussian conditional distributions. However, the method may have two kinds of bias when solving regression problems: bias in the feature selection stage and bias in solving the regression problem. In this paper, we propose a new bias-correction algorithm that uses bias correction based on the QRF. To correct the first kind of bias, we propose a new scheme for feature sampling that allows to select good features for growing trees. The first level QRF is built based on this. For the second kind of bias, the residual term of the first level QRF model is used as the response feature to train the second level QRF model for bias correction. The second level model is then used to compute bias-corrected predictions. In our experiments, the proposed algorithm dramatically reduces prediction errors and outperforms most of the existing regression random forests models for both synthetic and well-known real-world data sets.
引用
收藏
页码:1 / 6
页数:6
相关论文
共 50 条
  • [1] Bias-corrected random forests in regression
    Zhang, Guoyi
    Lu, Yan
    JOURNAL OF APPLIED STATISTICS, 2012, 39 (01) : 151 - 160
  • [2] Bias-corrected quantile regression estimation of censored regression models
    Cizek, P.
    Sadikoglu, S.
    STATISTICAL PAPERS, 2018, 59 (01) : 215 - 247
  • [3] Bias-corrected quantile regression estimation of censored regression models
    P. Čížek
    S. Sadikoglu
    Statistical Papers, 2018, 59 : 215 - 247
  • [4] Bias-Corrected Diagonal Discriminant Rules for High-Dimensional Classification
    Huang, Song
    Tong, Tiejun
    Zhao, Hongyu
    BIOMETRICS, 2010, 66 (04) : 1096 - 1106
  • [5] Bias-Corrected Inference of High-Dimensional Generalized Linear Models
    Tang, Shengfei
    Shi, Yanmei
    Zhang, Qi
    MATHEMATICS, 2023, 11 (04)
  • [6] Unconditional quantile regression with high-dimensional data
    Sasaki, Yuya
    Ura, Takuya
    Zhang, Yichong
    QUANTITATIVE ECONOMICS, 2022, 13 (03) : 955 - 978
  • [7] Bias-Corrected Hierarchical Bayesian Classification With a Selected Subset of High-Dimensional Features
    Li, Longhai
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2012, 107 (497) : 120 - 134
  • [8] A high-dimensional bias-corrected AIC for selecting response variables in multivariate calibration
    Oda, Ryoya
    Mima, Yoshie
    Yanagihara, Hirokazu
    Fujikoshi, Yasunori
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2021, 50 (14) : 3453 - 3476
  • [9] Quantile forward regression for high-dimensional survival data
    Lee, Eun Ryung
    Park, Seyoung
    Lee, Sang Kyu
    Hong, Hyokyoung G.
    LIFETIME DATA ANALYSIS, 2023, 29 (04) : 769 - 806
  • [10] Quantile forward regression for high-dimensional survival data
    Eun Ryung Lee
    Seyoung Park
    Sang Kyu Lee
    Hyokyoung G. Hong
    Lifetime Data Analysis, 2023, 29 : 769 - 806