Input dependent prediction intervals for supervised regression

Citations: 3
Authors
Pevec, Darko [1]
Kononenko, Igor [1]
Affiliations
[1] Univ Ljubljana, Fac Comp & Informat Sci, Ljubljana 1000, Slovenia
Keywords
Prediction intervals; regression; model validation; data and knowledge visualization; methodologies and tools; RELIABILITY;
DOI
10.3233/IDA-140673
Chinese Library Classification
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
In this article we compare and test two families of non-parametric approaches to constructing prediction intervals for arbitrary regression models in the supervised learning framework. The errors are often assumed to be independent and identically distributed, but we focus on the general case in which the errors may be input dependent. The first family of approaches explains the total prediction error as the sum of the model's error and the error caused by noise inherent in the data, and estimates the two independently. The second family is based on the assumption of similarity of the data; these approaches estimate the prediction intervals of the target regression variable from a sample's nearest neighbors. Results on a large set of artificial and real-world datasets show that one method from the second family is superior to the other methods. Approaches from the first family always form valid, yet not necessarily confirmatory prediction intervals, whereas approaches from the second family prove to be more time efficient.
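The nearest-neighbor idea behind the second family can be illustrated with a minimal sketch. It is not the authors' method: the helper name `knn_prediction_interval`, the Euclidean distance, the choice of `k`, and the use of empirical quantiles of the neighbors' targets are all assumptions made for illustration. The synthetic data has noise that grows with the input, so the interval should widen accordingly.

```python
import numpy as np

def knn_prediction_interval(X_train, y_train, x_query, k=20, alpha=0.1):
    """Hypothetical helper (not from the paper): a (1 - alpha) interval taken
    from the empirical quantiles of the targets of the k nearest neighbors."""
    X_train = np.asarray(X_train, dtype=float)
    y_train = np.asarray(y_train, dtype=float)
    x_query = np.asarray(x_query, dtype=float)
    # Euclidean distance from the query to every training sample (assumed metric)
    dists = np.linalg.norm(X_train - x_query, axis=1)
    # Indices of the k closest training samples
    idx = np.argsort(dists)[:k]
    # Lower and upper empirical quantiles of the neighbors' target values
    lower, upper = np.quantile(y_train[idx], [alpha / 2, 1 - alpha / 2])
    return lower, upper

# Synthetic data with input-dependent noise: the noise scale grows with x
rng = np.random.default_rng(0)
X = rng.uniform(0, 1, size=(2000, 1))
y = np.sin(2 * np.pi * X[:, 0]) + rng.normal(0, 0.05 + 0.5 * X[:, 0])

lo_a, hi_a = knn_prediction_interval(X, y, [0.1], k=100)  # low-noise region
lo_b, hi_b = knn_prediction_interval(X, y, [0.9], k=100)  # high-noise region
width_low_noise = hi_a - lo_a
width_high_noise = hi_b - lo_b
print(width_low_noise, width_high_noise)
```

Because the interval is built only from local samples, its width follows the local noise level without assuming identically distributed errors. The price is sensitivity to `k` and to the distance metric, both of which would need tuning on real data.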
Pages: 873-887
Page count: 15