The reliability of estimated confidence intervals for classification error rates when only a single sample is available

被引:6
作者
Hanczar, Blaise [1 ]
Dougherty, Edward R. [2 ,3 ]
机构
[1] Univ Paris 05, LIPADE, F-75006 Paris, France
[2] Texas A&M Univ, Dept Elect & Comp Engn, College Stn, TX USA
[3] Translat Genom Res Inst, Computat Biol Div, Phoenix, AZ USA
关键词
Supervised learning; Error estimation; High dimension; Small sample setting; Confidence interval; CROSS-VALIDATION; MICROARRAY CLASSIFICATION; PREDICTION ERROR; PERFORMANCE; CLASSIFIERS; INFERENCE; CANCER;
D O I
10.1016/j.patcog.2012.09.019
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Error estimation accuracy is the salient issue regarding the validity of a classifier model. When samples are small, training-data-based error estimates tend to suffer from inaccuracy and quantification of error estimation accuracy is difficult. Numerous methods have been proposed for estimating confidence intervals for the true error based on the estimated error. This paper surveys proposed methods and quantifies their performance. Monte Carlo methods are used to obtain accurate estimates of the true confidence intervals and compare these to the intervals estimated from samples. We consider different error estimators and several proposed confidence-bound estimators. Both synthetic and real genomic data are employed. Our simulations show the majority of the confidence intervals methods have poor performance because of the difference of shape between true and estimated intervals. According to our results, the best estimation strategy is to use the 10-time 10-fold cross-validation with a confidence interval based on the standard deviation. (C) 2012 Elsevier Ltd. All rights reserved.
引用
收藏
页码:1067 / 1077
页数:11
相关论文
共 30 条
  • [1] [Anonymous], IJCAI
  • [2] BERNARDO JM, 1979, J R STAT SOC B, V41, P113
  • [3] Classification of human lung carcinomas by mRNA expression profiling reveals distinct adenocarcinoma subclasses
    Bhattacharjee, A
    Richards, WG
    Staunton, J
    Li, C
    Monti, S
    Vasa, P
    Ladd, C
    Beheshti, J
    Bueno, R
    Gillette, M
    Loda, M
    Weber, G
    Mark, EJ
    Lander, ES
    Wong, W
    Johnson, BE
    Golub, TR
    Sugarbaker, DJ
    Meyerson, M
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2001, 98 (24) : 13790 - 13795
  • [4] Exact performance of error estimators for discrete classifiers
    Braga-Neto, U
    Dougherty, E
    [J]. PATTERN RECOGNITION, 2005, 38 (11) : 1799 - 1814
  • [5] Exact correlation between actual and estimated errors in discrete classification
    Braga-Neto, Ulisses M.
    Dougherty, Edward R.
    [J]. PATTERN RECOGNITION LETTERS, 2010, 31 (05) : 407 - 412
  • [6] Is cross-validation valid for small-sample microarray classification?
    Braga-Neto, UM
    Dougherty, ER
    [J]. BIOINFORMATICS, 2004, 20 (03) : 374 - 380
  • [7] Exact Sample Conditioned MSE Performance of the Bayesian MMSE Estimator for Classification Error-Part I: Representation
    Dalton, Lori A.
    Dougherty, Edward R.
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2012, 60 (05) : 2575 - 2587
  • [8] Performance of Error Estimators for Classification
    Dougherty, Edward R.
    Sima, Chao
    Hua, Jianping
    Hanczar, Blaise
    Braga-Neto, Ulisses M.
    [J]. CURRENT BIOINFORMATICS, 2010, 5 (01) : 53 - 67
  • [9] Epistemology of computational biology: Mathematical models and experimental prediction as the basis of their validity
    Dougherty, ER
    Braga-Neto, U
    [J]. JOURNAL OF BIOLOGICAL SYSTEMS, 2006, 14 (01) : 65 - 90
  • [10] Improvements on cross-validation: The .632+ bootstrap method
    Efron, B
    Tibshirani, R
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1997, 92 (438) : 548 - 560