Small-Sample Error Estimation for Bagged Classification Rules

被引:1
|
作者
Vu, T. T. [1 ]
Braga-Neto, U. M. [1 ]
机构
[1] Texas A&M Univ, Dept Elect & Comp Engn, College Stn, TX 77843 USA
来源
EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING | 2010年
基金
美国国家科学基金会;
关键词
CROSS-VALIDATION; MASS-SPECTRA; CANCER; PREDICTION; PROTEOMICS; ALGORITHM;
D O I
10.1155/2010/548906
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Application of ensemble classification rules in genomics and proteomics has become increasingly common. However, the problem of error estimation for these classification rules, particularly for bagging under the small-sample settings prevalent in genomics and proteomics, is not well understood. Breiman proposed the "out-of-bag" method for estimating statistics of bagged classifiers, which was subsequently applied by other authors to estimate the classification error. In this paper, we give an explicit definition of the out-of-bag estimator that is intended to remove estimator bias, by formulating carefully how the error count is normalized. We also report the results of an extensive simulation study of bagging of common classification rules, including LDA, 3NN, and CART, applied on both synthetic and real patient data, corresponding to the use of common error estimators such as resubstitution, leave-one-out, cross-validation, basic bootstrap, bootstrap 632, bootstrap 632 plus, bolstering, semi-bolstering, in addition to the out-of-bag estimator. The results from the numerical experiments indicated that the performance of the out-of-bag estimator is very similar to that of leave-one-out; in particular, the out-of-bag estimator is slightly pessimistically biased. The performance of the other estimators is consistent with their performance with the corresponding single classifiers, as reported in other studies.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Small-Sample Error Estimation for Bagged Classification Rules
    T. T. Vu
    U. M. Braga-Neto
    EURASIP Journal on Advances in Signal Processing, 2010
  • [2] Corrected small-sample estimation of the Bayes error
    Brun, M
    Sabbagh, D
    Kim, S
    Dougherty, ER
    BIOINFORMATICS, 2003, 19 (08) : 944 - 951
  • [3] ON SMALL-SAMPLE ESTIMATION
    BROWN, GW
    ANNALS OF MATHEMATICAL STATISTICS, 1947, 18 (04): : 582 - 585
  • [4] COMPARING THE SMALL-SAMPLE ESTIMATION ERROR OF CONCEPTUALLY DIFFERENT RISK MEASURES
    Auer, Benjamin R.
    Schuhmacher, Frank
    INTERNATIONAL JOURNAL OF THEORETICAL AND APPLIED FINANCE, 2021, 24 (05)
  • [5] IMPROVED BOUNDS FOR SMALL-SAMPLE ESTIMATION
    Gratton, Serge
    Titley-Peloquin, David
    SIAM JOURNAL ON MATRIX ANALYSIS AND APPLICATIONS, 2018, 39 (02) : 922 - 931
  • [6] Small-sample Bayesian error estimation for ergodic, chaotic systems of ordinary differential equations
    Frontin, Cory
    Darmofal, David L.
    JOURNAL OF COMPUTATIONAL PHYSICS, 2025, 521
  • [7] Scientific knowledge is possible with small-sample classification
    Dougherty, Edward R.
    Dalton, Lori A.
    EURASIP JOURNAL ON BIOINFORMATICS AND SYSTEMS BIOLOGY, 2013, Springer Verlag (01):
  • [8] A NOTE ON SMALL-SAMPLE MAXIMUM PROBABILITY ESTIMATION
    WEISS, L
    STATISTICS & PROBABILITY LETTERS, 1986, 4 (03) : 109 - 111
  • [9] Deep InterBoost networks for small-sample image classification
    Li, Xiaoxu
    Chang, Dongliang
    Ma, Zhanyu
    Tan, Zheng-Hua
    Xue, Jing-Hao
    Cao, Jie
    Guo, Jun
    NEUROCOMPUTING, 2021, 456 : 492 - 503
  • [10] Fads and fallacies in the name of small-sample microarray classification
    Braga-Neto, Ulisses
    IEEE SIGNAL PROCESSING MAGAZINE, 2007, 24 (01) : 91 - 99