Small-Sample Error Estimation for Bagged Classification Rules

被引：1

作者：

Vu, T. T. ^{[1
]}

Braga-Neto, U. M. ^{[1
]}

机构：

[1] Texas A&M Univ, Dept Elect & Comp Engn, College Stn, TX 77843 USA

来源：

EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING | 2010年

基金：

美国国家科学基金会;

关键词：

CROSS-VALIDATION; MASS-SPECTRA; CANCER; PREDICTION; PROTEOMICS; ALGORITHM;

D O I：

10.1155/2010/548906

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Application of ensemble classification rules in genomics and proteomics has become increasingly common. However, the problem of error estimation for these classification rules, particularly for bagging under the small-sample settings prevalent in genomics and proteomics, is not well understood. Breiman proposed the "out-of-bag" method for estimating statistics of bagged classifiers, which was subsequently applied by other authors to estimate the classification error. In this paper, we give an explicit definition of the out-of-bag estimator that is intended to remove estimator bias, by formulating carefully how the error count is normalized. We also report the results of an extensive simulation study of bagging of common classification rules, including LDA, 3NN, and CART, applied on both synthetic and real patient data, corresponding to the use of common error estimators such as resubstitution, leave-one-out, cross-validation, basic bootstrap, bootstrap 632, bootstrap 632 plus, bolstering, semi-bolstering, in addition to the out-of-bag estimator. The results from the numerical experiments indicated that the performance of the out-of-bag estimator is very similar to that of leave-one-out; in particular, the out-of-bag estimator is slightly pessimistically biased. The performance of the other estimators is consistent with their performance with the corresponding single classifiers, as reported in other studies.

引用

页数：12

共 50 条

[1] Small-Sample Error Estimation for Bagged Classification Rules
T. T. Vu
U. M. Braga-Neto
EURASIP Journal on Advances in Signal Processing, 2010
[2] Corrected small-sample estimation of the Bayes error
Brun, M
Sabbagh, D
Kim, S
Dougherty, ER
BIOINFORMATICS, 2003, 19 (08) : 944 - 951
[3] ON SMALL-SAMPLE ESTIMATION
BROWN, GW
ANNALS OF MATHEMATICAL STATISTICS, 1947, 18 (04): : 582 - 585
[4] COMPARING THE SMALL-SAMPLE ESTIMATION ERROR OF CONCEPTUALLY DIFFERENT RISK MEASURES
Auer, Benjamin R.
Schuhmacher, Frank
INTERNATIONAL JOURNAL OF THEORETICAL AND APPLIED FINANCE, 2021, 24 (05)
[5] IMPROVED BOUNDS FOR SMALL-SAMPLE ESTIMATION
Gratton, Serge
Titley-Peloquin, David
SIAM JOURNAL ON MATRIX ANALYSIS AND APPLICATIONS, 2018, 39 (02) : 922 - 931
[6] Small-sample Bayesian error estimation for ergodic, chaotic systems of ordinary differential equations
Frontin, Cory
Darmofal, David L.
JOURNAL OF COMPUTATIONAL PHYSICS, 2025, 521
[7] Scientific knowledge is possible with small-sample classification
Dougherty, Edward R.
Dalton, Lori A.
EURASIP JOURNAL ON BIOINFORMATICS AND SYSTEMS BIOLOGY, 2013, Springer Verlag (01):
[8] A NOTE ON SMALL-SAMPLE MAXIMUM PROBABILITY ESTIMATION
WEISS, L
STATISTICS & PROBABILITY LETTERS, 1986, 4 (03) : 109 - 111
[9] Deep InterBoost networks for small-sample image classification
Li, Xiaoxu
Chang, Dongliang
Ma, Zhanyu
Tan, Zheng-Hua
Xue, Jing-Hao
Cao, Jie
Guo, Jun
NEUROCOMPUTING, 2021, 456 : 492 - 503
[10] Fads and fallacies in the name of small-sample microarray classification
Braga-Neto, Ulisses
IEEE SIGNAL PROCESSING MAGAZINE, 2007, 24 (01) : 91 - 99

← 1 2 3 4 5 →