A concrete statistical realization of Kleinberg's stochastic discrimination for pattern recognition. part I. Two-class classification

被引:5
作者
Chen, DC [1 ]
Huang, P
Cheng, XZ
机构
[1] Uniformed Serv Univ Hlth Sci, Dept Prevent Med & Biometr, Bethesda, MD 20814 USA
[2] Med Univ S Carolina, Dept Biometry & Epidemiol, Charleston, SC 29425 USA
[3] George Washington Univ, Washington, DC 20052 USA
关键词
discriminant function; accuracy; training set; test set;
D O I
10.1214/aos/1065705112
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
The method of stochastic discrimination (SD) introduced by Kleinberg is a new method in statistical pattern recognition. It works by producing many weak classifiers and then combining them to form a strong classifier. However, the strict mathematical assumptions in Kleinberg [The Annals of Statistics 24 (1996) 2319-2349] are rarely met in practice. This paper provides an applicable way to realize the SD algorithm. We recast SD in a probability-space framework and present a concrete statistical realization of SD for two-class pattern recognition. We weaken Kleinberg's theoretically strict assumptions of uniformity and indiscernibility by introducing near uniformity and weak indiscernibility. Such weaker notions are easily encountered in practical applications. We present a systematic resampling method to produce weak classifiers and then establish corresponding classification rules of SD. We analyze the performance of SD theoretically and explain why SD is overtraining-resistant and why SD has a high convergence rate. Testing results on real and simulated data sets are also given.
引用
收藏
页码:1393 / 1412
页数:20
相关论文
共 23 条
[1]  
[Anonymous], P 3 INT C IEEE
[2]  
[Anonymous], P 13 INT C PATT REC
[3]  
BERLIND R, 1994, THESIS STATE U NEW Y
[4]   Bagging predictors [J].
Breiman, L .
MACHINE LEARNING, 1996, 24 (02) :123-140
[5]  
CHEN D, 1998, THESIS STATE U NEW Y
[6]  
Duda R O, 2001, PATTER CLASSIFICATIO
[7]   The statistical utilization of multiple measurements [J].
Fisher, RA .
ANNALS OF EUGENICS, 1938, 8 :376-386
[8]   A decision-theoretic generalization of on-line learning and an application to boosting [J].
Freund, Y ;
Schapire, RE .
JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 1997, 55 (01) :119-139
[9]  
Freund Y, 1996, Experiments with a new boosting algorithm. In proceedings 13th Int Conf Mach learn. Pp.148-156, P45
[10]   Additive logistic regression: A statistical view of boosting - Rejoinder [J].
Friedman, J ;
Hastie, T ;
Tibshirani, R .
ANNALS OF STATISTICS, 2000, 28 (02) :400-407