Theoretical and experimental analysis of a two-stage system for classification

被引:25
作者
Giusti, N
Masulli, F
Sperduti, A
机构
[1] Micronix Comp, I-51016 Montecatini Terme, PT, Italy
[2] Univ Pisa, Dipartimento Informat, I-56125 Pisa, Italy
关键词
multicategory classification; rejection; global and local classification; hierarchical classifier; Bayes classifier;
D O I
10.1109/TPAMI.2002.1017617
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We consider a popular approach to multicategory classification tasks: a two-stage system based on a first (global) classifier with rejection followed by a (local) nearest-neighbor classifier. Patterns which are not rejected by the first classifier are classified according to its output. Rejected patterns are passed to the nearest-neighbor classifier together with the top-it ranking classes returned by the first classifier. The nearest-neighbor classifier, looking at patterns in the top-h classes, classifies the rejected pattern. An editing strategy for the nearest-neighbor reference database, controlled by the first classifier, is also considered. We analyze this system, showing that even if the first level and nearest-neighbor classifiers are not optimal in a Bayes sense, the system as a whole may be optimal. Moreover, we formally relate the response time of the system to the rejection rate of the first classifier and to the other system parameters. The error-response time trade-off is also discussed. Finally, we experimentally study two instances of the system applied to the recognition of handwritten digits. In one system, the first classifier is a fuzzy basis functions network, while in the second system it is a feed-forward neural network. Classification results as well as response times for different settings of the system parameters are reported for both systems.
引用
收藏
页码:893 / 904
页数:12
相关论文
共 35 条
[1]  
ALFONSO D, 1996, P INT ICSC S IND INT, P2
[2]   LOCAL LEARNING ALGORITHMS [J].
BOTTOU, L ;
VAPNIK, V .
NEURAL COMPUTATION, 1992, 4 (06) :888-900
[3]  
Casalino F, 1998, INTELL AUTOM SOFT CO, V4, P73
[4]  
CHOW CK, 1970, IEEE T INFORM THEORY, V16, P41, DOI 10.1109/TIT.1970.1054406
[5]  
Furlanello C., 1997, Connection Science, V9, P31, DOI 10.1080/095400997116720
[6]  
GARRIS MD, 1992, NIST SPECIAL DATABAS, V3
[7]  
GUTTA S, 1997, P INT C NEUR NETW, V3, P1353
[8]  
Hart P.E., 1973, Pattern recognition and scene analysis
[9]  
Hashem S., 1996, Connection Science, V8, P315, DOI 10.1080/095400996116794
[10]  
Jimenez D, 1998, IEEE WORLD CONGRESS ON COMPUTATIONAL INTELLIGENCE, P753, DOI 10.1109/IJCNN.1998.682375