MINIMAL CONSISTENT SET (MCS) IDENTIFICATION FOR OPTIMAL NEAREST-NEIGHBOR DECISION SYSTEMS-DESIGN

Cited by: 134
Author
DASARATHY, BV
Affiliation
[1] Dynetics, Inc., Huntsville, AL
Source
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS | 1994 / Vol. 24 / No. 3
DOI
10.1109/21.278999
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
This study presents a new approach to the high computational demands of nearest neighbor (NN) based decision systems. The approach, based on the concept of optimal subset selection from a given training data set, derives a consistent subset that is intended to be minimal in size. This minimal consistent subset (MCS) selection, in contrast to most previous attempts of this nature, leads to a unique solution irrespective of the initial order of presentation of the data. Further, the consistency property is assured at every iteration. Also, unlike most prior approaches, the samples are selected here in order of the significance of their contribution to enabling the consistency property. This provides insight into the relative significance of the samples in the training set. Experimental results based on a number of independent training and test data sets are presented and discussed to illustrate the methodology and highlight its benefits. These results show that the performance of the nearest neighbor decision system suffers little degradation when the given large training set is replaced by its much smaller MCS in the operational phase of testing with an independent test set. A direct experimental comparison with a prior approach is also furnished to further strengthen the case for the new methodology.
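The consistency property mentioned in the abstract can be illustrated with a short sketch. It assumes the usual definition, under which a subset is consistent when a 1-NN rule that uses only that subset still classifies every sample of the full training set correctly. This is not the paper's MCS selection procedure, which the abstract does not spell out. The function names, the Euclidean metric, and the toy data are all hypothetical.

```python
from math import dist  # Euclidean distance, Python 3.8+


def nearest_label(point, prototypes):
    """Return the label of the prototype closest to `point` (1-NN rule)."""
    _, label = min(prototypes, key=lambda p: dist(point, p[0]))
    return label


def is_consistent(subset, training_set):
    """True if 1-NN over `subset` labels every training sample correctly."""
    return all(nearest_label(x, subset) == y for x, y in training_set)


# Hypothetical toy data: two well-separated classes in the plane.
training_set = [
    ((0.0, 0.0), "a"), ((0.2, 0.1), "a"), ((0.1, 0.3), "a"),
    ((1.0, 1.0), "b"), ((0.9, 1.2), "b"), ((1.1, 0.8), "b"),
]

# One prototype per class is enough to reproduce all labels here.
subset = [training_set[0], training_set[3]]
print(is_consistent(subset, training_set))
```

A check of this kind only verifies consistency. Finding a subset that is also minimal, and doing so independently of the order of presentation, is the problem the paper addresses.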
Pages: 511-517
Number of pages: 7