Voting over multiple condensed nearest neighbors

被引:59
作者
Alpaydin, E
机构
[1] Boğaziçi University,Department of Computer Engineering
关键词
lazy learning; nonparametric estimation; k-nearest neighbor; condensed nearest neighbor; voting;
D O I
10.1023/A:1006563312922
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Lazy learning methods like the Ic-nearest neighbor classifier require storing the whole training set and may be too costly when this set is large. The condensed nearest neighbor classifier incrementally stores a subset of the sample, thus decreasing storage and computation requirements. We propose to train multiple such subsets and take a vote over them, thus combining predictions from a set of concept descriptions. We investigate two voting schemes: simple voting where voters have equal weight and weighted voting where weights depend on classifiers' confidences in their predictions. We consider ways to form such subsets for improved performance: When the training set is small, voting improves performance considerably. If the training set is not small, then voters converge to similar solutions and we do not gain anything by voting. To alleviate this, when the training set is of intermediate size, we use bootstrapping to generate smaller training sets over which we train the voters, When the training set is large, we partition it into smaller, mutually exclusive subsets and then train the voters. Simulation results on six datasets are reported with good results. We give a review of methods for combining multiple learners. The idea of taking a vote over multiple learners can be applied with any type of learning scheme.
引用
收藏
页码:115 / 132
页数:18
相关论文
共 36 条
  • [1] AHA DW, 1991, MACH LEARN, V6, P37, DOI 10.1007/BF00153759
  • [2] COMPARISON OF KERNEL ESTIMATORS, PERCEPTRONS AND RADIAL-BASIS FUNCTIONS FOR OCR AND SPEECH CLASSIFICATION
    ALPAYDIN, E
    GURGEN, F
    [J]. NEURAL COMPUTING & APPLICATIONS, 1995, 3 (01) : 38 - 49
  • [3] ALPAYDIN E, 1991, TR91032 INT COMP SCI
  • [4] ALPAYDIN E, 1990, THESIS ECOLE POLYTEC
  • [5] ALPAYDIN E, 1993, INT C NEUR NETW CA U, V1, P9
  • [6] CONSENSUS THEORETIC CLASSIFICATION METHODS
    BENEDIKTSSON, JA
    SWAIN, PH
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1992, 22 (04): : 688 - 704
  • [7] BREIMAN L, 1992, TR367 U CAL DEP STAT
  • [8] Cherkassky V, 1997, IEEE Trans Neural Netw, V8, P1564, DOI 10.1109/TNN.1997.641482
  • [9] Drucker H., 1993, ADV NEURAL INFORM PR, P42
  • [10] DUDA R. O., 2000, Pattern Classification and Scene Analysis