Selective sampling for nearest neighbor classifiers

被引:119
作者
Lindenbaum, M [1 ]
Markovitch, S [1 ]
Rusakov, D [1 ]
机构
[1] Technion Israel Inst Technol, Dept Comp Sci, IL-32000 Haifa, Israel
关键词
active learning; selective sampling; nearest neighbor; random field;
D O I
10.1023/B:MACH.0000011805.60520.fe
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most existing inductive learning algorithms work under the assumption that their training examples are already tagged. There are domains, however, where the tagging procedure requires significant computation resources or manual labor. In such cases, it may be beneficial for the learner to be active, intelligently selecting the examples for labeling with the goal of reducing the labeling cost. In this paper we present LSS-a lookahead algorithm for selective sampling of examples for nearest neighbor classifiers. The algorithm is looking for the example with the highest utility, taking its effect on the resulting classifier into account. Computing the expected utility of an example requires estimating the probability of its possible labels. We propose to use the random field model for this estimation. The LSS algorithm was evaluated empirically on seven real and artificial data sets, and its performance was compared to other selective sampling algorithms. The experiments show that the proposed algorithm outperforms other methods in terms of average error rate and stability.
引用
收藏
页码:125 / 152
页数:28
相关论文
共 34 条
[1]  
Adler R. J., 1981, GEOMETRY RANDOM FIEL
[2]  
AHA DW, 1991, MACH LEARN, V6, P37, DOI 10.1007/BF00153759
[3]  
Angluin D., 1988, Machine Learning, V2, P319, DOI 10.1007/BF00116828
[4]  
[Anonymous], P 15 INT C MACH LEAR
[5]  
Blake C.L., 1998, UCI repository of machine learning databases
[6]   IMPROVING GENERALIZATION WITH ACTIVE LEARNING [J].
COHN, D ;
ATLAS, L ;
LADNER, R .
MACHINE LEARNING, 1994, 15 (02) :201-221
[7]  
Cohn D. A., 1995, Advances in Neural Information Processing Systems 7, P705
[8]   NEAREST NEIGHBOR PATTERN CLASSIFICATION [J].
COVER, TM ;
HART, PE .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1967, 13 (01) :21-+
[9]  
Dagan I., 1995, P 12 INT C MACH LEAR, P150, DOI [10.1016/B978-1-55860-377-6.50027-X, DOI 10.1016/B978-1-55860-377-6.50027-X]
[10]  
DAVIS DT, 1992, P IJCNN, V1, P676