Reduction techniques for instance-based learning algorithms

被引:904
|
作者
Wilson, DR [1 ]
Martinez, TR [1 ]
机构
[1] Brigham Young Univ, Dept Comp Sci, Neural Network & Machine Learning Lab, Provo, UT 84602 USA
关键词
instance-based learning; nearest neighbor; instance reduction; pruning; classification;
D O I
10.1023/A:1007626913721
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Instance-based learning algorithms are often faced with the problem of deciding which instances to store for use during generalization. Storing too many instances can result in large memory requirements and slow execution speed, and can cause an oversensitivity to noise. This paper has two main purposes. First, it provides a survey of existing algorithms used to reduce storage requirements in instance-based learning algorithms and other exemplar-based algorithms. Second, it proposes six additional reduction algorithms called DROP1-DROP5 and DEL (three of which were first described in Wilson & Martinez, 1997c, as RT1-RT3) that can be used to remove instances from the concept description. These algorithms and 10 algorithms from the survey are compared on 31 classification tasks. Of those algorithms that provide substantial storage reduction, the DROP algorithms have the highest average generalization accuracy in these experiments, especially in the presence of uniform class noise.
引用
收藏
页码:257 / 286
页数:30
相关论文
共 50 条
  • [1] Reduction Techniques for Instance-Based Learning Algorithms
    D. Randall Wilson
    Tony R. Martinez
    Machine Learning, 2000, 38 : 257 - 286
  • [2] INSTANCE-BASED LEARNING ALGORITHMS
    AHA, DW
    KIBLER, D
    ALBERT, MK
    MACHINE LEARNING, 1991, 6 (01) : 37 - 66
  • [3] Advances in Instance Selection for Instance-Based Learning Algorithms
    Henry Brighton
    Chris Mellish
    Data Mining and Knowledge Discovery, 2002, 6 : 153 - 172
  • [4] Advances in instance selection for instance-based learning algorithms
    Brighton, H
    Mellish, C
    DATA MINING AND KNOWLEDGE DISCOVERY, 2002, 6 (02) : 153 - 172
  • [5] Reduction Technique for Instance-based Learning Using Distributed Genetic Algorithms
    Al-Ramadin, Tahseen A.
    INTERNATIONAL JOURNAL OF GRID AND DISTRIBUTED COMPUTING, 2011, 4 (03): : 47 - 60
  • [6] On the Impact of Distance Metrics in Instance-Based Learning Algorithms
    Lopes, Noel
    Ribeiro, Bernardete
    PATTERN RECOGNITION AND IMAGE ANALYSIS (IBPRIA 2015), 2015, 9117 : 48 - 56
  • [7] Effects of domain characteristics on instance-based learning algorithms
    Okamoto, S
    Yugami, N
    THEORETICAL COMPUTER SCIENCE, 2003, 298 (01) : 207 - 233
  • [8] EXPERIMENTING WITH DECISION TREES AND INSTANCE-BASED LEARNING ALGORITHMS
    Paun, Alexandru
    LET'S BUILD THE FUTURE THROUGH LEARNING INNOVATION!, VOL. 2, 2014, : 378 - 384
  • [9] Pragmatic Investigation on Performance of Instance-Based Learning Algorithms
    Venkatesh, Bharathan
    Singh, Danasingh Asir Antony Gnana
    Leavline, Epiphany Jebamalar
    ARTIFICIAL INTELLIGENCE AND EVOLUTIONARY COMPUTATIONS IN ENGINEERING SYSTEMS, ICAIECES 2016, 2017, 517 : 633 - 642
  • [10] Natural Neighbor Reduction Algorithm for Instance-based Learning
    Yang, Lijun
    Zhu, Qingsheng
    Huang, Jinlong
    Cheng, Dongdong
    Zhang, Cheng
    INTERNATIONAL JOURNAL OF COGNITIVE INFORMATICS AND NATURAL INTELLIGENCE, 2016, 10 (04) : 59 - 73