Three new instance selection methods based on local sets: A comparative study with several approaches from a bi-objective perspective

被引:74
作者
Leyva, Enrique [1 ]
Gonzalez, Antonio [1 ]
Perez, Raul [1 ]
机构
[1] Univ Granada, Dept Ciencias Comp & IA, ETSIIT, E-18071 Granada, Spain
关键词
Local sets; Instance selection; Data reduction; Prototype-based classifiers; Instance-based learning; PROTOTYPE REDUCTION SCHEMES; NEAREST-NEIGHBOR; EVOLUTIONARY INSTANCE; CLASSIFICATION RULES; ALGORITHMS; SYSTEMS;
D O I
10.1016/j.patcog.2014.10.001
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The local set is the largest hypersphere centered on an instance such that it does not contain instances from any other class. Due to its geometrical nature, this structure can be very helpful for distance-based classification, such as classification based on the nearest neighbor rule. This paper is focused on instance selection for nearest neighbor classification which, in short, aims to reduce the number of instances in the training set without affecting the classification accuracy. Three instance selection methods based on local sets, which follow different and complementary strategies, are proposed. In an experimental study involving 26 known databases, they are compared with 11 of the most successful state-of-the-art methods in standard and noisy environments. To evaluate their performances, two complementary approaches are applied, the Pareto dominance relation and the Technique for Order Preference by Similarity to Ideal Solution. The results achieved by the proposals reveal that they are among the most effective methods in this field. (C) 2014 Elsevier Ltd. All rights reserved.
引用
收藏
页码:1523 / 1537
页数:15
相关论文
共 49 条
  • [1] AHA DW, 1991, MACH LEARN, V6, P37, DOI 10.1007/BF00153759
  • [2] Alcalá-Fdez J, 2011, J MULT-VALUED LOG S, V17, P255
  • [3] Fast nearest neighbor condensation for large data sets classification
    Angiulli, Fabrizio
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2007, 19 (11) : 1450 - 1464
  • [4] Genetic Training Instance Selection in Multiobjective Evolutionary Fuzzy Systems: A Coevolutionary Approach
    Antonelli, Michela
    Ducange, Pietro
    Marcelloni, Francesco
    [J]. IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2012, 20 (02) : 276 - 290
  • [5] A review of instance selection methods
    Arturo Olvera-Lopez, J.
    Ariel Carrasco-Ochoa, J.
    Francisco Martinez-Trinidad, J.
    Kittler, Josef
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2010, 34 (02) : 133 - 143
  • [6] A new fast prototype selection method based on clustering
    Arturo Olvera-Lopez, J.
    Ariel Carrasco-Ochoa, J.
    Francisco Martinez-Trinidad, J.
    [J]. PATTERN ANALYSIS AND APPLICATIONS, 2010, 13 (02) : 131 - 141
  • [7] Advances in instance selection for instance-based learning algorithms
    Brighton, H
    Mellish, C
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY, 2002, 6 (02) : 153 - 172
  • [8] Combining instance selection methods based on data characterization: An approach to increase their effectiveness
    Caises, Yoel
    Gonzalez, Antonio
    Leyva, Enrique
    Perez, Raul
    [J]. INFORMATION SCIENCES, 2011, 181 (20) : 4780 - 4798
  • [9] Evolutionary stratified training set selection for extracting classification rules with trade off precision-interpretability
    Cano, Jose Ramon
    Herrera, Francisco
    Lozano, Manuel
    [J]. DATA & KNOWLEDGE ENGINEERING, 2007, 60 (01) : 90 - 108
  • [10] Stratification for scaling up evolutionary prototype selection
    Cano, JR
    Herrera, F
    Lozano, M
    [J]. PATTERN RECOGNITION LETTERS, 2005, 26 (07) : 953 - 963