InstanceRank based on borders for instance selection

被引:24
作者
Hernandez-Leal, Pablo [1 ]
Ariel Carrasco-Ochoa, J. [1 ]
Fco Martinez-Trinidad, J. [1 ]
Arturo Olvera-Lopez, J. [2 ]
机构
[1] Natl Inst Astrophys Opt & Elect, Dept Comp Sci, Puebla 72840, Mexico
[2] Benemerita Univ Autonoma Puebla, Dept Comp Sci, Puebla 72570, Mexico
关键词
Instance selection; Instance ranking; Border instances; Supervised classification; NEAREST-NEIGHBOR; ALGORITHM;
D O I
10.1016/j.patcog.2012.07.007
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Instance selection algorithms are used for reducing the number of training instances. However, most of them suffer from long runtimes which results in the incapability to be used with large datasets. In this work, we introduce an Instance Ranking per class using Borders (instances near to instances belonging to different classes), using this ranking we propose an instance selection algorithm (IRB). We evaluated the proposed algorithm using k-NN with small and large datasets, comparing it against state of the art instance selection algorithms. In our experiments, for large datasets IRB has the best compromise between time and accuracy. We also tested our algorithm using SVM, LWLR and C4.5 classifiers, in all cases the selection computed by our algorithm obtained the best accuracies in average. (C) 2012 Elsevier Ltd. All rights reserved.
引用
收藏
页码:365 / 375
页数:11
相关论文
共 46 条
[1]  
[Anonymous], 1998, P 7 INT WORLD WID WE
[2]   A review of instance selection methods [J].
Arturo Olvera-Lopez, J. ;
Ariel Carrasco-Ochoa, J. ;
Francisco Martinez-Trinidad, J. ;
Kittler, Josef .
ARTIFICIAL INTELLIGENCE REVIEW, 2010, 34 (02) :133-143
[3]   A new fast prototype selection method based on clustering [J].
Arturo Olvera-Lopez, J. ;
Ariel Carrasco-Ochoa, J. ;
Francisco Martinez-Trinidad, J. .
PATTERN ANALYSIS AND APPLICATIONS, 2010, 13 (02) :131-141
[4]   Nearest prototype classifier designs: An experimental study [J].
Bezdek, JC ;
Kuncheva, LI .
INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2001, 16 (12) :1445-1473
[5]   Selection of relevant features and examples in machine learning [J].
Blum, AL ;
Langley, P .
ARTIFICIAL INTELLIGENCE, 1997, 97 (1-2) :245-271
[6]   Advances in instance selection for instance-based learning algorithms [J].
Brighton, H ;
Mellish, C .
DATA MINING AND KNOWLEDGE DISCOVERY, 2002, 6 (02) :153-172
[7]   Automated breast cancer detection and classification using ultrasound images: A survey [J].
Cheng, H. D. ;
Shan, Juan ;
Ju, Wen ;
Guo, Yanhui ;
Zhang, Ling .
PATTERN RECOGNITION, 2010, 43 (01) :299-317
[8]  
Chou CH, 2006, INT C PATT RECOG, P556
[9]   NEAREST NEIGHBOR PATTERN CLASSIFICATION [J].
COVER, TM ;
HART, PE .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1967, 13 (01) :21-+
[10]  
Czarnowski I., 2010, KNOWL INF SYST, P1