InstanceRank: Bringing order to datasets

Cited by: 15
Authors
Vallejo, Carlos G. [1]
Troyano, Jose A. [1]
Javier Ortega, F. [1]
Affiliations
[1] Univ Seville, Dept Comp Languages & Syst, E-41012 Seville, Spain
Keywords
Instance-based learning; Instance reduction; Nearest neighbor; PageRank; Classification; Multiple data sets; Learning algorithms; Statistical comparisons; Nearest; Classifiers; Error
DOI
10.1016/j.patrec.2009.09.022
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
In this paper we present InstanceRank, a ranking algorithm that reflects the relevance of the instances within a dataset. InstanceRank applies a solution similar to that used by PageRank, the web page ranking algorithm of the Google search engine. We also present ISR, an instance selection technique that uses InstanceRank. This algorithm chooses the most representative instances from a learning database. Experiments show that the ISR algorithm, with InstanceRank as its ranking criterion, achieves accuracy similar to that of other instance reduction techniques while noticeably reducing the size of the instance set. © 2009 Elsevier B.V. All rights reserved.
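The abstract does not say how the instance graph is built or how ranks are propagated, so the following Python sketch only illustrates the general idea of applying a PageRank-style recurrence to a dataset. It assumes, hypothetically, that each instance links to its k nearest neighbors under Euclidean distance. The function names `knn_graph` and `rank_instances`, the damping factor of 0.85, and the toy points are all assumptions made for illustration and are not taken from the paper.

```python
import math


def knn_graph(points, k):
    """Link each point to its k nearest neighbors (Euclidean distance).

    Assumption: this graph construction is illustrative only; the record
    above does not describe the graph that InstanceRank actually uses.
    """
    if not 1 <= k < len(points):
        raise ValueError("k must satisfy 1 <= k < number of points")
    graph = {}
    for i, p in enumerate(points):
        others = [j for j in range(len(points)) if j != i]
        # Stable sort: ties in distance keep the lower index first.
        others.sort(key=lambda j: math.dist(p, points[j]))
        graph[i] = others[:k]
    return graph


def rank_instances(graph, damping=0.85, iterations=100):
    """Run the standard PageRank power iteration on a directed graph.

    Every node has exactly k outgoing links here, so there are no
    dangling nodes and the scores keep summing to 1.
    """
    n = len(graph)
    rank = {i: 1.0 / n for i in graph}
    for _ in range(iterations):
        # Each node starts with the uniform "teleport" share.
        new_rank = {i: (1.0 - damping) / n for i in graph}
        for i, neighbors in graph.items():
            # A node splits its damped score evenly among its links.
            share = damping * rank[i] / len(neighbors)
            for j in neighbors:
                new_rank[j] += share
        rank = new_rank
    return rank


# Toy data: four corners of a unit square, its center, and one outlier.
points = [(0, 0), (0, 1), (1, 0), (1, 1), (0.5, 0.5), (5, 5)]
scores = rank_instances(knn_graph(points, k=2))
for index in sorted(scores, key=scores.get, reverse=True):
    print(index, points[index], round(scores[index], 4))
```

In this toy run the central point, which is a nearest neighbor of every other point, receives the highest score, while the outlier, which no point links to, keeps only the uniform teleport share. An instance selection step in the spirit of ISR could then keep the top-ranked instances, although the selection rule the paper actually uses is not given in this record.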
Pages: 133-142
Number of pages: 10