Condensed fuzzy nearest neighbor methods based on fuzzy rough set technique

被引:10
作者
Zhai, Junhai [1 ,2 ]
Zhai, Mengyao [3 ]
Kang, Xiaomeng [1 ]
机构
[1] Hebei Univ, Coll Math & Comp Sci, Baoding, Hebei, Peoples R China
[2] Key Lab Machine Learning & Computat Intelligence, Baoding, Hebei, Peoples R China
[3] Hebei Univ, Ind & Commercial Coll, Baoding, Hebei, Peoples R China
基金
中国国家自然科学基金;
关键词
K-nearest neighbor; fuzzy K-nearest neighbor; rough set; fuzzy rough set; attribute reduction; instance selection; NEURAL-NETWORKS; DECISION TREES; ALGORITHMS; INDUCTION;
D O I
10.3233/IDA-140649
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As a generalization of K-nearest neighbor (K-NN) algorithm, the fuzzy K-nearest neighbor (fuzzy K-NN) algorithm was originally developed by Keller in 1985 to overcome one of the drawbacks of K-NN (i.e. all of instances are considered equally important in K-NN). However, fuzzy K-NN algorithm still suffers from the problem of large memory requirement same as K-NN. To deal with this problem, based on fuzzy rough set technique, this paper proposed two condensed fuzzy nearest neighbor methods denoted by CFK-NN1 and CFK-NN2 and a modified fuzzy K-NN. The CFK-NN1 and CFK-NN2 both consists of three steps: (1) obtaining a fuzzy attribute reduct based on fuzzy rough set technique, (2) finding two sets of prototypes, the one is selected from fuzzy positive region (corresponding to CFK-NN1) and the other is selected from fuzzy boundary region (corresponding to CFK-NN2), (3) extracting fuzzy classification rules with the modified fuzzy K-NN from the two sets of prototypes. Extensive experiments and statistical analysis are conducted to verify the effectiveness of our proposed method. The experimental results and the statistical analysis of the experimental results both demonstrate that the proposed methods outperform other related methods such as CNN, ENN, and ICF et al.
引用
收藏
页码:429 / 447
页数:19
相关论文
共 39 条
[1]   FUZZY DECISION TREES [J].
ADAMO, JM .
FUZZY SETS AND SYSTEMS, 1980, 4 (03) :207-219
[2]   Fast nearest neighbor condensation for large data sets classification [J].
Angiulli, Fabrizio .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2007, 19 (11) :1450-1464
[3]  
[Anonymous], 1991, Nearest neighbor (NN) norms: NN pattern classification techniques
[4]   Decision boundary preserving prototype selection for nearest neighbor classification [J].
Barandela, R ;
Ferri, FJ ;
Sánchez, JS .
INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2005, 19 (06) :787-806
[5]   Advances in instance selection for instance-based learning algorithms [J].
Brighton, H ;
Mellish, C .
DATA MINING AND KNOWLEDGE DISCOVERY, 2002, 6 (02) :153-172
[6]   FUZZY NEURAL NETWORKS - A SURVEY [J].
BUCKLEY, JJ ;
HAYASHI, Y .
FUZZY SETS AND SYSTEMS, 1994, 66 (01) :1-13
[7]   Support vector learning for fuzzy rule-based classification systems [J].
Chen, YX ;
Wang, JZ .
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2003, 11 (06) :716-728
[8]   NEAREST NEIGHBOR PATTERN CLASSIFICATION [J].
COVER, TM ;
HART, PE .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1967, 13 (01) :21-+
[9]   MINIMAL CONSISTENT SET (MCS) IDENTIFICATION FOR OPTIMAL NEAREST-NEIGHBOR DECISION SYSTEMS-DESIGN [J].
DASARATHY, BV .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1994, 24 (03) :511-517
[10]  
Demsar J, 2006, J MACH LEARN RES, V7, P1