Locally adaptive k parameter selection for nearest neighbor classifier: one nearest cluster

被引:18
作者
Bulut, Faruk [1 ]
Amasyali, Mehmet Fatih [1 ]
机构
[1] Yildiz Tech Univ, Dept Comp Engn, Fac Elect & Elect Engn, Istanbul, Turkey
关键词
Dynamic k parameter; k-NN; Classification; Clustering; Meta-parameter selection; CHOICE;
D O I
10.1007/s10044-015-0504-0
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The k nearest neighbors (k-NN) classification technique has a worldly wide fame due to its simplicity, effectiveness, and robustness. As a lazy learner, k-NN is a versatile algorithm and is used in many fields. In this classifier, the k parameter is generally chosen by the user, and the optimal k value is found by experiments. The chosen constant k value is used during the whole classification phase. The same k value used for each test sample can decrease the overall prediction performance. The optimal k value for each test sample should vary from others in order to have more accurate predictions. In this study, a dynamic k value selection method for each instance is proposed. This improved classification method employs a simple clustering procedure. In the experiments, more accurate results are found. The reasons of success have also been understood and presented.
引用
收藏
页码:415 / 425
页数:11
相关论文
共 20 条
[1]  
Agapitos Alexandros, 2013, Genetic Programming. 16th European Conference (EuroGP 2013). Proceedings, P1, DOI 10.1007/978-3-642-37207-0_1
[2]  
Amal MA, 2011, INT J ADV COMPUT SC, V2, P59
[3]  
[Anonymous], KD TREE SEARCH CLASS
[4]  
Bache K., 2013, UCI Machine Learning Repository
[5]  
Bhatia N, 2010, Survey of nearest neighbor techniques
[6]  
Demsar J, 2006, J MACH LEARN RES, V7, P1
[7]   Resampling methods for quality assessment of classifier performance and optimal number of features [J].
Fandos, Raquel ;
Debes, Christian ;
Zoubir, Abdelhak M. .
SIGNAL PROCESSING, 2013, 93 (11) :2956-2968
[8]   On nearest neighbor classification using adaptive choice of k [J].
Ghosh, Anil K. .
JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2007, 16 (02) :482-502
[9]   Multiscale classification using nearest neighbor density estimates [J].
Ghosh, Anil K. ;
Chaudhuri, Probal ;
Murthy, C. A. .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2006, 36 (05) :1139-1148
[10]   On optimum choice of k in nearest neighbor classification [J].
Ghosh, Anil K. .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2006, 50 (11) :3113-3123