A new locally adaptive k-nearest neighbor algorithm based on discrimination class

被引:74
作者
Pan, Zhibin [1 ,2 ]
Wang, Yikun [1 ]
Pan, Yiwei [1 ]
机构
[1] Xi An Jiao Tong Univ, Fac Elect & Informat Engn, Sch Informat & Commun Engn, Xian 710049, Shaanxi, Peoples R China
[2] CAST, Natl Key Lab Sci & Technol Space Microwave, Xian, Peoples R China
基金
中国国家自然科学基金;
关键词
Classification algorithm; k-nearest neighbor rule; Majority class; Second majority class; Discrimination class; Adaptive k value;
D O I
10.1016/j.knosys.2020.106185
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The k-nearest neighbor (kNN) rule is a classical non-parametric classification algorithm in pattern recognition, and has been widely used in many fields due to its simplicity, effectiveness and intuitiveness. However, the classification performance of the kNN algorithm suffers from the choice of a fixed and single value of k for all queries in the search stage and the use of simple majority voting rule in the decision stage. In this paper, we propose a new kNN-based algorithm, called locally adaptive k-nearest neighbor algorithm based on discrimination class (DC-LAKNN). In our method, the role of the second majority class in classification is for the first time considered. Firstly, the discrimination classes at different values of k are selected from the majority class and the second majority class in the k-neighborhood of the query. Then, the adaptive k value and the final classification result are obtained according to the quantity and distribution information on the neighbors in the discrimination classes at each value of k. Extensive experiments on eighteen real-world datasets from UCI (University of California, Irvine) Machine Learning Repository and KEEL (Knowledge Extraction based on Evolutionary Learning) Repository show that the DC-LAKNN algorithm achieves better classification performance compared to standard kNN algorithm and nine other state-of-the-art kNN-based algorithms. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页数:13
相关论文
共 35 条
[1]  
Alcalá-Fdez J, 2011, J MULT-VALUED LOG S, V17, P255
[2]   Distance-based detection and prediction of outliers [J].
Angiulli, F ;
Basta, S ;
Pizzuti, C .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2006, 18 (02) :145-160
[3]   NEAREST NEIGHBOR PATTERN CLASSIFICATION [J].
COVER, TM ;
HART, PE .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1967, 13 (01) :21-+
[4]  
DUDANI SA, 1976, IEEE T SYST MAN CYB, V6, P327
[5]   A novel version of k nearest neighbor: Dependent nearest neighbor [J].
Ertugrul, Omer Faruk ;
Tagluk, Mehmet Emin .
APPLIED SOFT COMPUTING, 2017, 55 :480-490
[6]   A Proposal for Local k Values for k-Nearest Neighbor Rule [J].
Garcia-Pedrajas, Nicolas ;
Romero del Castillo, Juan A. ;
Cerruela-Garcia, Gonzalo .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2017, 28 (02) :470-475
[7]   Fast mining of distance-based outliers in high-dimensional datasets [J].
Ghoting, Amol ;
Parthasarathy, Srinivasan ;
Otey, Matthew Eric .
DATA MINING AND KNOWLEDGE DISCOVERY, 2008, 16 (03) :349-364
[8]  
Gou J., 2011, J INF COMPUT SCI, V9, P1429
[9]   Improved pseudo nearest neighbor classification [J].
Gou, Jianping ;
Zhan, Yongzhao ;
Rao, Yunbo ;
Shen, Xiangjun ;
Wang, Xiaoming ;
He, Wu .
KNOWLEDGE-BASED SYSTEMS, 2014, 70 :361-375
[10]   A Local Mean-Based k-Nearest Centroid Neighbor Classifier [J].
Gou, Jianping ;
Yi, Zhang ;
Du, Lan ;
Xiong, Taisong .
COMPUTER JOURNAL, 2012, 55 (09) :1058-1071