Clustering-based k-nearest neighbor classification for large-scale data with neural codes representation

被引:75
作者
Gallego, Antonio-Javier [1 ]
Calvo-Zaragoza, Jorge [1 ]
Valero-Mas, Jose J. [1 ]
Rico-Juan, Juan R. [1 ]
机构
[1] Univ Alicante, Dept Lenguajes & Sistemas Informat, Carretera San Vicente Raspeig S-N, Alicante 03690, Spain
关键词
Efficient kNN classification; Clustering; Deep neural networks; ALGORITHMS; SELECTION;
D O I
10.1016/j.patcog.2017.09.038
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
While standing as one of the most widely considered and successful supervised classification algorithms, the k-nearest Neighbor (kNN) classifier generally depicts a poor efficiency due to being an instance-based method. In this sense, Approximated Similarity Search (ASS) stands as a possible alternative to improve those efficiency issues at the expense of typically lowering the performance of the classifier. In this paper we take as initial point an ASS strategy based on clustering. We then improve its performance by solving issues related to instances located close to the cluster boundaries by enlarging their size and considering the use of Deep Neural Networks for learning a suitable representation for the classification task at issue. Results using a collection of eight different datasets show that the combined use of these two strategies entails a significant improvement in the accuracy performance, with a considerable reduction in the number of distances needed to classify a sample in comparison to the basic kNN rule. (C) 2017 Elsevier Ltd. All rights reserved.
引用
收藏
页码:531 / 543
页数:13
相关论文
共 61 条
  • [1] Fast nearest neighbor condensation for large data sets classification
    Angiulli, Fabrizio
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2007, 19 (11) : 1450 - 1464
  • [2] [Anonymous], NEURAL CODES IMAGE R
  • [3] [Anonymous], TECHNICAL REPORT
  • [4] [Anonymous], 2007, P 18 ANN ACM SIAM S
  • [5] [Anonymous], 2014, ARXIV14082927
  • [6] [Anonymous], 2013, arXiv
  • [7] [Anonymous], 1982, Pattern Recognition: A Statistical Approach
  • [8] [Anonymous], 2006, Pattern Recognition and Machine Learning
  • [9] [Anonymous], 2001, PATTERN CLASSIFICATI
  • [10] [Anonymous], P IEEE COMP SOC C CO