Clustering-based k-nearest neighbor classification for large-scale data with neural codes representation

被引:74
|
作者
Gallego, Antonio-Javier [1 ]
Calvo-Zaragoza, Jorge [1 ]
Valero-Mas, Jose J. [1 ]
Rico-Juan, Juan R. [1 ]
机构
[1] Univ Alicante, Dept Lenguajes & Sistemas Informat, Carretera San Vicente Raspeig S-N, Alicante 03690, Spain
关键词
Efficient kNN classification; Clustering; Deep neural networks; ALGORITHMS; SELECTION;
D O I
10.1016/j.patcog.2017.09.038
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
While standing as one of the most widely considered and successful supervised classification algorithms, the k-nearest Neighbor (kNN) classifier generally depicts a poor efficiency due to being an instance-based method. In this sense, Approximated Similarity Search (ASS) stands as a possible alternative to improve those efficiency issues at the expense of typically lowering the performance of the classifier. In this paper we take as initial point an ASS strategy based on clustering. We then improve its performance by solving issues related to instances located close to the cluster boundaries by enlarging their size and considering the use of Deep Neural Networks for learning a suitable representation for the classification task at issue. Results using a collection of eight different datasets show that the combined use of these two strategies entails a significant improvement in the accuracy performance, with a considerable reduction in the number of distances needed to classify a sample in comparison to the basic kNN rule. (C) 2017 Elsevier Ltd. All rights reserved.
引用
收藏
页码:531 / 543
页数:13
相关论文
共 50 条
  • [1] A Large-Scale k-Nearest Neighbor Classification Algorithm Based on Neighbor Relationship Preservation
    Song, Yunsheng
    Kong, Xiaohan
    Zhang, Chao
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2022, 2022
  • [2] Clustering-based reference set reduction for k-nearest neighbor
    Hwang, Seongseob
    Cho, Sungzoon
    ADVANCES IN NEURAL NETWORKS - ISNN 2007, PT 2, PROCEEDINGS, 2007, 4492 : 880 - +
  • [3] Insights Into Efficient k-Nearest Neighbor Classification With Convolutional Neural Codes
    Gallego, Antonio-Javier
    Calvo-Zaragoza, Jorge
    Ramon Rico-Juan, Juan
    IEEE ACCESS, 2020, 8 : 99312 - 99326
  • [4] Comparative Analysis of K-Nearest Neighbor and Modified K-Nearest Neighbor Algorithm for Data Classification
    Okfalisa
    Mustakim
    Gazalba, Ikbal
    Reza, Nurul Gayatri Indah
    2017 2ND INTERNATIONAL CONFERENCES ON INFORMATION TECHNOLOGY, INFORMATION SYSTEMS AND ELECTRICAL ENGINEERING (ICITISEE): OPPORTUNITIES AND CHALLENGES ON BIG DATA FUTURE INNOVATION, 2017, : 294 - 298
  • [5] K-Nearest Neighbor Intervals Based AP Clustering Algorithm for Large Incomplete Data
    Lu, Cheng
    Song, Shiji
    Wu, Cheng
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2015, 2015
  • [6] Locality constrained representation-based K-nearest neighbor classification
    Gou, Jianping
    Qiu, Wenmo
    Yi, Zhang
    Shen, Xiangjun
    Zhan, Yongzhao
    Ou, Weihua
    KNOWLEDGE-BASED SYSTEMS, 2019, 167 : 38 - 52
  • [7] An RBF Neural Network Clustering Algorithm Based on K-Nearest Neighbor
    Li, Jitao
    Xu, Chugui
    Liang, Yongquan
    Wu, Gengkun
    Liang, Zhao
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2022, 2022
  • [8] Tissue Classification of Large-scale Multi-site MR Data Using Fuzzy k-Nearest Neighbor Method
    Ghayoor, Ali
    Paulsen, Jane S.
    Kim, Regina E. Y.
    Johnson, Hans J.
    MEDICAL IMAGING 2016: IMAGE PROCESSING, 2016, 9784
  • [9] Efficient K-Nearest Neighbor Graph Construction Using MapReduce for Large-Scale Data Sets
    Warashina, Tomohiro
    Aoyama, Kazuo
    Sawada, Hiroshi
    Hattori, Takashi
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2014, E97D (12): : 3142 - 3154
  • [10] Hierarchical Clustering-Based Graphs for Large Scale Approximate Nearest Neighbor Search
    Munoz, Javier Vargas
    Goncalves, Marcos A.
    Dias, Zanoni
    Torres, Ricardo da S.
    PATTERN RECOGNITION, 2019, 96