Random projection ensemble adaptive nearest neighbor classification

被引:0
作者
Kang, Jongkyeong [1 ]
Jhun, Myoungshic [2 ]
机构
[1] Korea Univ, Dept Stat, Seoul, South Korea
[2] State Univ New York Korea, Dept Appl Math & Stat, 119 Songdo Moonhwa Ro, Incheon 21985, South Korea
基金
新加坡国家研究基金会;
关键词
adaptive nearest neighbor; classification; high-dimensional data; K-nearest neighbor; random projection;
D O I
10.5351/KJAS.2021.34.3.401
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Popular in discriminant classification analysis, k-nearest neighbor classification methods have limitations that do not reflect the local characteristic of the data, considering only the number of fixed neighbors. Considering the local structure of the data, the adaptive nearest neighbor method has been developed to select the number of neighbors. In the analysis of high-dimensional data, it is common to perform dimension reduction such as random projection techniques before using k-nearest neighbor classification. Recently, an ensemble technique has been developed that carefully combines the results of such random classifiers and makes final assignments by voting. In this paper, we propose a novel discriminant classification technique that combines adaptive nearest neighbor methods with random projection ensemble techniques for analysis on high-dimensional data. Through simulation and real-world data analyses, we confirm that the proposed method outperforms in terms of classification accuracy compared to the previously developed methods.
引用
收藏
页码:401 / 410
页数:10
相关论文
共 17 条
[1]  
[Anonymous], 2013, A probabilistic theory of pattern recognition
[2]  
Bingham E., 2001, KDD-2001. Proceedings of the Seventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, P245, DOI 10.1145/502512.502546
[3]   Random-projection ensemble classification [J].
Cannings, Timothy I. ;
Samworth, Richard J. .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2017, 79 (04) :959-1035
[4]  
장덕준, 2008, [Journal of the Korean Data And Information Science Sociaty, 한국데이터정보과학회지], V19, P511
[5]  
Fix E., 1951, PSYCEXTRA DATASET
[6]  
Friedman J., 1994, Flexible metric nearest-neighbors classification
[7]   CHOICE OF NEIGHBOR ORDER IN NEAREST-NEIGHBOR CLASSIFICATION [J].
Hall, Peter ;
Park, Byeong U. ;
Samworth, Richard J. .
ANNALS OF STATISTICS, 2008, 36 (05) :2135-2152
[8]   Discriminant adaptive nearest neighbor classification [J].
Hastie, T ;
Tibshirani, R .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1996, 18 (06) :607-616
[9]   Adaptive Nearest Neighbors for Classification [J].
Jhun, Myoungshic ;
Choi, Inkyung .
KOREAN JOURNAL OF APPLIED STATISTICS, 2009, 22 (03) :479-488
[10]  
Johnson W. B., 1984, Contemporary Mathematics, V26, P189, DOI DOI 10.1090/CONM/026/737400