Boosting for Vote Learning in High-dimensional kNN Classification

被引:0
作者
Tomasev, Nenad [1 ]
机构
[1] Jozef Stefan Inst, Artificial Intelligence Lab, Ljubljana, Slovenia
来源
2014 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOP (ICDMW) | 2014年
关键词
machine learning; k-nearest neighbor; high-dimensional data; boosting; hubness; hubness-aware classification;
D O I
10.1109/ICDMW.2014.38
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Intrinsically high-dimensional data has recently been shown to exhibit substantial hubness in terms of skewness of the k-nearest neighbor occurrence frequency distribution. While some points arise as centers of influence and dominate most k-nearest neighbor sets, other points occur very rarely and barely affect the inferred models. Hubness has been shown to be highly detrimental to many learning tasks and several hubness-aware learning methods have recently been proposed. This paper extends the existing fuzzy neighbor occurrence models in order to enable cost-sensitive learning. We evaluate the extended implementations within the context of multi-class boosting, which is used to learn the appropriate neighbor votes during the re-weighting iterations. The proposed approach is evaluated on a series of high-dimensional datasets from various domains. The results demonstrate promising improvements of the proposed approach over the baselines.
引用
收藏
页码:676 / 683
页数:8
相关论文
共 53 条
[1]  
Abbasian Houman, 2013, Machine Learning and Knowledge Discovery in Databases. European Conference, ECML PKDD 2013. Proceedings: LNCS 8190, P33, DOI 10.1007/978-3-642-40994-3_3
[2]  
Aggarwal CC, 2001, LECT NOTES COMPUT SC, V1973, P420
[3]  
[Anonymous], 2009, P 26 INT C MACHINE L, DOI DOI 10.1145/1553374.1553485
[4]  
[Anonymous], ANALYSIS
[5]  
[Anonymous], 2004, Journal of negative results in speech and audio sciences
[6]  
[Anonymous], INVESTIGATING IMPACT
[7]  
[Anonymous], P 5 AUD MOSTL C C IN
[8]  
Bellman R., 1961, Adaptive Control Processes: A Guided Tour, DOI DOI 10.1515/9781400874668
[9]  
Beyer K, 1999, LECT NOTES COMPUT SC, V1540, P217
[10]  
Buza K., 2010, Proceedings 2010 IEEE 13th International Conference on Computational Science and Engineering (CSE 2010), P48, DOI 10.1109/CSE.2010.16