The Novel k Nearest Neighbor Algorithm

被引:0
作者
Jivani, Anjali Ganesh [1 ]
机构
[1] Maharaja Sayajirao Univ Baroda, Dept Comp Sci & Engn, Vadodara, India
来源
2013 INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND INFORMATICS | 2013年
关键词
k nearest neighbors; text mining; text classification; text categorization;
D O I
暂无
中图分类号
R-058 [];
学科分类号
摘要
In the field of Text Classification/Categorization, the k Nearest Neighbor algorithm (kNN) has been to date one of the oldest and most popular methods. It has been experimented upon, implemented and tested by many researchers all over the world. There have been variations in the implementation of this algorithm and I have in this paper done the same. As the name suggests the method is dependent on the parameter 'k' which can drastically change the output as we vary its values. When the training set contains classes of unequal sizes, the test data is likely to get classified to a class which has more samples than the actual class it belongs to, if that actual class has less number of samples. In the proposed method, I have added a small variation to the classic kNN and have named this method 'The Novel k Nearest Neighbor Algorithm'. The parameter k in this method depends on the size of the smallest class sample.
引用
收藏
页数:4
相关论文
共 10 条
  • [1] Achtert Elke, 2007, P 12 GI FACHT DAT BU
  • [2] Apte C., 1994, SIGIR '94. Proceedings of the Seventeenth Annual International ACM-SIGIR Conference on Research and Development in Information Retrieval, P23
  • [3] Baoli L., 2003, P 20 INT C COMPUTER, P469
  • [4] Dumais S., 1998, Proceedings of the 1998 ACM CIKM International Conference on Information and Knowledge Management, P148, DOI 10.1145/288627.288651
  • [5] Joachims T., 1998, Machine Learning: ECML-98. 10th European Conference on Machine Learning. Proceedings, P137, DOI 10.1007/BFb0026683
  • [6] Kolahdouzan M.R., 2004, P 2 WORKSH SPAT TEMP
  • [7] WEISS S, 1999, IEEE INTELLIGENT SYS
  • [8] WINTER J, 2004, P 1 WORKSH DAT MAN S
  • [9] Yang Yi ming, 1999, P ACM SIGIR C RES DE, P42
  • [10] Yu Wang, 2007, P 6 INT C MACH LEARN