An Improved KNN Algorithm for Text Classification

Cited by: 3
Authors
Li, Huijuan [1]
Jiang, He [1]
Wang, Dongyuan [1]
Han, Bing [1]
Affiliation
[1] Qilu University of Technology (Shandong Academy of Sciences), Jinan, People's Republic of China
Source
2018 Eighth International Conference on Instrumentation and Measurement, Computer, Communication and Control (IMCCC 2018) | 2018
Keywords
KNN; text classification; similarity; coupling
DOI
10.1109/IMCCC.2018.00225
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Among the many text classification algorithms based on the vector space model, the KNN (K-Nearest Neighbor) classifier performs particularly well. In KNN classification, the similarity computed between documents directly determines which K neighbors are selected, and therefore strongly affects classification quality. However, traditional KNN text classification computes text similarity too coarsely, ignoring both the relations within a document and the relationships between documents. This paper therefore proposes an improved KNN algorithm that calculates similarity by taking into account the interaction and coupling relationships within and between documents. Theoretical analysis and experiments show that the improved algorithm overcomes the shortcomings of previous algorithms and improves the accuracy of KNN text classification.
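
For orientation, the traditional baseline that the abstract refers to (KNN over a vector space model) might look roughly like the sketch below. It is not the authors' improved algorithm: the abstract does not define the coupled similarity, so none is attempted here. Whitespace tokenization, raw term-frequency weights, cosine similarity, similarity-weighted voting, and the names `tf_vector`, `cosine_similarity`, and `knn_classify` are all assumptions made for illustration.

```python
import math
from collections import Counter


def tf_vector(text):
    """Bag-of-words term-frequency vector (assumes whitespace tokenization)."""
    return Counter(text.lower().split())


def cosine_similarity(a, b):
    """Cosine similarity between two sparse term-frequency vectors."""
    dot = sum(a[term] * b[term] for term in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty document is similar to nothing
    return dot / (norm_a * norm_b)


def knn_classify(query, training_docs, k=3):
    """Label `query` by a similarity-weighted vote among its k nearest neighbors.

    `training_docs` is a list of (text, label) pairs. Weighting each vote by
    similarity is one common choice, assumed here; a plain majority vote
    would also be a valid baseline.
    """
    q = tf_vector(query)
    scored = [(cosine_similarity(q, tf_vector(text)), label)
              for text, label in training_docs]
    scored.sort(key=lambda pair: pair[0], reverse=True)

    votes = Counter()
    for similarity, label in scored[:k]:
        votes[label] += similarity
    return votes.most_common(1)[0][0]
```

Because each term is treated as an independent dimension, this baseline sees no relationship between different words or between documents beyond exact term overlap, which is the kind of coarseness the abstract criticizes.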
Pages: 1081-1085
Page count: 5