Feature Extraction based Text Classification using K-Nearest Neighbor Algorithm

被引:0
作者
Azam, Muhammad [1 ]
Ahmed, Tanvir [1 ]
Sabah, Fahad [1 ]
Hussain, Muhammad Iftikhar [2 ,3 ]
机构
[1] Super Univ Lahore, Dept Comp Sci & Informat Technol, Lahore, Pakistan
[2] Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
[3] Beijing Univ Technol, Beijing Engn Res Ctr IoT Software & Syst, Beijing 100124, Peoples R China
来源
INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY | 2018年 / 18卷 / 12期
关键词
K-NN; naive bayes; text classification; rapid miner; feature extraction;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Scientific publications has been increasing enormously, with this increase classification of scientific publications is becoming challenging task. The core objective of this research is to analyze the performance of classification algorithms using Scopus dataset. In text classification, classification and feature extraction from the document using extracted features are the major issues for decreasing the performances in different algorithms. In this paper, performances of classification algorithms such as Naive Bayes (NB) and K-Nearest Neighbor (K-NN) shown better improvement using Bayesian boost and bagging. The performance results were analyzed through selected classification algorithms over 10K documents from Scopus examined using F-measure and produced comparison matrices to estimate accuracy, precision and recall using NB and KNN classifier. Further, data preprocessing and cleaning steps are induced on the selected dataset and class imbalance issues are analyzed to increase the performance of text classification algorithms. Experimental results showed performances over 7% using K-NN and revealed better as compared to NB.
引用
收藏
页码:95 / 101
页数:7
相关论文
共 50 条
[41]   Diagnosis of Arthritis Using K-Nearest Neighbor Approach [J].
Kaur, Rupinder ;
Madaan, Vishu ;
Agrawal, Prateek .
ADVANCED INFORMATICS FOR COMPUTING RESEARCH, PT I, 2019, 1075 :160-171
[42]   K-Nearest Neighbour Classification and Feature Extraction GLCM for Identification of Terry's Nail [J].
Safira, Laura ;
Irawan, Budhi ;
Setianingsih, Casi .
2019 IEEE INTERNATIONAL CONFERENCE ON INDUSTRY 4.0, ARTIFICIAL INTELLIGENCE, AND COMMUNICATIONS TECHNOLOGY (IAICT), 2019, :98-104
[43]   Infant Cry Classification Using Semi-supervised K-Nearest Neighbor Approach [J].
Mahmoud, Amany Mounes ;
Swilem, Sarah Mohamed ;
Alqarni, Abrar Saeed ;
Haron, Fazilah .
2020 13TH INTERNATIONAL CONFERENCE ON DEVELOPMENTS IN ESYSTEMS ENGINEERING (DESE 2020), 2020, :305-310
[44]   Classification of EEG Data using k-Nearest Neighbor approach for Concealed Information Test [J].
Bablani, Annushree ;
Edla, Damodar Reddy ;
Dodia, Shubham .
8TH INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING & COMMUNICATIONS (ICACC-2018), 2018, 143 :242-249
[45]   Predictive Control Algorithm for Solar Station based on K-Nearest Neighbor and K-Means [J].
Sedliarov, Yehor ;
Klen, Kateryna .
2024 IEEE 7TH INTERNATIONAL CONFERENCE ON SMART TECHNOLOGIES IN POWER ENGINEERING AND ELECTRONICS, STEE, 2024,
[46]   Classification Hoax News of Covid-19 on Instagram Using K-Nearest Neighbor [J].
Akbar, F. Indra Malik ;
Yaddarabullah ;
Permana, Silvester Dian Handy .
2021 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATION, NETWORKS AND SATELLITE (COMNETSAT 2021), 2021, :157-161
[47]   Hybrid k-Nearest Neighbor Classifier [J].
Yu, Zhiwen ;
Chen, Hantao ;
Liu, Jiming ;
You, Jane ;
Leung, Hareton ;
Han, Guoqiang .
IEEE TRANSACTIONS ON CYBERNETICS, 2016, 46 (06) :1263-1275
[48]   Towards enriching the quality of k-nearest neighbor rule for document classification [J].
Basu, Tanmay ;
Murthy, C. A. .
INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2014, 5 (06) :897-905
[49]   Towards enriching the quality of k-nearest neighbor rule for document classification [J].
Tanmay Basu ;
C. A. Murthy .
International Journal of Machine Learning and Cybernetics, 2014, 5 :897-905
[50]   Design of poultry farm disease detection system based on K-Nearest Neighbor Algorithm [J].
Kim, Seung Jae ;
Yoe, Hyun ;
Lee, Meong Hun .
2023 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE IN INFORMATION AND COMMUNICATION, ICAIIC, 2023, :762-766