Effective K-Nearest Neighbor Algorithms Performance Analysis of Thyroid Disease

被引:19
|
作者
Abbad Ur Rehman, Hafiz [1 ]
Lin, Chyi-Yeu [1 ]
Mushtaq, Zohaib [2 ]
机构
[1] Natl Taiwan Univ Sci & Technol, Dept Mech Engn, Taipei, Taiwan
[2] Natl Taiwan Univ Sci & Technol, Dept Elect Engn, Taipei, Taiwan
关键词
Classification; thyroid disease; k-nearest neighbor; feature selection; SYSTEM;
D O I
10.1080/02533839.2020.1831967
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Thyroid is an essential gland as its hormones are controlling the metabolism system of the human body. An abnormal amount of thyroid gland secretion causes two major types of diseases which are hyperthyroidism and hypothyroidism. In this research study, the implementation of K-Nearest neighbor (KNN) with its various distance functions is presented to detect thyroid disease. The proposed study consists of three phases, which are KNN without feature selection, KNN using L-1-based feature selection, and KNN using chi-square-based feature selection techniques. Thyroid datasets from KEEL dataset repository and another from a registered hospital in Pakistan were used in this study. The new dataset was distinguished from existing datasets as it included three additional features, i.e., pulse rate, Body Mass Index (BMI), and Blood Pressure (BP). Various distance functions were used to analyze the performance of the KNN model on these two datasets. Performance evaluation metrics have been used to discuss the achievement of the classifier. The optimal range of k values from the results are described between 1 and 5. Euclidean and Cosine distance functions achieved the highest accuracy using chi-square-based feature selection technique for new dataset as compared to existing datasets.
引用
收藏
页码:77 / 87
页数:11
相关论文
共 50 条
  • [31] Validation Based Modified K-Nearest Neighbor
    Parvin, Hamid
    Alizadeh, Hosein
    Minaei-Bidgoli, Behrouz
    IAENG TRANSACTIONS ON ENGINEERING TECHNOLOGIES, VOL II, 2009, 1127 : 153 - 161
  • [32] Binary k-nearest neighbor for text categorization
    Tan, SB
    ONLINE INFORMATION REVIEW, 2005, 29 (04) : 391 - 399
  • [33] Graph Clustering with K-Nearest Neighbor Constraints
    Jakawat, Wararat
    Makkhongkaew, Raywat
    2019 16TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER SCIENCE AND SOFTWARE ENGINEERING (JCSSE 2019), 2019, : 309 - 313
  • [34] Heart Disease Prediction Using Weighted K-Nearest Neighbor Algorithm
    Khalidou Abdoulaye Barry
    Youness Manzali
    Mohamed Lamrini
    Flouchi Rachid
    Mohamed Elfar
    Operations Research Forum, 5 (3)
  • [35] Microarray Data Classification using Fuzzy K-Nearest Neighbor
    Kumar, Mukesh
    Rath, Santanu Ku
    2014 INTERNATIONAL CONFERENCE ON CONTEMPORARY COMPUTING AND INFORMATICS (IC3I), 2014, : 1032 - 1038
  • [36] A Modified K-Nearest Neighbor Algorithm to Handle Uncertain Data
    Agrawal, Rashmi
    Ram, Babu
    2015 5TH INTERNATIONAL CONFERENCE ON IT CONVERGENCE AND SECURITY (ICITCS), 2015,
  • [37] Using k-Nearest Neighbor and Speaker Ranking for Phoneme Prediction
    Rizwan, Muhammad
    Anderson, David V.
    2014 13TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2014, : 383 - 387
  • [38] K-Nearest Neighbor Learning based Diabetes Mellitus Prediction and Analysis for eHealth Services
    Sarker, Iqbal H.
    Faruque, Md Faisal
    Alqahtant, Hamed
    Kalim, Asra
    EAI ENDORSED TRANSACTIONS ON SCALABLE INFORMATION SYSTEMS, 2020, 7 (26) : 1 - 9
  • [39] A Proposal for Local k Values for k-Nearest Neighbor Rule
    Garcia-Pedrajas, Nicolas
    Romero del Castillo, Juan A.
    Cerruela-Garcia, Gonzalo
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2017, 28 (02) : 470 - 475
  • [40] Modified K-nearest Neighbor Algorithm with Variant K Values
    Waghmare, Kalyani C.
    Sonkamble, Balwant A.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (10) : 220 - 224