Effective K-Nearest Neighbor Algorithms Performance Analysis of Thyroid Disease

被引:21
作者
Abbad Ur Rehman, Hafiz [1 ]
Lin, Chyi-Yeu [1 ]
Mushtaq, Zohaib [2 ]
机构
[1] Natl Taiwan Univ Sci & Technol, Dept Mech Engn, Taipei, Taiwan
[2] Natl Taiwan Univ Sci & Technol, Dept Elect Engn, Taipei, Taiwan
关键词
Classification; thyroid disease; k-nearest neighbor; feature selection; SYSTEM;
D O I
10.1080/02533839.2020.1831967
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Thyroid is an essential gland as its hormones are controlling the metabolism system of the human body. An abnormal amount of thyroid gland secretion causes two major types of diseases which are hyperthyroidism and hypothyroidism. In this research study, the implementation of K-Nearest neighbor (KNN) with its various distance functions is presented to detect thyroid disease. The proposed study consists of three phases, which are KNN without feature selection, KNN using L-1-based feature selection, and KNN using chi-square-based feature selection techniques. Thyroid datasets from KEEL dataset repository and another from a registered hospital in Pakistan were used in this study. The new dataset was distinguished from existing datasets as it included three additional features, i.e., pulse rate, Body Mass Index (BMI), and Blood Pressure (BP). Various distance functions were used to analyze the performance of the KNN model on these two datasets. Performance evaluation metrics have been used to discuss the achievement of the classifier. The optimal range of k values from the results are described between 1 and 5. Euclidean and Cosine distance functions achieved the highest accuracy using chi-square-based feature selection technique for new dataset as compared to existing datasets.
引用
收藏
页码:77 / 87
页数:11
相关论文
共 34 条
[1]  
Alcalá-Fdez J, 2011, J MULT-VALUED LOG S, V17, P255
[2]  
Alivar A, 2014, 2014 4TH INTERNATIONAL CONFERENCE ON COMPUTER AND KNOWLEDGE ENGINEERING (ICCKE), P669, DOI 10.1109/ICCKE.2014.6993434
[3]  
Amercian Thyroid Association, 2020, THYR FUNCT TESTS
[4]  
Amercian Thyroid Association, 2020, THYR PAT INF
[5]  
Aswathi AK, 2018, PROCEEDINGS OF THE 2018 SECOND INTERNATIONAL CONFERENCE ON INVENTIVE COMMUNICATION AND COMPUTATIONAL TECHNOLOGIES (ICICCT), P1261, DOI 10.1109/ICICCT.2018.8473349
[6]  
Begum BA, 2019, INT CONF ADVAN COMPU, P342, DOI [10.1109/ICACCS.2019.8728320, 10.1109/icaccs.2019.8728320]
[7]  
Caesarendra W, 2015, IEEE ASME INT C ADV, P802, DOI 10.1109/AIM.2015.7222636
[8]   A comparative study on thyroid disease detection using K-nearest neighbor and Naive Bayes classification techniques [J].
Khushboo Chandel ;
Veenita Kunwar ;
Sai Sabitha ;
Tanupriya Choudhury ;
Saurabh Mukherjee .
CSI Transactions on ICT, 2016, 4 (2-4) :313-319
[9]   A comparative study on thyroid disease detection using K-nearest neighbor and Naive Bayes classification techniques [J].
Khushboo Chandel ;
Veenita Kunwar ;
Sai Sabitha ;
Tanupriya Choudhury ;
Saurabh Mukherjee .
CSI Transactions on ICT, 2016, 4 (2-4) :313-319
[10]   A Three-Stage Expert System Based on Support Vector Machines for Thyroid Disease Diagnosis [J].
Chen, Hui-Ling ;
Yang, Bo ;
Wang, Gang ;
Liu, Jie ;
Chen, Yi-Dong ;
Liu, Da-You .
JOURNAL OF MEDICAL SYSTEMS, 2012, 36 (03) :1953-1963