Effective K-Nearest Neighbor Algorithms Performance Analysis of Thyroid Disease

被引:19
|
作者
Abbad Ur Rehman, Hafiz [1 ]
Lin, Chyi-Yeu [1 ]
Mushtaq, Zohaib [2 ]
机构
[1] Natl Taiwan Univ Sci & Technol, Dept Mech Engn, Taipei, Taiwan
[2] Natl Taiwan Univ Sci & Technol, Dept Elect Engn, Taipei, Taiwan
关键词
Classification; thyroid disease; k-nearest neighbor; feature selection; SYSTEM;
D O I
10.1080/02533839.2020.1831967
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Thyroid is an essential gland as its hormones are controlling the metabolism system of the human body. An abnormal amount of thyroid gland secretion causes two major types of diseases which are hyperthyroidism and hypothyroidism. In this research study, the implementation of K-Nearest neighbor (KNN) with its various distance functions is presented to detect thyroid disease. The proposed study consists of three phases, which are KNN without feature selection, KNN using L-1-based feature selection, and KNN using chi-square-based feature selection techniques. Thyroid datasets from KEEL dataset repository and another from a registered hospital in Pakistan were used in this study. The new dataset was distinguished from existing datasets as it included three additional features, i.e., pulse rate, Body Mass Index (BMI), and Blood Pressure (BP). Various distance functions were used to analyze the performance of the KNN model on these two datasets. Performance evaluation metrics have been used to discuss the achievement of the classifier. The optimal range of k values from the results are described between 1 and 5. Euclidean and Cosine distance functions achieved the highest accuracy using chi-square-based feature selection technique for new dataset as compared to existing datasets.
引用
收藏
页码:77 / 87
页数:11
相关论文
共 50 条
  • [1] Comparative Analysis of K-Nearest Neighbor and Modified K-Nearest Neighbor Algorithm for Data Classification
    Okfalisa
    Mustakim
    Gazalba, Ikbal
    Reza, Nurul Gayatri Indah
    2017 2ND INTERNATIONAL CONFERENCES ON INFORMATION TECHNOLOGY, INFORMATION SYSTEMS AND ELECTRICAL ENGINEERING (ICITISEE): OPPORTUNITIES AND CHALLENGES ON BIG DATA FUTURE INNOVATION, 2017, : 294 - 298
  • [2] Analysis of the k-nearest neighbor classification
    Li, Jing
    Cheng, Ming
    INFORMATION SCIENCE AND MANAGEMENT ENGINEERING, VOLS 1-3, 2014, 46 : 1911 - 1917
  • [3] Fuzzy Monotonic K-Nearest Neighbor Versus Monotonic Fuzzy K-Nearest Neighbor
    Zhu, Hong
    Wang, Xizhao
    Wang, Ran
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2022, 30 (09) : 3501 - 3513
  • [4] A comparison between k-nearest neighbor and jk-nearest neighbor algorithms for signature verification
    Saleem, Mohammad
    Kovari, Bence
    2022 21ST INTERNATIONAL SYMPOSIUM INFOTEH-JAHORINA (INFOTEH), 2022,
  • [5] Effective K-nearest neighbor classifications for Wisconsin breast cancer data sets
    Mushtaq, Zohaib
    Yaqub, Akbari
    Sani, Shaima
    Khalid, Adnan
    JOURNAL OF THE CHINESE INSTITUTE OF ENGINEERS, 2020, 43 (01) : 80 - 92
  • [6] An Improved k-Nearest Neighbor Algorithm for Recognition and Classification of Thyroid Nodules
    Ma, Xuesi
    Han, Xiang
    Zhang, Lina
    JOURNAL OF ULTRASOUND IN MEDICINE, 2024, 43 (06) : 1025 - 1036
  • [7] Effective k-nearest neighbor models for data classification enhancement
    Ali A. Amer
    Sri Devi Ravana
    Riyaz Ahamed Ariyaluran Habeeb
    Journal of Big Data, 12 (1)
  • [8] Exact bootstrap k-nearest neighbor learners
    Steele, Brian M.
    MACHINE LEARNING, 2009, 74 (03) : 235 - 255
  • [9] Implementation of the K-Nearest Neighbor Algorithm for Detecting Heart Attack Disease
    Sitanggang, Delima
    Indra, Evta
    Gulo, Juan Hardoni
    Turnip, Mardi
    INTERNETWORKING INDONESIA, 2021, 13 (02): : 35 - 41
  • [10] A novel ensemble method for k-nearest neighbor
    Zhang, Youqiang
    Cao, Guo
    Wang, Bisheng
    Li, Xuesong
    PATTERN RECOGNITION, 2019, 85 : 13 - 25