A novel K-nearest neighbor classifier for lung cancer disease diagnosis

被引:0
|
作者
Sachdeva, Ravi Kumar [1 ]
Bathla, Priyanka [2 ]
Rani, Pooja [3 ]
Lamba, Rohit [4 ]
Ghantasala, G. S. Pradeep [5 ]
Nassar, Ibrahim F. [6 ]
机构
[1] Chitkara University Institute of Engineering and Technology, Chitkara University, Punjab, Rajpura
[2] Chandigarh University, Punjab, Gharuan, Mohali
[3] MMICTBM, Maharishi Markandeshwar (Deemed to be University), Haryana, Mullana, Ambala
[4] Department of Electronics and Communication Engineering, MMEC, Maharishi Markandeshwar (Deemed to be University), Haryana, Mullana, Ambala
[5] Department of Computer Science and Engineering, Alliance College of Engineering and Design, Alliance University, Bengaluru
[6] Faculty of Specific Education, Ain Shams University, 365 Ramsis Street, Abassia, Cairo
关键词
KNN; LR; Lung cancer; Machine learning; NB; PCWKNN; RF; SVM;
D O I
10.1007/s00521-024-10235-w
中图分类号
学科分类号
摘要
One of the world's deadliest diseases is lung cancer. Based on a few features, machine learning techniques can help in the diagnosis of lung cancer. The performance of several classifiers: support vector machine (SVM), logistic regression (LR), Naïve Bayes (NB), random forest (RF), and K-nearest neighbor (KNN), was evaluated by the authors using the dataset available on Kaggle to create a systematic approach for the diagnosis of lung cancer disease based on readily observable signs and historical medical data without the requirement of CT scan images. The authors have proposed a novel approach for classification called Pearson correlation weighted KNN (PCWKNN), which is a modified version of KNN and uses Pearson correlation coefficient values to determine weights in a weighted KNN. The performance of the classifiers was evaluated using the hold-out validation method. SVM, LR, and RF were 96.77% accurate. NB obtained 95.16% accuracy. KNN achieved 91.93% accuracy. PCWKNN outperformed the employed classifiers and obtained an accuracy of 98.39%. Addressing the imperative for improved model generalization, the researchers utilized PCWKNN on an alternative, more extensive lung cancer dataset and subsequently broadened its application to diverse diseases, including the brain stroke dataset. The encouraging outcomes underscore PCWKNN's resilience and adaptability, suggesting its viability for real-world implementation. © The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature 2024.
引用
收藏
页码:22403 / 22416
页数:13
相关论文
共 50 条
  • [11] An Evidential K-Nearest Neighbor Classifier Based on Contextual Discounting and Likelihood Maximization
    Kanjanatarakul, Orakanya
    Kuson, Siwarat
    Denoeux, Thierry
    BELIEF FUNCTIONS: THEORY AND APPLICATIONS, BELIEF 2018, 2018, 11069 : 155 - 162
  • [12] Validation of k-Nearest Neighbor Classifiers
    Bax, Eric
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2012, 58 (05) : 3225 - 3234
  • [13] Quantum K-nearest neighbor algorithm
    Chen, Hanwu
    Gao, Yue
    Zhang, Jun
    Dongnan Daxue Xuebao (Ziran Kexue Ban)/Journal of Southeast University (Natural Science Edition), 2015, 45 (04): : 647 - 651
  • [14] Enhanced K-Nearest Neighbor for Intelligent Fault Diagnosis of Rotating Machinery
    Lu, Jiantao
    Qian, Weiwei
    Li, Shunming
    Cui, Rongqing
    APPLIED SCIENCES-BASEL, 2021, 11 (03): : 1 - 15
  • [15] Intrusion Detection System for IP Multimedia Subsystem Using K-Nearest Neighbor classifier
    Farooqi, Ashfaq Hussain
    Munir, Ali
    INMIC: 2008 INTERNATIONAL MULTITOPIC CONFERENCE, 2008, : 423 - 428
  • [16] An Approach for Fault Diagnosis Based on an Improved k-Nearest Neighbor Algorithm
    Yu Feng
    Liu Lian-chang
    Liu Dong-ming
    PROCEEDINGS OF THE 35TH CHINESE CONTROL CONFERENCE 2016, 2016, : 6521 - 6525
  • [17] Generalizing fuzzy k-nearest neighbor classifier using an OWA operator with a RIM quantifier
    Kumbure, Mahinda Mailagaha
    Luukka, Pasi
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 282
  • [18] Fake News Detection Using LDA Topic Modelling and K-Nearest Neighbor Classifier
    Casillo, Mario
    Colace, Francesco
    Gupta, Brij B.
    Santaniello, Domenico
    Valentino, Carmine
    COMPUTATIONAL DATA AND SOCIAL NETWORKS, CSONET 2021, 2021, 13116 : 330 - 339
  • [19] bSRWPSO-FKNN: A boosted PSO with fuzzy K-nearest neighbor classifier for predicting atopic dermatitis disease
    Li, Yupeng
    Zhao, Dong
    Xu, Zhangze
    Heidari, Ali Asghar
    Chen, Huiling
    Jiang, Xinyu
    Liu, Zhifang
    Wang, Mengmeng
    Zhou, Qiongyan
    Xu, Suling
    FRONTIERS IN NEUROINFORMATICS, 2023, 16
  • [20] EVOLVING EDITED k-NEAREST NEIGHBOR CLASSIFIERS
    Gil-Pita, Roberto
    Yao, Xin
    INTERNATIONAL JOURNAL OF NEURAL SYSTEMS, 2008, 18 (06) : 459 - 467