A novel K-nearest neighbor classifier for lung cancer disease diagnosis

Authors
Sachdeva, Ravi Kumar [1]
Bathla, Priyanka [2]
Rani, Pooja [3]
Lamba, Rohit [4]
Ghantasala, G. S. Pradeep [5]
Nassar, Ibrahim F. [6]
Affiliations
[1] Chitkara University Institute of Engineering and Technology, Chitkara University, Punjab, Rajpura
[2] Chandigarh University, Punjab, Gharuan, Mohali
[3] MMICTBM, Maharishi Markandeshwar (Deemed to be University), Haryana, Mullana, Ambala
[4] Department of Electronics and Communication Engineering, MMEC, Maharishi Markandeshwar (Deemed to be University), Haryana, Mullana, Ambala
[5] Department of Computer Science and Engineering, Alliance College of Engineering and Design, Alliance University, Bengaluru
[6] Faculty of Specific Education, Ain Shams University, 365 Ramsis Street, Abassia, Cairo
Keywords
KNN; LR; Lung cancer; Machine learning; NB; PCWKNN; RF; SVM
DOI
10.1007/s00521-024-10235-w
Abstract
Lung cancer is one of the world's deadliest diseases. Machine learning techniques can support its diagnosis from a small number of features. Using a dataset available on Kaggle, the authors evaluated several classifiers, namely support vector machine (SVM), logistic regression (LR), Naïve Bayes (NB), random forest (RF), and K-nearest neighbor (KNN), to build a systematic approach for diagnosing lung cancer from readily observable signs and historical medical data, without requiring CT scan images. They propose a novel classification approach, Pearson correlation weighted KNN (PCWKNN), a modified KNN that uses Pearson correlation coefficient values to determine the weights in a weighted KNN. Classifier performance was evaluated using hold-out validation. SVM, LR, and RF each achieved 96.77% accuracy, NB 95.16%, and KNN 91.93%. PCWKNN outperformed all of these classifiers with an accuracy of 98.39%. To assess generalization, the researchers also applied PCWKNN to a second, larger lung cancer dataset and then to other diseases, including a brain stroke dataset. The encouraging results point to PCWKNN's robustness and adaptability and suggest that it is viable for real-world use. © The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature 2024.
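The abstract does not spell out how the Pearson coefficients enter the weighted KNN, so the following is only a minimal sketch of one plausible reading, not the authors' implementation. It assumes that each feature is weighted by the absolute Pearson correlation between that feature and the class label, and that those weights scale a Euclidean distance before a majority vote. The function names `pearson_feature_weights` and `weighted_knn_predict`, the choice of `k=3`, and the toy data are all hypothetical.

```python
import numpy as np


def pearson_feature_weights(X, y):
    """Absolute Pearson correlation of each feature column with the label.

    Assumption: PCWKNN uses feature-label correlations as feature weights.
    """
    Xc = X - X.mean(axis=0)
    yc = y - y.mean()
    denom = np.sqrt((Xc ** 2).sum(axis=0) * (yc ** 2).sum())
    # A constant feature has zero variance; give it weight 0 instead of NaN.
    safe = np.where(denom > 0, denom, 1.0)
    r = np.where(denom > 0, (Xc * yc[:, None]).sum(axis=0) / safe, 0.0)
    return np.abs(r)


def weighted_knn_predict(X_train, y_train, X_test, weights, k=3):
    """Majority vote among the k nearest training points, using a
    feature-weighted Euclidean distance (assumed form of the weighting)."""
    preds = []
    for x in X_test:
        d = np.sqrt((weights * (X_train - x) ** 2).sum(axis=1))
        nearest = np.argsort(d, kind="stable")[:k]
        labels, counts = np.unique(y_train[nearest], return_counts=True)
        preds.append(labels[np.argmax(counts)])
    return np.array(preds)


# Toy data (hypothetical): feature 0 tracks the label, feature 1 does not.
X = np.array([[0, 5], [0, 1], [0, 9], [1, 5], [1, 1], [1, 9]], dtype=float)
y = np.array([0, 0, 0, 1, 1, 1])

w = pearson_feature_weights(X, y)

# Two held-out points, classified with the correlation-derived weights.
X_new = np.array([[0.9, 1.0], [0.1, 9.0]])
preds = weighted_knn_predict(X, y, X_new, w, k=3)
print("weights:", w)
print("predictions:", preds)
```

In this sketch the uninformative feature receives a weight near zero, so it contributes almost nothing to the distance. Other readings are equally consistent with the abstract, for example weighting neighbor votes rather than features.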
Pages: 22403-22416
Number of pages: 13