A novel K-nearest neighbor classifier for lung cancer disease diagnosis

Authors
Sachdeva, Ravi Kumar [1]
Bathla, Priyanka [2]
Rani, Pooja [3]
Lamba, Rohit [4]
Ghantasala, G. S. Pradeep [5]
Nassar, Ibrahim F. [6]
Affiliations
[1] Chitkara University Institute of Engineering and Technology, Chitkara University, Punjab, Rajpura
[2] Chandigarh University, Punjab, Gharuan, Mohali
[3] MMICTBM, Maharishi Markandeshwar (Deemed to be University), Haryana, Mullana, Ambala
[4] Department of Electronics and Communication Engineering, MMEC, Maharishi Markandeshwar (Deemed to be University), Haryana, Mullana, Ambala
[5] Department of Computer Science and Engineering, Alliance College of Engineering and Design, Alliance University, Bengaluru
[6] Faculty of Specific Education, Ain Shams University, 365 Ramsis Street, Abassia, Cairo
Keywords
KNN; LR; Lung cancer; Machine learning; NB; PCWKNN; RF; SVM
DOI
10.1007/s00521-024-10235-w
Abstract
Lung cancer is one of the world's deadliest diseases, and machine learning techniques can support its diagnosis from a small number of features. To build a systematic diagnostic approach based on readily observable signs and historical medical data, without requiring CT scan images, the authors evaluated five classifiers on a dataset available on Kaggle: support vector machine (SVM), logistic regression (LR), Naïve Bayes (NB), random forest (RF), and K-nearest neighbor (KNN). They also propose a novel classifier, Pearson correlation weighted KNN (PCWKNN), a modified KNN that uses Pearson correlation coefficient values to determine the weights in a weighted KNN. All classifiers were evaluated using hold-out validation. SVM, LR, and RF each reached 96.77% accuracy, NB reached 95.16%, and KNN reached 91.93%; PCWKNN outperformed all of them with 98.39%. To assess generalization, the authors applied PCWKNN to a second, larger lung cancer dataset and then to datasets for other diseases, including a brain stroke dataset. The encouraging results suggest that PCWKNN is robust and adaptable, and potentially viable for real-world use. © The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature 2024.
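The abstract does not spell out how the Pearson coefficients enter the classifier, so the following is only an illustrative sketch of one plausible reading, not the authors' implementation. It assumes binary 0/1 labels, takes the absolute Pearson correlation between each feature and the label as that feature's weight, and uses those weights in a weighted Euclidean distance followed by a majority vote. The function names `pearson_feature_weights` and `pcwknn_predict`, the value of `k`, and the synthetic data are all hypothetical.

```python
import numpy as np

def pearson_feature_weights(X, y):
    """Absolute Pearson correlation of each feature column with the label.

    Assumption: this weighting scheme is a guess at what "Pearson
    correlation weighted" means; the paper may define it differently.
    """
    Xc = X - X.mean(axis=0)
    yc = y - y.mean()
    denom = np.sqrt((Xc ** 2).sum(axis=0) * (yc ** 2).sum())
    # Zero-variance features have an undefined correlation; weight them 0.
    safe = np.where(denom > 0, denom, 1.0)
    r = np.where(denom > 0, (Xc * yc[:, None]).sum(axis=0) / safe, 0.0)
    return np.abs(r)

def pcwknn_predict(X_train, y_train, X_test, k=5):
    """Majority-vote KNN using a correlation-weighted Euclidean distance."""
    w = pearson_feature_weights(X_train, y_train.astype(float))
    diff = X_test[:, None, :] - X_train[None, :, :]
    dist = np.sqrt((w * diff ** 2).sum(axis=2))
    nearest = np.argsort(dist, axis=1)[:, :k]
    votes = y_train[nearest]
    # Binary labels: predict 1 when more than half of the k neighbors are 1.
    return (votes.mean(axis=1) > 0.5).astype(int)

# Hold-out validation on synthetic data (not the Kaggle dataset).
rng = np.random.default_rng(0)
n = 200
y = rng.integers(0, 2, size=n)
X = np.column_stack([y + 0.3 * rng.normal(size=n),   # informative feature
                     rng.normal(size=(n, 3))])       # uninformative features
order = rng.permutation(n)
train, test = order[:140], order[140:]
pred = pcwknn_predict(X[train], y[train], X[test], k=5)
accuracy = float((pred == y[test]).mean())
print(f"hold-out accuracy: {accuracy:.3f}")
```

Weighting features this way shrinks the contribution of columns that are nearly uncorrelated with the label, which is one reason a weighted KNN can outperform plain KNN on tabular symptom data. An odd `k` avoids tied votes in the binary case.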
Pages: 22403-22416
Page count: 13