Analyzing the Impact of Principal Component Analysis on k-Nearest Neighbors and Naive Bayes Classification Algorithms

被引:0
|
作者
Macionczyk, Rafal [1 ]
Moryc, Michal [1 ]
Buchtyar, Patryk [1 ]
机构
[1] Silesian Tech Univ, Fac Appl Math, Kaszubska 23, PL-44100 Gliwice, Poland
关键词
PCA Algorithm; kNN Algorithm; Naive Bayes Classification Algorithms; analyze; impact; effect;
D O I
10.1007/978-3-031-48981-5_20
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Principal Component Analysis (PCA) is a well-known dimensionality reduction technique that has been widely used in various machine learning algorithms. This includes kNN and Naive Bayes algorithms which can be time-consuming. The reduction of dimensions can have positive effects on those two algorithms by reducing the number of related types of data and decreasing the data they need to analyze. Here we present detailed findings about how the PCA algorithm affects them both in time efficiency and accuracy. All calculations regarding those values were carried out in Python programming language. The dataset used in research is the Titanic dataset, on which data cleaning and normalization were done. The data in this paper suggests that it is possible to maintain the same level of accuracy with great improvement in time efficiency. For the kNN algorithm reducing the number of dimensions by one resulted in a 31.09% increase in accuracy and for the Naive Bayes algorithm an 18.18% increase while having an imperceptible effect on accuracy.
引用
收藏
页码:247 / 263
页数:17
相关论文
共 50 条
  • [1] Implicit Links-Based Techniques to Enrich K-Nearest Neighbors and Naive Bayes Algorithms forWeb Page Classification
    Belmouhcine, Abdelbadie
    Benkhalifa, Mohammed
    PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON COMPUTER RECOGNITION SYSTEMS, CORES 2015, 2016, 403 : 755 - 766
  • [2] Comparison of Support Vector Machine, Naive Bayes, and K-Nearest Neighbors Algorithms for Classifying Heart Disease
    Lewandowicz, Bartosz
    Kisiala, Konrad
    INFORMATION AND SOFTWARE TECHNOLOGIES, ICIST 2023, 2024, 1979 : 274 - 285
  • [3] Accelerated k-nearest neighbors algorithm based on principal component analysis for text categorization
    Min DU
    Xing-shu CHEN
    JournalofZhejiangUniversity-ScienceC(Computers&Electronics), 2013, 14 (06) : 407 - 416
  • [4] Accelerated k-nearest neighbors algorithm based on principal component analysis for text categorization
    Min DU
    Xing-shu CHEN
    Frontiers of Information Technology & Electronic Engineering, 2013, (06) : 407 - 416
  • [5] Accelerated k-nearest neighbors algorithm based on principal component analysis for text categorization
    Min Du
    Xing-shu Chen
    Journal of Zhejiang University SCIENCE C, 2013, 14 : 407 - 416
  • [6] Study on density peaks clustering based on k-nearest neighbors and principal component analysis
    Du, Mingjing
    Ding, Shifei
    Jia, Hongjie
    KNOWLEDGE-BASED SYSTEMS, 2016, 99 : 135 - 145
  • [7] Accelerated k-nearest neighbors algorithm based on principal component analysis for text categorization
    Du, Min
    Chen, Xing-shu
    JOURNAL OF ZHEJIANG UNIVERSITY-SCIENCE C-COMPUTERS & ELECTRONICS, 2013, 14 (06): : 407 - 416
  • [8] Choose of Wart Treatment Method Using Naive Bayes and k-Nearest Neighbors Classifiers
    Uzun, Rukiye
    Isler, Yalcin
    Toksan, Mualla
    2018 26TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2018,
  • [9] Internet Traffic Detection using Naive Bayes and K-Nearest Neighbors (KNN) algorithm
    Dixit, Mrudul
    Sharma, Ritu
    Shaikh, Saniya
    Muley, Krutika
    PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND CONTROL SYSTEMS (ICCS), 2019, : 1153 - 1157
  • [10] Classification with learning k-nearest neighbors
    Laaksonen, J
    Oja, E
    ICNN - 1996 IEEE INTERNATIONAL CONFERENCE ON NEURAL NETWORKS, VOLS. 1-4, 1996, : 1480 - 1483