Improving Intrusion Detection Using PCA And K-Means Clustering Algorithm

被引:2
|
作者
Khaoula, Radi [1 ]
Mohamed, Moughit [1 ]
机构
[1] Sultan Moulay Slimane Univ, LaSTI Lab, Natl Sch Appl Sci, Khouribga, Morocco
来源
2022 9TH INTERNATIONAL CONFERENCE ON WIRELESS NETWORKS AND MOBILE COMMUNICATIONS, WINCOM | 2022年
关键词
Intrusion Detection System; K-means; WEKA; Machine Learning; PCA; NSL-KDD dataset;
D O I
10.1109/WINCOM55661.2022.9966426
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the last few years, the internet has been growing at an exponential rate, which has generated a severe increase in network attacks. So, to provide necessary security, an intrusion detection system (IDS) is used to detect malicious traffic and prevent attacks from various data sources. For this aim, clustering is the simple and reliable method in machine learning to detect intrusions in the case of unlabeled data, in addition to detecting unknown and new types of intrusions. In this paper, we are analyzing the NSL-KDD dataset, which is an improved version of its predecessor, the KDD-99 dataset, using the K-Means clustering algorithm. We compare the results by first using correlation as a feature selection method to eliminate redundant and irrelevant attributes in our data set, and then by increasing interpretability while minimizing information loss using the dimensionality reduction method of Principal Component Analysis (PCA). The analysis was done using Python and the data mining tool WEKA. Results are shown to have an improved accuracy after using PCA over K-means clustering. Our main objective is to provide a better model of IDS using machine learning, especially clustering methods.
引用
收藏
页码:19 / 23
页数:5
相关论文
共 50 条
  • [31] K*-Means: An Effective and Efficient K-means Clustering Algorithm
    Qi, Jianpeng
    Yu, Yanwei
    Wang, Lihong
    Liu, Jinglei
    PROCEEDINGS OF 2016 IEEE INTERNATIONAL CONFERENCES ON BIG DATA AND CLOUD COMPUTING (BDCLOUD 2016) SOCIAL COMPUTING AND NETWORKING (SOCIALCOM 2016) SUSTAINABLE COMPUTING AND COMMUNICATIONS (SUSTAINCOM 2016) (BDCLOUD-SOCIALCOM-SUSTAINCOM 2016), 2016, : 242 - 249
  • [32] A Comparison of Intrusion Detection by K-Means and Fuzzy C-Means Clustering Algorithm over the NSL-KDD Dataset
    Bhattacharjee, Partha Sarathi
    Fujail, Abul Kashim Md
    Begum, Shahin Ara
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMPUTING RESEARCH (ICCIC), 2017, : 1084 - 1089
  • [33] Statistically Improving K-means Clustering Performance
    Ihsanoglu, Abdullah
    Zaval, Mounes
    32ND IEEE SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, SIU 2024, 2024,
  • [34] IMPROVEMENT IN K-MEANS CLUSTERING ALGORITHM FOR DATA CLUSTERING
    Rajeswari, K.
    Acharya, Omkar
    Sharma, Mayur
    Kopnar, Mahesh
    Karandikar, Kiran
    1ST INTERNATIONAL CONFERENCE ON COMPUTING COMMUNICATION CONTROL AND AUTOMATION ICCUBEA 2015, 2015, : 367 - 369
  • [35] A Novel Approach for Medical Image Segmentation using PCA and K-means Clustering
    Katkar, Juilee
    Baraskar, Trupti
    Mankar, Vijay R.
    PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON APPLIED AND THEORETICAL COMPUTING AND COMMUNICATION TECHNOLOGY (ICATCCT), 2015, : 430 - 435
  • [36] Representing the New Model for Improving K-Means Clustering Algorithm based on Genetic Algorithm
    Maghsoudi, Rouhollah
    Delavar, Arash Ghorbannia
    Hoseyny, Somayye
    Asgari, Rahmatollah
    Heidari, Yaghub
    JOURNAL OF MATHEMATICS AND COMPUTER SCIENCE-JMCS, 2011, 2 (02): : 329 - 336
  • [37] Underground Electrical Profile Clustering Using K-MEANS Algorithm
    Kutbay, Ugurhan
    Ural, Ali Berkan
    Hardalac, Firat
    2015 23RD SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2015, : 561 - 564
  • [38] An Approach for Document Clustering using PSO and K-means Algorithm
    Chouhan, Rashmi
    Purohit, Anuradha
    PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON INVENTIVE SYSTEMS AND CONTROL (ICISC 2018), 2018, : 1380 - 1384
  • [39] Unsupervised detection of InSAR time series patterns based on PCA and K-means clustering
    Festa, Davide
    Novellino, Alessandro
    Hussain, Ekbal
    Bateson, Luke
    Casagli, Nicola
    Confuorto, Pierluigi
    Del Soldato, Matteo
    Raspini, Federico
    INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2023, 118
  • [40] DETERMINISTIC INITIALIZATION OF THE K-MEANS ALGORITHM USING HIERARCHICAL CLUSTERING
    Celebi, M. Emre
    Kingravi, Hassan A.
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2012, 26 (07)