Improving Intrusion Detection Using PCA And K-Means Clustering Algorithm

被引:2
|
作者
Khaoula, Radi [1 ]
Mohamed, Moughit [1 ]
机构
[1] Sultan Moulay Slimane Univ, LaSTI Lab, Natl Sch Appl Sci, Khouribga, Morocco
关键词
Intrusion Detection System; K-means; WEKA; Machine Learning; PCA; NSL-KDD dataset;
D O I
10.1109/WINCOM55661.2022.9966426
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the last few years, the internet has been growing at an exponential rate, which has generated a severe increase in network attacks. So, to provide necessary security, an intrusion detection system (IDS) is used to detect malicious traffic and prevent attacks from various data sources. For this aim, clustering is the simple and reliable method in machine learning to detect intrusions in the case of unlabeled data, in addition to detecting unknown and new types of intrusions. In this paper, we are analyzing the NSL-KDD dataset, which is an improved version of its predecessor, the KDD-99 dataset, using the K-Means clustering algorithm. We compare the results by first using correlation as a feature selection method to eliminate redundant and irrelevant attributes in our data set, and then by increasing interpretability while minimizing information loss using the dimensionality reduction method of Principal Component Analysis (PCA). The analysis was done using Python and the data mining tool WEKA. Results are shown to have an improved accuracy after using PCA over K-means clustering. Our main objective is to provide a better model of IDS using machine learning, especially clustering methods.
引用
收藏
页码:19 / 23
页数:5
相关论文
共 50 条
  • [31] An Efficient Brain Tumor Detection Methodology Using K-Means Clustering Algorithm
    Vijay, J.
    Subhashini, J.
    2013 INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND SIGNAL PROCESSING (ICCSP), 2013, : 653 - 657
  • [32] Clustering Using Boosted Constrained k-Means Algorithm
    Okabe, Masayuki
    Yamada, Seiji
    FRONTIERS IN ROBOTICS AND AI, 2018, 5
  • [33] Improved Document Clustering using K-means Algorithm
    Bide, Pramod
    Shedge, Rajashree
    2015 IEEE INTERNATIONAL CONFERENCE ON ELECTRICAL, COMPUTER AND COMMUNICATION TECHNOLOGIES, 2015,
  • [34] Optimization of K-Means clustering Using Genetic Algorithm
    Irfan, Shadab
    Dwivedi, Gaurav
    Ghosh, Subhajit
    2017 INTERNATIONAL CONFERENCE ON COMPUTING AND COMMUNICATION TECHNOLOGIES FOR SMART NATION (IC3TSN), 2017, : 157 - 162
  • [35] RACK: RApid Clustering using K-means algorithm
    Garg, Vikas K.
    Murty, M. N.
    2009 IEEE INTERNATIONAL CONFERENCE ON AUTOMATION SCIENCE AND ENGINEERING, 2009, : 621 - 626
  • [36] Colour Constancy using K-means Clustering Algorithm
    Hussain, Md. Akmol
    Akbari, Akbar Sheikh
    Ghaffari, Ahmad
    2016 9TH INTERNATIONAL CONFERENCE ON DEVELOPMENTS IN ESYSTEMS ENGINEERING (DESE 2016), 2016, : 283 - 288
  • [37] Image Segmentation using K-means Clustering Algorithm and Subtractive Clustering Algorithm
    Dhanachandra, Nameirakpam
    Manglem, Khumanthem
    Chanu, Yambem Jina
    ELEVENTH INTERNATIONAL CONFERENCE ON COMMUNICATION NETWORKS, ICCN 2015/INDIA ELEVENTH INTERNATIONAL CONFERENCE ON DATA MINING AND WAREHOUSING, ICDMW 2015/NDIA ELEVENTH INTERNATIONAL CONFERENCE ON IMAGE AND SIGNAL PROCESSING, ICISP 2015, 2015, 54 : 764 - 771
  • [38] Improving Arabic Document Clustering using K-Means Algorithm and Particle Swarm Optimization
    Daoud, Abdullah S.
    Sallam, Ahmed
    Wheed, Mohamed E.
    PROCEEDINGS OF THE 2017 INTELLIGENT SYSTEMS CONFERENCE (INTELLISYS), 2017, : 879 - 885
  • [40] Improving Business Intelligence Based on Frequent Itemsets Using k-Means Clustering Algorithm
    Paulraj, Prabhu
    Neelamegam, Anbazhagan
    NETWORKS AND COMMUNICATIONS (NETCOM2013), 2014, 284 : 243 - 254