Improving Intrusion Detection Using PCA And K-Means Clustering Algorithm

Cited by: 2
Authors
Khaoula, Radi [1]
Mohamed, Moughit [1]
Affiliations
[1] Sultan Moulay Slimane Univ, LaSTI Lab, Natl Sch Appl Sci, Khouribga, Morocco
Source
2022 9TH INTERNATIONAL CONFERENCE ON WIRELESS NETWORKS AND MOBILE COMMUNICATIONS, WINCOM | 2022
Keywords
Intrusion Detection System; K-means; WEKA; Machine Learning; PCA; NSL-KDD dataset;
DOI
10.1109/WINCOM55661.2022.9966426
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
In the last few years, the internet has grown at an exponential rate, which has led to a sharp increase in network attacks. To provide the necessary security, an intrusion detection system (IDS) is used to detect malicious traffic and prevent attacks from various data sources. To this end, clustering is a simple and reliable machine learning method for detecting intrusions in unlabeled data, including unknown and new types of intrusions. In this paper, we analyze the NSL-KDD dataset, an improved version of its predecessor, the KDD-99 dataset, using the K-Means clustering algorithm. We compare the results obtained by first using correlation as a feature selection method to eliminate redundant and irrelevant attributes in our dataset, and then by applying Principal Component Analysis (PCA), a dimensionality reduction method that increases interpretability while minimizing information loss. The analysis was done using Python and the data mining tool WEKA. The results show improved accuracy when PCA is used with K-means clustering. Our main objective is to provide a better IDS model using machine learning, especially clustering methods.
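The pipeline the abstract describes (correlation-based feature selection, PCA, then K-means) might be sketched in Python roughly as follows. This is only an illustration under stated assumptions, not the authors' implementation: the data is a small synthetic stand-in rather than NSL-KDD, scikit-learn is assumed as the library, and the correlation threshold of 0.9, the two principal components, the two clusters, and the majority-vote accuracy measure are all hypothetical choices that the abstract does not specify.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.decomposition import PCA
from sklearn.preprocessing import StandardScaler


def drop_correlated(X, threshold=0.9):
    """Return indices of columns to keep.

    A column is dropped when its absolute Pearson correlation with an
    already-kept column exceeds `threshold` (hypothetical value).
    """
    corr = np.abs(np.corrcoef(X, rowvar=False))
    keep = []
    for j in range(X.shape[1]):
        if all(corr[j, k] <= threshold for k in keep):
            keep.append(j)
    return keep


def cluster_accuracy(labels, y):
    """Map each cluster to its majority class and score agreement with y."""
    correct = 0
    for c in np.unique(labels):
        members = y[labels == c]
        correct += np.bincount(members).max()
    return correct / len(y)


# Synthetic stand-in for the dataset (assumption): two classes,
# four informative features plus two near-duplicate features.
rng = np.random.default_rng(0)
n = 300
normal = rng.normal(0.0, 1.0, size=(n, 4))
attack = rng.normal(3.0, 1.0, size=(n, 4))
base = np.vstack([normal, attack])
redundant = base[:, :2] * 2.0 + rng.normal(0.0, 0.01, size=(2 * n, 2))
X = np.hstack([base, redundant])
y = np.array([0] * n + [1] * n)

X_scaled = StandardScaler().fit_transform(X)

# Variant 1: correlation-based feature selection, then K-means.
keep = drop_correlated(X_scaled)
labels_sel = KMeans(n_clusters=2, n_init=10, random_state=0).fit_predict(
    X_scaled[:, keep]
)

# Variant 2: PCA projection, then K-means.
X_pca = PCA(n_components=2).fit_transform(X_scaled)
labels_pca = KMeans(n_clusters=2, n_init=10, random_state=0).fit_predict(X_pca)

acc_sel = cluster_accuracy(labels_sel, y)
acc_pca = cluster_accuracy(labels_pca, y)
print(f"kept columns: {keep}")
print(f"accuracy with correlation filter: {acc_sel:.3f}")
print(f"accuracy with PCA:                {acc_pca:.3f}")
```

On toy data like this both variants separate the classes easily, so the sketch shows only the mechanics of the comparison and says nothing about the accuracy gap reported for NSL-KDD.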
Pages: 19-23
Number of pages: 5
Related papers
50 in total
  • [1] Improved K-means clustering algorithm in intrusion detection
    Xiao, ShiSong
    Li, XiaoXu
    Liu, XueJiao
    2008 PROCEEDINGS OF INFORMATION TECHNOLOGY AND ENVIRONMENTAL SYSTEM SCIENCES: ITESS 2008, VOL 2, 2008, : 771 - 775
  • [2] Improving K-Means Clustering Using Discretization Technique In Network Intrusion Detection System
    Tahir, Hatim Mohamad
    Said, Abas Md
    Osman, Nor Hayani
    Zakaria, Nur Haryani
    Sabri, Puteri Nurul 'Ain M.
    Katuk, Norliza
    2016 3RD INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCES (ICCOINS), 2016, : 248 - 252
  • [3] Efficient K-means Algorithm in Intrusion Detection
    Yang, Wenjun
    PROCEEDINGS OF THE 2017 2ND INTERNATIONAL CONFERENCE ON MODELLING, SIMULATION AND APPLIED MATHEMATICS (MSAM2017), 2017, 132 : 193 - 195
  • [4] Active Learning Intrusion Detection using k-Means Clustering Selection
    McElwee, Steven
    SOUTHEASTCON 2017, 2017,
  • [5] Intrusion Detection with K-Means Clustering and OneR Classification
    Muda, Z.
    Yassin, W.
    Sulaiman, M. N.
    Udzir, N. I.
    JOURNAL OF INFORMATION ASSURANCE AND SECURITY, 2012, 7 (06): : 347 - 354
  • [6] Hybridization of K-Means and Firefly Algorithm for intrusion detection system
    Kaur A.
    Pal S.K.
    Singh A.P.
    International Journal of System Assurance Engineering and Management, 2018, 9 (04) : 901 - 910
  • [7] Development of a Corruption Detection Algorithm using K-means Clustering
    Islam, Md. Tawheedul
    Abu Yousuf, Mohammad
    2018 INTERNATIONAL CONFERENCE ON ADVANCEMENT IN ELECTRICAL AND ELECTRONIC ENGINEERING (ICAEEE), 2018,
  • [8] Network Intrusion Detection Using Improved Genetic k-means Algorithm
    Sukumar, Anand J., V
    Pranav, I
    Neetish, M. M.
    Narayanan, Jayasree
    2018 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2018, : 2441 - 2446
  • [9] Soil data clustering by using K-means and fuzzy K-means algorithm
    Hot, Elma
    Popovic-Bugarin, Vesna
    2015 23RD TELECOMMUNICATIONS FORUM TELFOR (TELFOR), 2015, : 890 - 893
  • [10] Application of Clustering Algorithm in Intrusion Detection Based on PCA
    Li, Han
    2015 International Conference on Software Engineering and Information System (SEIS 2015), 2015, : 239 - 244