Intrusion Detection Model Based on TF.IDF and C4.5 Algorithms

被引:6
作者
Awadh, Khaldoon [1 ]
Akbas, Ayhan [2 ]
机构
[1] Univ Turkish Aeronaut Assoc, Comp Engn Dept, Ankara, Turkey
[2] Cankiri Karatekin Univ, Comp Engn Dept, Cankiri, Turkey
来源
JOURNAL OF POLYTECHNIC-POLITEKNIK DERGISI | 2021年 / 24卷 / 04期
关键词
IDS; TF.IDF; data mining; machine learning; network security;
D O I
10.2339/politeknik.693221
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
In recent years, the use of machine learning and data mining technologies has drawn researchers' attention to new ways to improve the performance of Intrusion Detection Systems (IDS). These techniques have proven to be an effective method in distinguishing malicious network packets. One of the most challenging problems that researchers are faced with is the transformation of data into a form that can be handled effectively by Machine Learning Algorithms (MLA). In this paper, we present an IDS model based on the decision tree C4.5 algorithm with transforming simulated UNSW-NB15 dataset as a pre-processing operation. Our model uses Term Frequency.Inverse Document Frequency (TF.IDF) to convert data types to an acceptable and efficient form for machine learning to achieve high detection performance. The model has been tested with randomly selected 250000 records of the UNSW-NB15 dataset. Selected records have been grouped into various segment sizes, like 50, 500, 1000, and 5000 items. Each segment has been, further, grouped into two subsets of multi and binary class datasets. The performance of the Decision Tree C4.5 algorithm with Multilayer Perceptron (MLP) and Naive Bayes (NB) has been compared in Weka software. Our proposed method significantly has improved the accuracy of classifiers and decreased incorrectly detected instances. The increase in accuracy reflects the efficiency of transforming the dataset with TF.IDF of various segment sizes.
引用
收藏
页码:1691 / 1698
页数:8
相关论文
共 50 条
  • [21] System of Negative Indonesian Website Detection Using TF-IDF and Vector Space Model
    Adji, Teguh Bharata
    Abidin, Zainil
    Nugroho, Hanung Adi
    2014 INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING AND COMPUTER SCIENCE (ICEECS), 2014, : 174 - 178
  • [22] Interpretation of Clinical Data Based on C4.5 Algorithm for the Diagnosis of Coronary Heart Disease
    Wiharto, Wiharto
    Kusnanto, Hari
    Herianto, Herianto
    HEALTHCARE INFORMATICS RESEARCH, 2016, 22 (03) : 186 - 195
  • [23] C4.5 Decision Tree Machine Learning Algorithm Based GIS Route Identification
    Dalela, Pankaj Kumar
    Bansal, Prashant
    Yadav, Arun
    Majumdar, Sabyasachi
    Yadav, Anurag
    Tyagi, Vipin
    2018 TENTH INTERNATIONAL CONFERENCE ON UBIQUITOUS AND FUTURE NETWORKS (ICUFN 2018), 2018, : 213 - 218
  • [24] Research on a Charging Pile Fault Prediction Method Based on Improved C4.5 Algorithm
    Liu, Hongpeng
    Xu, Ziqi
    Sun, Yirui
    Wang, Liyuan
    Li, Hongwei
    2024 IEEE 19TH CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS, ICIEA 2024, 2024,
  • [25] Construction of decision tree based on C4.5 algorithm for online voltage stability assessment
    Meng, Xiangfei
    Zhang, Pei
    Xu, Yan
    Xie, Hua
    INTERNATIONAL JOURNAL OF ELECTRICAL POWER & ENERGY SYSTEMS, 2020, 118
  • [26] Decision Tree for Online Voltage Stability Margin Assessment Using C4.5 and Relief-F Algorithms
    Meng, Xiangfei
    Zhang, Pei
    Zhang, Dahai
    ENERGIES, 2020, 13 (15)
  • [27] Reservoir Inflow Forecasting Using ID3 and C4.5 Decision Tree Model
    Charoenporn, Pattama
    CONFERENCE PROCEEDINGS OF 2017 3RD IEEE INTERNATIONAL CONFERENCE ON CONTROL SCIENCE AND SYSTEMS ENGINEERING (ICCSSE), 2017, : 698 - 701
  • [28] Using Data Mining Algorithms for Developing a Model for Intrusion Detection System (IDS)
    Duque, Solane
    bin Omar, Mohd Nizam
    COMPLEX ADAPTIVE SYSTEMS, 2015, 2015, 61 : 46 - 51
  • [29] A C4.5 decision tree classifier based floorplanning algorithm for System-on-Chip design
    Shanthi, J.
    Rani, D. Gracia Nirmala
    Rajaram, S.
    MICROELECTRONICS JOURNAL, 2022, 121
  • [30] A HYBRID INTRUSION DETECTION SYSTEM BASED ON DIFFERENTMACHINELEARNING ALGORITHMS
    Atefi, Kayvan
    Yahya, Saadiah
    Dak, Ahmad Yusri
    Atefi, Arash
    COMPUTING & INFORMATICS, 4TH INTERNATIONAL CONFERENCE, 2013, 2013, : 312 - +