Intrusion Detection System Using Stacked Ensemble Learning with Light Gradient Boosting Machine and Decision Tree

被引:0
作者
Dasari, Anil Kumar [1 ]
Biswas, Saroj Kr [1 ]
Baruah, Barnana [2 ]
Purkayastha, Biswajit [1 ]
Das, Soumen [1 ]
机构
[1] Natl Inst Technol, Dept Comp Sci & Engn, Silchar, India
[2] Kaziranga Univ, Dept Comp Sci & Engn, Jorhat, Assam, India
来源
PROCEEDINGS OF INTERNATIONAL CONFERENCE ON NETWORK SECURITY AND BLOCKCHAIN TECHNOLOGY, ICNSBT 2024 | 2025年 / 1158卷
关键词
Intrusion detection system; Stacked ensemble learning; K-means smote oversampling; Isolation forest outlier detection; Decision tree; Light gradient boosting machine and logistic regression; FEATURE-SELECTION;
D O I
10.1007/978-981-97-8051-8_23
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The contemporary challenge of increased cyberattacks poses risks to individuals and businesses, impacting the availability, confidentiality, and integrity of sensitive data transmitted over networks. The resulting harm can vary from minor service interruptions to significant financial losses. While conventional security measures like firewalls and antivirus software provide an initial layer of protection, there is a recognized need to develop effective Intrusion Detection Systems (IDS). Current IDS solutions incorporate various classifiers, both individual and ensemble, yet challenges persist in identifying novel intrusions accurately. Thiswork introduces a novel model, the Intrusion Detection System, using the Stacked Ensemble Learning Technique (IDSELSE), which addresses these challenges. IDSELSE leverages Kmeans SMOTE oversampling to handle class imbalance and enhance minority class representation. Additionally, it employs Boruta for feature selection, streamlining the model's efficiency by eliminating irrelevant features. In classification, IDSELSE utilizes the Stacked Ensemble Learning Technique with a combination of Light Gradient Boosting Machine (LGBM) and Decision Tree (DT) as base classifiers, supported by the meta-model Logistic Regression (LR), to make accurate predictions. Performance evaluation involves assessing F1-score and accuracy through tenfold cross-validation, demonstrating the consistent superiority of the proposed model over various single-classifier and ensemble models documented in the literature.
引用
收藏
页码:291 / 303
页数:13
相关论文
共 30 条
  • [1] Fusion-based anomaly detection system using modified isolation forest for internet of things
    AbuAlghanam O.
    Alazzam H.
    Alhenawi E.
    Qatawneh M.
    Adwan O.
    [J]. Journal of Ambient Intelligence and Humanized Computing, 2023, 14 (01) : 131 - 145
  • [2] Anil Kumar D, 2023, 2023 IEEE 8 INT C CO
  • [3] [Anonymous], 2016, Transactions on Machine Learning and Artificial Intelligence
  • [4] Boukerche A, 2020, ACM COMPUT SURV, V53, DOI [10.1145/3381028, 10.1145/3421763]
  • [5] Gaikwad DP, 2015, 2015 INT C COMP COMM
  • [6] John H., 2019, Int. J. Comput. Sci. Eng, V7, P1060, DOI DOI 10.26438/IJCSE/V7I4.10601064
  • [7] Improving the Accuracy of Intrusion Detection Using GAR-Forest with Feature Selection
    Kanakarajan, Navaneeth Kumar
    Muniasamy, Kandasamy
    [J]. PROCEEDINGS OF THE 4TH INTERNATIONAL CONFERENCE ON FRONTIERS IN INTELLIGENT COMPUTING: THEORY AND APPLICATIONS (FICTA) 2015, 2016, 404 : 539 - 547
  • [8] MLEsIDSs: machine learning-based ensembles for intrusion detection systems-a review
    Kumar, Gulshan
    Thakur, Kutub
    Ayyagari, Maruthi Rohit
    [J]. JOURNAL OF SUPERCOMPUTING, 2020, 76 (11) : 8938 - 8971
  • [9] Feature Selection with the Boruta Package
    Kursa, Miron B.
    Rudnicki, Witold R.
    [J]. JOURNAL OF STATISTICAL SOFTWARE, 2010, 36 (11): : 1 - 13
  • [10] Ensemble Classifiers for Network Intrusion Detection Using a Novel Network Attack Dataset
    Mahfouz, Ahmed
    Abuhussein, Abdullah
    Venugopal, Deepak
    Shiva, Sajjan
    [J]. FUTURE INTERNET, 2020, 12 (11) : 1 - 19