XIDINTFL-VAE: XGBoost-based intrusion detection of imbalance network traffic via class-wise focal loss variational autoencoder

被引：32

作者：

Abdulganiyu, Oluwadamilare Harazeem ^{[1
]}

Tchakoucht, Taha Ait ^{[1
]}

Saheed, Yakub Kayode ^{[2
]}

Ahmed, Hilali Alaoui ^{[1
]}

机构：

[1] Euro Mediterranean Univ Fes, Euromed Res Ctr, Sch Digital Engn & Artificial Intelligence, UEMF, Fes, Morocco

[2] Amer Univ Nigeria, Sch IT & Comp, Yola, Nigeria

来源：

JOURNAL OF SUPERCOMPUTING | 2025年 / 81卷 / 01期

关键词：

Intrusion detection system; Imbalance network traffic; Focal loss; Variational autoencoder; Extreme gradient boosting; SMOTE;

D O I：

10.1007/s11227-024-06552-5

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Intrusion Detection Systems (IDS) face significant challenges in detecting minority class attacks within imbalanced network traffic, where traditional methods often struggle to maintain high accuracy without sacrificing key performance metrics like precision and recall. This study introduces the XIDINTFL-VAE framework, which leverages Class-Wise Focal Loss (CWFL) and Variational AutoEncoder (VAE) integrated with XGBoost to effectively address this imbalance. Our approach is designed to enhance the detection of minority class intrusions while maintaining robust overall performance. Current techniques, such as SMOTE and its variants, often fail to balance precision and recall adequately in highly imbalanced datasets, resulting in either high false positives or missed detections. The proposed method directly addresses this gap by generating synthetic data tailored to the most challenging cases within the minority class, thereby improving the classifier's ability to detect these rare but critical instances. A comparative analysis of the proposed XIDINTFL-VAE method was conducted against various oversampling techniques, including SMOTE, Borderline-SMOTE, and Adaptive Synthetic Sampling (ADASYN), as well as traditional classifiers like Logistic Regression, K-Nearest Neighbors (KNN), Support Vector Machine (SVM), and Decision Tree. While existing techniques focus on balancing datasets or generating realistic data, the CWFL-VAE method takes a more targeted approach by generating synthetic data that specifically enhances the classifier's ability to handle challenging minority class instances. Experimental evaluations using the NSL-KDD and CSE-CIC-IDS2018 datasets demonstrate that the XIDINTFL-VAE model outperforms traditional methods, achieving a precision of 99.67% and an F1 score of 94.74%, with a slight trade-off in recall at 89.41%. These results underscore the model's capability to reduce false positives while maintaining high detection rates, which is crucial for real-world applications. The statistical significance of these improvements is confirmed by comparative analysis, establishing that our approach offers a meaningful advancement over existing methods.

引用

页数：38

共 44 条

[1] Addressing the class imbalance problem in network intrusion detection systems using data resampling and deep learning [J].

Abdelkhalek, Ahmed ;

Mashaly, Maggie .

JOURNAL OF SUPERCOMPUTING, 2023, 79 (10) :10611-10644

[2] RETRACTED: Towards an efficient model for network intrusion detection system (IDS): systematic literature review (Retracted article. See vol. 31, pg. 4415, 2025) [J].

Abdulganiyu, Oluwadamilare Harazeem ;

Tchakoucht, Taha Ait ;

Saheed, Yakub Kayode .

WIRELESS NETWORKS, 2024, 30 (01) :453-482

[3] A systematic literature review for network intrusion detection system (IDS) [J].

Abdulganiyu, Oluwadamilare Harazeem ;

Tchakoucht, Taha Ait ;

Saheed, Yakub Kayode .

INTERNATIONAL JOURNAL OF INFORMATION SECURITY, 2023, 22 (05) :1125-1162

[4] Deep and Machine Learning Approaches for Anomaly-Based Intrusion Detection of Imbalanced Network Traffic [J].

Abdulhammed, Razan ;

Faezipour, Miad ;

Abuzneid, Abdelshakour ;

AbuMallouh, Arafat .

IEEE SENSORS LETTERS, 2019, 3 (01)

[5] Network intrusion detection using oversampling technique and machine learning algorithms [J].

Ahmed, Hafiza Anisa ;

Hameed, Anum ;

Bawany, Narmeen Zakaria .

PEERJ COMPUTER SCIENCE, 2022, 8 :1-19

[6] Deep learning approaches for anomaly-based intrusion detection systems: A survey, taxonomy, and open issues [J].

Aldweesh, Arwa ;

Derhab, Abdelouahid ;

Emam, Ahmed Z. .

KNOWLEDGE-BASED SYSTEMS, 2020, 189

[7] GAN augmentation to deal with imbalance in imaging-based intrusion detection [J].

Andresini, Giuseppina ;

Appice, Annalisa ;

De Rose, Luca ;

Malerba, Donato .

FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2021, 123 (123) :108-127

[8]

[Anonymous], 2010, INT C NEUR INF PROC

[9] Network Intrusion Detection System using Deep Learning [J].

Ashiku, Lirim ;

Dagli, Cihan .

BIG DATA, IOT, AND AI FOR A SMARTER FUTURE, 2021, 185 :239-247

[10] The Effect of Dataset Imbalance on the Performance of SCADA Intrusion Detection Systems [J].

Balla, Asaad ;

Habaebi, Mohamed Hadi ;

Elsheikh, Elfatih A. A. ;

Islam, Md. Rafiqul ;

Suliman, F. M. .

SENSORS, 2023, 23 (02)

← 1 2 3 4 5 →