Detecting Network Transmission Anomalies using Autoencoders-SVM Neural Network on Multi-class NSL-KDD Dataset

被引：9

作者：

Khan, Shehram Sikander ^{[1
]}

Mailewa, Akalanka Bandara ^{[2
]}

机构：

[1] St Cloud State Univ, Dept Informat Assurance, St Cloud, MN 56301 USA

[2] St Cloud State Univ, Dept Comp Sci & IT, St Cloud, MN 56301 USA

来源：

2023 IEEE 13TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE, CCWC | 2023年

关键词：

NSL-KDD; Network Security; Intrusion Detection System; Deep Autoencoders; Anomaly Detection; Support Vector Machine (SVM); t-SNE; Deep Learning; DEEP LEARNING APPROACH; INTRUSION; CLASSIFIER;

D O I：

10.1109/CCWC57344.2023.10099056

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

As modern manufacturing shifts towards industry 4.0, mass adoption of vulnerable Internet-of-Things (IoT), Operational Technology (OT), and IT-OT convergence have precipitated the rise of malware. Accordingly, there is a need for a security mechanism that is both low-resource and highly accurate to address sophisticated attacks like zero-day and Mirai botnets. The study proposed a novel scheme that combined Deep Autoencoder (DAE) and Support Vector Machine (SVM); the hybrid scheme was tested on NSL-KDD: an imbalanced, multi-class, and high-dimensional dataset. We conducted a grid search analysis on L1 and L2 regularization to avoid overfitting and examined varying neural network formations to improve F1-micro and balance accuracy. The paper thoroughly evaluated all four attack classes within the NSL-KDD dataset: U2R, Denial of Service, R2L, and Probe. We demonstrated that DAE-SVM had a significant classification advantage over PCA-SVM; our model outperformed the baseline models at detecting low-frequency attacks with a micro-average score of 0.72 compared to 0.63 for PCA-SVM. To minimize computational overhead, we examined optimal feature fusion usage on Principal Component Analysis (PCA) and deep autoencoders. Our models, namely SVM, PCA-SVM, and DAE-SVM, were evaluated based on train and test times for rapid predictions. The DAE-SVM with L1 penalty was the model of choice for binary classes with a train and test time of 145 seconds, while DAE-SVM without any penalty term outperformed other models in the multi-class scenario delivering 142.62 seconds compared to 300 seconds for PCA-SVM.

引用

页码：835 / 843

页数：9

共 38 条

[31]

Thapa S., 2020, ROLE INTRUSION DETEC

[32]

van der Maaten L, 2009, J. Mach. Learn. Research, P1, DOI DOI 10.1080/13506280444000102

[33]

van der Maaten L, 2008, J MACH LEARN RES, V9, P2579

[34]

Vincent P, 2010, J MACH LEARN RES, V11, P3371

[35] Auto-encoder based dimensionality reduction [J].

Wang, Yasi ;

Yao, Hongxun ;

Zhao, Sicheng .

NEUROCOMPUTING, 2016, 184 :232-242

[36] The Learning Effect of Different Hidden Layers Stacked Autoencoder [J].

Xu, Qingyang ;

Zhang, Caixia ;

Zhang, Li ;

Song, Yong .

2016 8TH INTERNATIONAL CONFERENCE ON INTELLIGENT HUMAN-MACHINE SYSTEMS AND CYBERNETICS (IHMSC), VOL. 2, 2016, :148-151

[37] Cyber-physical systems security: Limitations, issues and future trends [J].

Yaacoub, Jean-Paul A. ;

Salman, Ola ;

Noura, Hassan N. ;

Kaaniche, Nesrine ;

Chehab, Ali ;

Malli, Mohamad .

MICROPROCESSORS AND MICROSYSTEMS, 2020, 77

[38]

Yousefi-Azar M, 2017, IEEE IJCNN, P3854, DOI 10.1109/IJCNN.2017.7966342

← 1 2 3 4 →