Detecting Network Transmission Anomalies using Autoencoders-SVM Neural Network on Multi-class NSL-KDD Dataset

被引:9
作者
Khan, Shehram Sikander [1 ]
Mailewa, Akalanka Bandara [2 ]
机构
[1] St Cloud State Univ, Dept Informat Assurance, St Cloud, MN 56301 USA
[2] St Cloud State Univ, Dept Comp Sci & IT, St Cloud, MN 56301 USA
来源
2023 IEEE 13TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE, CCWC | 2023年
关键词
NSL-KDD; Network Security; Intrusion Detection System; Deep Autoencoders; Anomaly Detection; Support Vector Machine (SVM); t-SNE; Deep Learning; DEEP LEARNING APPROACH; INTRUSION; CLASSIFIER;
D O I
10.1109/CCWC57344.2023.10099056
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As modern manufacturing shifts towards industry 4.0, mass adoption of vulnerable Internet-of-Things (IoT), Operational Technology (OT), and IT-OT convergence have precipitated the rise of malware. Accordingly, there is a need for a security mechanism that is both low-resource and highly accurate to address sophisticated attacks like zero-day and Mirai botnets. The study proposed a novel scheme that combined Deep Autoencoder (DAE) and Support Vector Machine (SVM); the hybrid scheme was tested on NSL-KDD: an imbalanced, multi-class, and high-dimensional dataset. We conducted a grid search analysis on L1 and L2 regularization to avoid overfitting and examined varying neural network formations to improve F1-micro and balance accuracy. The paper thoroughly evaluated all four attack classes within the NSL-KDD dataset: U2R, Denial of Service, R2L, and Probe. We demonstrated that DAE-SVM had a significant classification advantage over PCA-SVM; our model outperformed the baseline models at detecting low-frequency attacks with a micro-average score of 0.72 compared to 0.63 for PCA-SVM. To minimize computational overhead, we examined optimal feature fusion usage on Principal Component Analysis (PCA) and deep autoencoders. Our models, namely SVM, PCA-SVM, and DAE-SVM, were evaluated based on train and test times for rapid predictions. The DAE-SVM with L1 penalty was the model of choice for binary classes with a train and test time of 145 seconds, while DAE-SVM without any penalty term outperformed other models in the multi-class scenario delivering 142.62 seconds compared to 300 seconds for PCA-SVM.
引用
收藏
页码:835 / 843
页数:9
相关论文
共 38 条
[31]  
Thapa S., 2020, ROLE INTRUSION DETEC
[32]  
van der Maaten L, 2009, J. Mach. Learn. Research, P1, DOI DOI 10.1080/13506280444000102
[33]  
van der Maaten L, 2008, J MACH LEARN RES, V9, P2579
[34]  
Vincent P, 2010, J MACH LEARN RES, V11, P3371
[35]   Auto-encoder based dimensionality reduction [J].
Wang, Yasi ;
Yao, Hongxun ;
Zhao, Sicheng .
NEUROCOMPUTING, 2016, 184 :232-242
[36]   The Learning Effect of Different Hidden Layers Stacked Autoencoder [J].
Xu, Qingyang ;
Zhang, Caixia ;
Zhang, Li ;
Song, Yong .
2016 8TH INTERNATIONAL CONFERENCE ON INTELLIGENT HUMAN-MACHINE SYSTEMS AND CYBERNETICS (IHMSC), VOL. 2, 2016, :148-151
[37]   Cyber-physical systems security: Limitations, issues and future trends [J].
Yaacoub, Jean-Paul A. ;
Salman, Ola ;
Noura, Hassan N. ;
Kaaniche, Nesrine ;
Chehab, Ali ;
Malli, Mohamad .
MICROPROCESSORS AND MICROSYSTEMS, 2020, 77
[38]  
Yousefi-Azar M, 2017, IEEE IJCNN, P3854, DOI 10.1109/IJCNN.2017.7966342