Improving Performance of Autoencoder-Based Network Anomaly Detection on NSL-KDD Dataset

被引：104

作者：

Xu, Wen ^{[1
]}

Jang-Jaccard, Julian ^{[1
]}

Singh, Amardeep ^{[1
]}

Wei, Yuanyuan ^{[1
]}

Sabrina, Fariza ^{[2
]}

机构：

[1] Massey Univ, Cybersecur Lab, Auckland 0632, New Zealand

[2] Cent Queensland Univ, Sch Engn & Technol, Sydney, NSW 2000, Australia

来源：

IEEE ACCESS | 2021年 / 9卷

关键词：

Anomaly detection; Data models; Training; Network security; Mathematical models; Encoding; Task analysis; intrusion detection systems; network-based IDSs; anomaly detection; NSL-KDD; artificial intelligence; machine learning; deep learning; autoencoders; unsupervised learning; SPARSE AUTOENCODER;

D O I：

10.1109/ACCESS.2021.3116612

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Network anomaly detection plays a crucial role as it provides an effective mechanism to block or stop cyberattacks. With the recent advancement of Artificial Intelligence (AI), there has been a number of Autoencoder (AE) based deep learning approaches for network anomaly detection to improve our posture towards network security. The performance of existing state-of-the-art AE models used for network anomaly detection varies without offering a holistic approach to understand the critical impacts of the core set of important performance indicators of AE models and the detection accuracy. In this study, we propose a novel 5-layer autoencoder (AE)-based model better suited for network anomaly detection tasks. Our proposal is based on the results we obtained through an extensive and rigorous investigation of several performance indicators involved in an AE model. In our proposed model, we use a new data pre-processing methodology that transforms and removes the most affected outliers from the input samples to reduce model bias caused by data imbalance across different data types in the feature set. Our proposed model utilizes the most effective reconstruction error function which plays an essential role for the model to decide whether a network traffic sample is normal or anomalous. These sets of innovative approaches and the optimal model architecture allow our model to be better equipped for feature learning and dimension reduction thus producing better detection accuracy as well as f1-score. We evaluated our proposed model on the NSL-KDD dataset which outperformed other similar methods by achieving the highest accuracy and f1-score at 90.61% and 92.26% respectively in detection.

引用

页码：140136 / 140146

页数：11

共 33 条

[21] THE 3-SIGMA-RULE [J].

PUKELSHEIM, F .

AMERICAN STATISTICIAN, 1994, 48 (02) :88-91

[22] "Why Should I Trust You?" Explaining the Predictions of Any Classifier [J].

Ribeiro, Marco Tulio ;

Singh, Sameer ;

Guestrin, Carlos .

KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, :1135-1144

[23] Intrusion Detection Based on Autoencoder and Isolation Forest in Fog Computing [J].

Sadaf, Kishwar ;

Sultana, Jabeen .

IEEE ACCESS, 2020, 8 :167059-167068

[24]

Sainath TN, 2012, INT CONF ACOUST SPEE, P4153, DOI 10.1109/ICASSP.2012.6288833

[25] Outside the Closed World: On Using Machine Learning For Network Intrusion Detection [J].

Sommer, Robin ;

Paxson, Vern .

2010 IEEE SYMPOSIUM ON SECURITY AND PRIVACY, 2010, :305-316

[26]

Tukey JW., 1977, Exploratory Data Analysis vol. 2

[27]

Wei Y., 2020, IEEE ACCESS, V8

[28]

Wei Y., ARXIV191006588

[29] Low-cost Indoor Air Quality (IAQ) Platform for Healthier Classrooms in New Zealand: Engineering Issues [J].

Weyers, Ryan ;

Jang-Jaccard, Julian ;

Moses, Alfred ;

Wang, Yu ;

Boulic, Mikael ;

Chitty, Chris ;

Phipps, Robyn ;

Cunningham, Chris ;

Olivares, Gustavo ;

Arif, Khalid ;

Page, Wyatt ;

Shekar, Aruna ;

Botes, Carli ;

Cresswell, Georgie ;

Chilcott, Kate ;

Withers, Emma ;

Delhumeau, Antoine ;

Hue, Corentin ;

Plagmann, Manfred ;

Trompetter, Bill ;

Bennett, Julie ;

Pierse, Nevil ;

Theobald, Chris ;

Mansell, Alishia ;

Kaua, Doris ;

Ponder-Sutton, Agathe .

2017 4TH ASIA-PACIFIC WORLD CONGRESS ON COMPUTER SCIENCE AND ENGINEERING (APWCONCSE 2017), 2017, :208-215

[30] Effective Feature Extraction via Stacked Sparse Autoencoder to Improve Intrusion Detection System [J].

Yan, Binghao ;

Han, Guodong .

IEEE ACCESS, 2018, 6 :41238-41248

← 1 2 3 4 →