Evaluation of recurrent neural network and its variants for intrusion detection system (IDS)

被引:50
作者
Vinayakumar R. [1 ]
Soman K.P. [1 ]
Poornachandran P. [2 ]
机构
[1] Center for Computational Engineering and Networking (CEN), Amrita School of Engineering, Amrita Vishwa Vidyapeetham, Coimbatore
[2] Center for Cyber Security Systems and Networks, Amrita School of Engineering, Amrita Vishwa Vidyapeetham, Amritapuri
关键词
Deep learning (DL) approaches; Gated recurrent unit (GRU); Intrusion detection (ID) data sets; KDDCup’99; Long short-term memory (LSTM); Machine learning (ML); Recurrent neural network (RNN); UNSW-NB15;
D O I
10.4018/IJISMD.2017070103
中图分类号
学科分类号
摘要
This article describes how sequential data modeling is a relevant task in Cybersecurity. Sequences are attributed temporal characteristics either explicitly or implicitly. Recurrent neural networks (RNNs) are a subset of artificial neural networks (ANNs) which have appeared as a powerful, principle approach to learn dynamic temporal behaviors in an arbitrary length of large-scale sequence data. Furthermore, stacked recurrent neural networks (S-RNNs) have the potential to learn complex temporal behaviors quickly, including sparse representations. To leverage this, the authors model network traffic as a time series, particularly transmission control protocol / internet protocol (TCP/IP) packets in a predefined time range with a supervised learning method, using millions of known good and bad network connections. To find out the best architecture, the authors complete a comprehensive review of various RNN architectures with its network parameters and network structures. Ideally, as a test bed, they use the existing benchmark Defense Advanced Research Projects Agency / Knowledge Discovery and Data Mining (DARPA) / (KDD) Cup ‘99’ intrusion detection (ID) contest data set to show the efficacy of these various RNN architectures. All the experiments of deep learning architectures are run up to 1000 epochs with a learning rate in the range [0.01-0.5] on a GPU-enabled TensorFlow and experiments of traditional machine learning algorithms are done using Scikit-learn. Experiments of families of RNN architecture achieved a low false positive rate in comparison to the traditional machine learning classifiers. The primary reason is that RNN architectures are able to store information for long-term dependencies over time-lags and to adjust with successive connection sequence information. In addition, the effectiveness of RNN architectures are shown for the UNSW-NB15 data set. Copyright © 2017, IGI Global.
引用
收藏
页码:43 / 63
页数:20
相关论文
共 43 条
[31]  
Paxson V., Bro: A system for detecting network intruders in real-time, Computer Networks, 31, 23, pp. 2435-2463, (1999)
[32]  
Pedregosa F., Varoquaux G., Gramfort A., Michel V., Thirion B., Grisel O., Vanderplas J., Et al., Scikit-learn: Machine learning in python, Journal of Machine Learning Research, 12, pp. 2825-2830, (2011)
[33]  
Sak H., Senior A., Beaufays F., Long short-term memory recurrent neural network architectures for large scale acoustic modeling, Fifteenth Annual Conference of The International Speech Communication Association, (2014)
[34]  
Simonyan K., Vedaldi A., Zisserman A., Deep Inside Convolutional Networks: Visualizing Image Classification Models and Saliency Maps, (2013)
[35]  
Staudemeyer R.C., Applying long short-term memory recurrent neural networks to intrusion detection, South African Computer Journal, 56, 1, pp. 136-154, (2015)
[36]  
Staudemeyer R.C., Omlin C.W., Evaluating performance of long short-term memory recurrent neural networks on intrusion detection data, Proceedings of The South African Institute for Computer Scientists and Information Technologists Conference, pp. 218-224, (2013)
[37]  
Staudemeyer R.C., Omlin C.W., Extracting salient features for network intrusion detection using machine learning methods, South African Computer Journal, 52, 1, pp. 82-96, (2014)
[38]  
Tavallaee M., Bagheri E., Lu W., Ghorbani A.A., A detailed analysis of the KDD CUP 99 data set, IEEE Symposium on Computational Intelligence for Security and Defense Applications CISDA’09, (2009)
[39]  
Vaidya T., 2001-2013: Survey and Analysis of Major Cyberattacks, (2015)
[40]  
Wang Z., The Applications of Deep Learning on Traffic Identification, (2015)