Improving SSH detection model using IPA time and WGAN-GP

被引:8
作者
Lee, Junwon [1 ]
Lee, Heejo [1 ]
机构
[1] Korea Univ, Anam Ro 141, Seoul 02841, South Korea
关键词
GAN; WGAN-GP; SSH detection; Inter -packet arrival time; Session -based data; Random forest; Generator loss; PCA;
D O I
10.1016/j.cose.2022.102672
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In the machine learning-based detection model, the detection accuracy tends to be proportional to the quantity and quality of the training dataset. The machine learning-based SSH detection model's performance is affected by the size of the training dataset and the ratio of target classes. However, in an actual network environment within a short period, it is inconvenient to collect a sufficient and diverse training dataset. Even though many training data samples are collected, it takes a lot of effort and time to prepare the training dataset through data classification. To overcome these limitations, we generate sophisticated samples using the WGAN-GP algorithm and present how to select samples by comparing generator loss. The synthetic training dataset with generated samples improves the performance of the SSH detection model. Furthermore, we add the new features to include the distinction of inter-packet arrival time. The enhanced SSH detection model decreases false positives and provides a 0.999 F 1-score by applying the synthetic dataset and the packet inter-arrival time features. (c) 2022 Elsevier Ltd. All rights reserved.
引用
收藏
页数:12
相关论文
共 43 条
[1]  
Al Olaimat M., 2020, 2020 29 INT C COMP C, P1
[2]  
Alshammari R, 2007, IEEE SYS MAN CYBERN, P2563
[3]   Can encrypted traffic be identified without port numbers, IP addresses and payload inspection? [J].
Alshammari, Riyad ;
Zincir-Heywood, A. Nur .
COMPUTER NETWORKS, 2011, 55 (06) :1326-1350
[4]  
[Anonymous], 2017, Nips 2016 tutorial: Generative adversarial networks
[5]  
[Anonymous], 2003, P 2003 ACM S APPL CO
[6]  
[Anonymous], 2018, Idsgan: Generative adversarial networks for attack generation against intrusion detection
[7]  
Arjovsky M, 2017, PR MACH LEARN RES, V70
[8]  
Beckett J, 2017, WHATS GENERATIVE ADV
[9]  
Berg A, 2019, ARXIV PREPRINT ARXIV
[10]  
Burande A., 2014, INT J SCI RES PUBL, V4, P2250