FLOWGAN:Unbalanced network encrypted traffic identification method based on GAN

被引:25
作者
Wang, ZiXuan [1 ]
Wang, Pan [1 ]
Zhou, Xiaokang [2 ,3 ]
Li, ShuHang [1 ]
Zhang, MoXuan [4 ]
机构
[1] Nanjing Univ Posts & Telecommun, Sch Modern Posts Univ, Nanjing, Peoples R China
[2] Shiga Univ, Fac Data Sci, Hikone, Japan
[3] RIKEN, Ctr Adv Intelligence Project, Tokyo, Japan
[4] Jinling Inst Technol, Sch Int Educ, Nanjing, Peoples R China
来源
2019 IEEE INTL CONF ON PARALLEL & DISTRIBUTED PROCESSING WITH APPLICATIONS, BIG DATA & CLOUD COMPUTING, SUSTAINABLE COMPUTING & COMMUNICATIONS, SOCIAL COMPUTING & NETWORKING (ISPA/BDCLOUD/SOCIALCOM/SUSTAINCOM 2019) | 2019年
基金
美国国家科学基金会;
关键词
traffic classification; encrypted traffic; deep learning; Generative Adversarial Network; class imbalance;
D O I
10.1109/ISPA-BDCloud-SustainCom-SocialCom48970.2019.00141
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
It is crucial to accurately identify the type of traffic and application so that it can enable various policy-driven network management and security monitoring. However, with the increasing adoption of Internet applications use encryption protocols to transmit data, traffic classification is becoming more difficult. Although existing machine learning methods and novel deep learning methods have many advantages, which can solve the drawbacks of port and payload based methods, but there are still some shortcomings, one of which is the imbalanced property of network traffic data. In this paper, we proposed a GAN based method called FIowGAN to tackle with the problem of class imbalance for traffic classification. As an instance of Generative Adversarial Network (GAN), FIowGAN leverages the superiority of GAN's data augmentation to produce synthetic traffic data for classes with few samples. Furthermore, we trained a classical deep learning model, Multilayer perceptron (MLP) based network traffic classifier to evaluate the performance of FIowGAN. Based on the public dataset 'ISCX', our experimental results show that our proposed FIowGAN can outperform an unbalanced dataset and balancing dataset by the oversampling method in terms of data augmentation. Based on the public dataset ISCX, our experimental results show that the recognition performance of FIowGAN on small samples, compared with the unbalanced dataset, Precision, Recall, and Fl-score increased by 13.2%, 17.0%, and 15.6% on average, compared with the balanced dataset Precision, Recall, Fl-score increased by 2.15%, 2.06%, 2.12% on average.
引用
收藏
页码:975 / 983
页数:9
相关论文
共 22 条
  • [1] Abadi M, 2015, 12 USENIX S OPERATIN
  • [2] Aceto G, 2018, 2018 NETWORK TRAFFIC MEASUREMENT AND ANALYSIS CONFERENCE (TMA)
  • [3] Chen Wei, 2012, Computer Engineering, V38, P22, DOI 10.3969/j.issn.1000-3428.2012.12.006
  • [4] A Hierarchical Approach to Encrypted Data Packet Classification in Smart Home Gateways
    Chen, Xuejiao
    Yu, Jiahui
    Ye, Feng
    Wang, Pan
    [J]. 2018 16TH IEEE INT CONF ON DEPENDABLE, AUTONOM AND SECURE COMP, 16TH IEEE INT CONF ON PERVAS INTELLIGENCE AND COMP, 4TH IEEE INT CONF ON BIG DATA INTELLIGENCE AND COMP, 3RD IEEE CYBER SCI AND TECHNOL CONGRESS (DASC/PICOM/DATACOM/CYBERSCITECH), 2018, : 41 - 45
  • [5] Chollet F., 2015, keras
  • [6] Guo H., 2004, SIGKDD Explor. Newsl, V6, P30, DOI [DOI 10.1145/1007730.1007736, 10.1145/1007730.1007736]
  • [7] Habibi Lashkari A., 2016, Characterization of encrypted and vpn traffic using time-related features, 02
  • [8] HU W, 2017, ARXIV170205983
  • [9] Japkowicz N., 2002, Intelligent Data Analysis, V6, P429
  • [10] A multilevel taxonomy and requirements for an optimal traffic-classification model
    Khalife, Jawad
    Hajjar, Amjad
    Diaz-Verdejo, Jesus
    [J]. INTERNATIONAL JOURNAL OF NETWORK MANAGEMENT, 2014, 24 (02) : 101 - 120