FlowGAN: Unbalanced network encrypted traffic identification method based on GAN

Cited by: 25

Authors:

Wang, ZiXuan ^{[1]}

Wang, Pan ^{[1]}

Zhou, Xiaokang ^{[2,3]}

Li, ShuHang ^{[1]}

Zhang, MoXuan ^{[4]}

Affiliations:

[1] Nanjing Univ Posts & Telecommun, Sch Modern Posts Univ, Nanjing, Peoples R China

[2] Shiga Univ, Fac Data Sci, Hikone, Japan

[3] RIKEN, Ctr Adv Intelligence Project, Tokyo, Japan

[4] Jinling Inst Technol, Sch Int Educ, Nanjing, Peoples R China

Source:

2019 IEEE INTL CONF ON PARALLEL & DISTRIBUTED PROCESSING WITH APPLICATIONS, BIG DATA & CLOUD COMPUTING, SUSTAINABLE COMPUTING & COMMUNICATIONS, SOCIAL COMPUTING & NETWORKING (ISPA/BDCLOUD/SOCIALCOM/SUSTAINCOM 2019) | 2019

Funding:

US National Science Foundation;

Keywords:

traffic classification; encrypted traffic; deep learning; Generative Adversarial Network; class imbalance;

DOI:

10.1109/ISPA-BDCloud-SustainCom-SocialCom48970.2019.00141

Chinese Library Classification (CLC) number:

TP3 [Computing technology, computer technology];

Discipline classification code:

0812;

Abstract:

Accurately identifying traffic types and applications is crucial for enabling policy-driven network management and security monitoring. However, as Internet applications increasingly adopt encryption protocols to transmit data, traffic classification is becoming more difficult. Existing machine learning methods and newer deep learning methods overcome the drawbacks of port-based and payload-based methods, but they still have shortcomings, one of which is the imbalanced nature of network traffic data. In this paper, we propose a GAN-based method called FlowGAN to tackle the problem of class imbalance in traffic classification. As an instance of a Generative Adversarial Network (GAN), FlowGAN leverages the data augmentation capability of GANs to produce synthetic traffic data for classes with few samples. Furthermore, we train a classical deep learning model, a multilayer perceptron (MLP) based network traffic classifier, to evaluate the performance of FlowGAN. On the public dataset 'ISCX', our experimental results show that, in terms of data augmentation, FlowGAN outperforms both the unbalanced dataset and the dataset balanced by oversampling. For classes with few samples, Precision, Recall, and F1-score increase on average by 13.2%, 17.0%, and 15.6% compared with the unbalanced dataset, and by 2.15%, 2.06%, and 2.12% compared with the balanced dataset.
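
The abstract compares GAN-based augmentation against a dataset balanced by oversampling. As a rough illustration of that baseline only, the sketch below duplicates minority-class samples at random until every class matches the largest one. The abstract does not say which oversampling variant the authors used, so random duplication is an assumption here, and the function name `random_oversample`, the toy feature vectors, and the class labels are all hypothetical.

```python
import random
from collections import Counter, defaultdict


def random_oversample(samples, labels, seed=0):
    """Duplicate minority-class samples at random until every class
    has as many samples as the largest class (assumed baseline)."""
    rng = random.Random(seed)

    # Group samples by their class label.
    by_class = defaultdict(list)
    for x, y in zip(samples, labels):
        by_class[y].append(x)

    # Every class is grown to the size of the largest class.
    target = max(len(xs) for xs in by_class.values())

    out_x, out_y = [], []
    for y, xs in by_class.items():
        extra = [rng.choice(xs) for _ in range(target - len(xs))]
        out_x.extend(xs + extra)
        out_y.extend([y] * target)
    return out_x, out_y


# Toy data: hypothetical flow feature vectors with hypothetical labels.
samples = [[0.1, 0.2], [0.3, 0.1], [0.2, 0.4], [0.9, 0.8]]
labels = ["browsing", "browsing", "browsing", "voip"]

bal_x, bal_y = random_oversample(samples, labels)
counts = Counter(bal_y)
```

Because this baseline only repeats existing samples, it adds no new variation to the minority class. A GAN-based approach such as the one the abstract describes generates synthetic samples instead.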


Pages: 975-983

Page count: 9

References: 22 in total

