Accurate compressed traffic detection via traffic analysis using Graph Convolutional Network based on graph structure feature

被引:1
作者
Fu, Nan [1 ,2 ]
Cheng, Guang [1 ,2 ,3 ]
Su, Xinyue [1 ,2 ]
机构
[1] Southeast Univ, Sch Cyber Sci & Engn, Nanjing 211189, Peoples R China
[2] Jiangsu Prov Engn Res Ctr Secur Ubiquitous Network, Nanjing 211189, Peoples R China
[3] Purple Mt Labs, Nanjing 211189, Peoples R China
基金
中国国家自然科学基金;
关键词
Compressed traffic detection; Traffic analysis; Graph structure feature; Deep learning; Graph convolutional network;
D O I
10.1016/j.comcom.2023.04.031
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
As the application of data compression technology expands in areas such as IoT, webpage, and video data transmission, there are problems such as leakage of compressed but unencrypted user data, difficulty in supervising compressed data, and confusion between compressed and private encrypted traffic. Existing compressed traffic detection methods are significantly affected by data length and rely on binary classification in one step using random tests. It remains a challenging task to conduct accurate and efficient compressed traffic detection. In this paper, we present GCN-RTG, a compressed traffic detection method using Graph Convolutional Network. We investigate the randomness feature transformation pattern of packet sequences, propose a graph structure based on this pattern, and design a powerful GCN-based classifier to detect compressed traffic. The experimental results show that GCN-RTG achieves 94% accuracy in compressed traffic detection, remarkably improving by nearly 10% accuracy compared with traditional machine learning methods and approximately 5% compared with CNN and LSTM. Considering the effect of private encrypted traffic, GCN-RTG attains an accuracy of 89% for detecting compressed traffic. Furthermore, GCN-RTG can maintain an 83% accuracy even in the most extreme packet loss scenario and reach an outstanding accuracy of up to 95% of compressed traffic detection in real-world network data sized 4GB from the Jiangsu education backbone network in China.
引用
收藏
页码:128 / 139
页数:12
相关论文
共 26 条
[1]  
Apthorpe N, 2017, Arxiv, DOI arXiv:1705.06805
[2]   HEDGE: Efficient Traffic Classification of Encrypted and Compressed Packets [J].
Casino, Fran ;
Choo, Kim-Kwang Raymond ;
Patsakis, Constantinos .
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2019, 14 (11) :2916-2926
[3]  
Cheng G., 2018, Proceedings of the 13th International Conference on Future Internet Technologies, P1, DOI [DOI 10.1080/02726351.2018.1480547, 10.1080/02726351.2018.1480547]
[4]  
Cisco I., 2012, 518 CISCO
[5]   Reliable detection of compressed and encrypted data [J].
De Gaspari, Fabio ;
Hitaj, Dorjan ;
Pagnotta, Giulio ;
De Carli, Lorenzo ;
Mancini, Luigi, V .
NEURAL COMPUTING & APPLICATIONS, 2022, 34 (22) :20379-20393
[6]   ENCOD: Distinguishing Compressed and Encrypted File Fragments [J].
De Gaspari, Fabio ;
Hitaj, Dorjan ;
Pagnotta, Giulio ;
De Carli, Lorenzo ;
Mancini, Luigi, V .
NETWORK AND SYSTEM SECURITY, NSS 2020, 2020, 12570 :42-62
[7]  
Hahn D., 2018, Detecting compressed cleartext traffic from consumer internet of things devices
[8]   RETRACTED: CLD-Net: A Network Combining CNN and LSTM for Internet Encrypted Traffic Classification (Retracted Article) [J].
Hu, Xinyi ;
Gu, Chunxiang ;
Wei, Fushan .
SECURITY AND COMMUNICATION NETWORKS, 2021, 2021
[9]   A survey on data compression techniques: From the perspective of data quality, coding schemes, data type and applications [J].
Jayasankar, Uthayakumar ;
Thirumal, Vengattaraman ;
Ponnurangam, Dhavachelvan .
JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2021, 33 (02) :119-140
[10]   An Information-Theoretical Approach to High-Speed Flow Nature Identification [J].
Khakpour, Amir R. ;
Liu, Alex X. .
IEEE-ACM TRANSACTIONS ON NETWORKING, 2013, 21 (04) :1076-1089