Self-Supervised Traffic Classification: Flow Embedding and Few-Shot Solutions

被引:3
作者
Horowicz, Eyal [1 ]
Shapira, Tal [2 ]
Shavitt, Yuval [1 ]
机构
[1] Tel Aviv Univ, Elect Engn Dept, IL-69978 Tel Aviv, Israel
[2] Hebrew Univ Jerusalem, Sch Comp Sci, IL-9190500 Jerusalem, Israel
来源
IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT | 2024年 / 21卷 / 03期
关键词
Internet traffic classification; application identification; traffic; security management; few-shot learning; contrastive representation learning; self-supervised learning; INTERNET; NETWORK;
D O I
10.1109/TNSM.2024.3366848
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Internet traffic classification has been intensively studied over the past decade due to its importance for traffic engineering and cyber security. A promising approach to several traffic classification problems is the FlowPic approach, where histograms of packet sizes in consecutive time slices are transformed into a picture that is fed into a Convolution Neural Network (CNN) model for classification. However, CNNs (and the FlowPic approach included) require a relatively large labeled flow dataset, which is not always easy to obtain. In this paper, we show that we can overcome this obstacle by using Contrastive Representation Learning in order to learn from an unlabeled flow dataset a flow representation that can be embedded in a latent space, enabling clustering of flows belonging to the same class together. We then show that by using just a few labeled flows (a few shots) from each class, we can achieve high accuracy in flow classification. We show that common picture augmentation techniques can help, but accuracy improves further when introducing augmentation techniques that mimic network behavior, such as changes in the RTT (Round-trip time). Finally, we show that we can replace the large FlowPics suggested in the past with much smaller mini-FlowPics and achieve two advantages: improved model performance and easier engineering. Interestingly, this even improves accuracy in some cases.
引用
收藏
页码:3054 / 3067
页数:14
相关论文
共 53 条
[1]   DISTILLER: Encrypted traffic classification via multimodal multitask deep learning [J].
Aceto, Giuseppe ;
Ciuonzo, Domenico ;
Montieri, Antonio ;
Pescape, Antonio .
JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2021, 183
[2]   Mobile Encrypted Traffic Classification Using Deep Learning: Experimental Evaluation, Lessons Learned, and Challenges [J].
Aceto, Giuseppe ;
Ciuonzo, Domenico ;
Montieri, Antonio ;
Pescape, Antonio .
IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2019, 16 (02) :445-458
[3]   Semi-Supervised Learning of Visual Features by Non-Parametrically Predicting View Assignments with Support Samples [J].
Assran, Mahmoud ;
Caron, Mathilde ;
Misra, Ishan ;
Bojanowski, Piotr ;
Joulin, Armand ;
Ballas, Nicolas ;
Rabbat, Michael .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :8423-8432
[4]  
Bovenzi G., 2023, P IEEE C COMP COMM W, P1
[5]  
Chen T, 2020, PMLR, V119, P1597
[6]  
Chen ZT, 2017, IEEE INT CONF BIG DA, P1271, DOI 10.1109/BigData.2017.8258054
[7]   Traffic classification through simple statistical fingerprinting. [J].
Crotti, Manuel ;
Dusi, Maurizio ;
Gringoli, Francesco ;
Salgarelli, Luca .
ACM SIGCOMM COMPUTER COMMUNICATION REVIEW, 2007, 37 (01) :5-16
[8]   SHAPE: A Simultaneous Header and Payload Encoding Model for Encrypted Traffic Classification [J].
Dai, Jianbang ;
Xu, Xiaolong ;
Gao, Honghao ;
Wang, Xinheng ;
Xiao, Fu .
IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2023, 20 (02) :1993-2012
[9]   Issues and Future Directions in Traffic Classification [J].
Dainotti, Alberto ;
Pescape, Antonio ;
Claffy, Kimberly C. .
IEEE NETWORK, 2012, 26 (01) :35-40
[10]  
Draper-Gil Gerard, 2016, ICISSP 2016. 2nd International Conference on Information Systems Security and Privacy. Proceedings, P407