FEAT: A Federated Approach for Privacy-Preserving Network Traffic Classification in Heterogeneous Environments

Cited by: 13
Authors
Guo, Yingya [1, 2, 3]
Wang, Dan [4]
Affiliations
[1] Coll Comp & Data Sci, Fujian Prov Key Lab Network Comp & Intelligent Inf, Fuzhou 350025, Peoples R China
[2] Fuzhou Univ, Fuzhou 350025, Peoples R China
[3] Fuzhou Univ, Key Lab Spatial Data Min Informat Sharing, Minist Educ, Fuzhou 350025, Peoples R China
[4] Hong Kong Polytech Univ, Dept Comp, Hong Kong, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Federated analytics (FA); federated learning (FL); heterogeneous environments; network traffic classification;
DOI
10.1109/JIOT.2022.3204975
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Network traffic classification is the foundation for many network security and network management applications. Recently, to preserve the privacy of the data generated at the mobile ends, federated learning (FL)-based classification methods have been proposed. Unfortunately, the performance of FL-based methods can seriously degrade when the client data are skewed. This is particularly true for mobile network traffic classification, where the environments at the mobile ends are highly heterogeneous. In this article, we first conduct a measurement study on traffic classification accuracy through FL using real-world network traffic traces, and we observe serious accuracy degradation due to heterogeneous environments. We propose a novel federated analytics (FA) approach, FEAT, to improve the accuracy. Note that FL emphasizes model training, whereas our FA performs local analytic tasks that estimate traffic data skewness and select appropriate clients for FL model training. Our analytics tasks are performed locally and in a federated manner; thus, we preserve privacy as well. Our approach has strong theoretical properties: we exploit Hoeffding's inequality to infer traffic data skewness, and we leverage Thompson sampling for client selection. We evaluate our approach through extensive experiments using the real-world traffic data sets QUIC and ISCX. The experiments demonstrate that FEAT can improve traffic classification accuracy in heterogeneous environments.
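The abstract names two standard tools, Hoeffding's inequality and Thompson sampling, without giving details. The sketch below is a generic illustration of those two textbook techniques only. It is not the FEAT algorithm, and every name in it (`hoeffding_epsilon`, `thompson_select`, the success/failure counts, the Beta(1, 1) prior) is a hypothetical choice made for this example. It assumes each client's per-round outcome can be summarized as a bounded or binary signal, which the record does not confirm.

```python
import math
import random


def hoeffding_epsilon(n, delta, value_range=1.0):
    """Half-width eps such that, for the mean of n i.i.d. samples bounded in
    an interval of width value_range, P(|mean - E[mean]| >= eps) <= delta.

    Obtained by solving delta = 2 * exp(-2 * n * eps**2 / value_range**2).
    """
    return value_range * math.sqrt(math.log(2.0 / delta) / (2.0 * n))


def thompson_select(successes, failures, k, rng):
    """Pick k clients with Beta-Bernoulli Thompson sampling.

    successes / failures map a client id to hypothetical counts of "helpful"
    and "unhelpful" past rounds. Each client's Beta(1 + s, 1 + f) posterior
    is sampled once, and the k clients with the largest draws are returned.
    """
    draws = {
        client: rng.betavariate(1 + successes[client], 1 + failures[client])
        for client in successes
    }
    return sorted(draws, key=draws.get, reverse=True)[:k]
```

For instance, `hoeffding_epsilon(200, 0.05)` gives the deviation that a sample mean of 200 values in [0, 1] exceeds with probability at most 5%, and `thompson_select(s, f, 2, random.Random(0))` returns two client ids, favoring clients whose posterior suggests a higher success rate while still occasionally exploring the others.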
Pages: 1274-1285
Page count: 12
Related papers
42 items in total
[1]  
Bakopoulou E, 2019, Arxiv, DOI arXiv:1907.13113
[2]  
Bar-Yanai R, 2010, LECT NOTES COMPUT SC, V6049, P373, DOI 10.1007/978-3-642-13193-6_32
[3]  
Cho YJ, 2022, PR MACH LEARN RES, V151
[4]  
Draper-Gil Gerard, 2016, ICISSP 2016. 2nd International Conference on Information Systems Security and Privacy. Proceedings, P407
[5]   A Survey of Payload-Based Traffic Classification Approaches [J].
Finsterbusch, Michael ;
Richter, Chris ;
Rocha, Eduardo ;
Mueller, Jean-Alexander ;
Haenssgen, Klaus .
IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2014, 16 (02) :1135-1156
[6]  
Guo Y., 2021, WIRELESS COMMUN MOBI
[7]  
Guorui Xie, 2020, NetAI '20: Proceedings of the Workshop on Network Meets AI & ML, P14, DOI 10.1145/3405671.3405811
[8]  
Han Z, 2017, SIGNAL PROCESSING AND NETWORKING FOR BIG DATA APPLICATIONS, P1, DOI 10.1017/9781316408032
[9]  
Hsu TMH, 2019, Arxiv, DOI [arXiv:1909.06335, 10.48550/ARXIV.1909.06335]
[10]  
Hoeffding W., 1994, PROBABILITY INEQUALI, P409, DOI 10.1007/978-1-4612-0865-5_26