Fed-UGI: Federated Undersampling Learning Framework With Gini Impurity for Imbalanced Network Intrusion Detection

被引：1

作者：

Zheng, Ming ^{[1
,2
]}

Hu, Xiaowen ^{[1
]}

Hu, Ying ^{[1
,2
]}

Zheng, Xiaoyao ^{[1
,2
]}

Luo, Yonglong ^{[1
,2
]}

机构：

[1] Anhui Normal Univ, Sch Comp & Informat, Wuhu 241002, Peoples R China

[2] Anhui Normal Univ, Anhui Prov Key Lab Ind Intelligence Data Secur, Wuhu 241002, Peoples R China

来源：

IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY | 2025年 / 20卷

基金：

中国国家自然科学基金;

关键词：

Federated learning; Impurities; imbalanced data; network intrusion detection; undersampling; ATTACKS;

D O I：

10.1109/TIFS.2024.3516547

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

In the modern interconnected world, the popularization of networks and the rapid development of information technology led to the increasing security risks and threats in network systems. The existing intrusion detection system is constantly challenged by various malicious intrusion attacks. Machine learning algorithms have been widely used in intrusion detection. However, the model training requires the support of a sufficient high-quality samples, especially attack traffic data. Network intrusion detection datasets may not be shared between organizations due to data security and some privacy policy concerns. The federated learning framework is an optimal approach to address this issue, in which organizations collaborate to train a global model shared by multiple parties while keeping the data local to the client, guaranteeing the data privacy and security of all parties. However, there is a problem of class imbalance in the network traffic data owned by the organizations, which seriously affects the detection performance of the model and leads to a high consumption of model training time. Therefore, this study proposed a novel federated undersampling learning framework with Gini impurity, namely Fed-UGI. The framework is based on the hash-based block undersampling method to rebalance the client, which can solve the influence of imbalanced training data on the model detection performance and improve the model training efficiency. Moreover, the client weighted aggregation strategy based on Local Gini impurity can further optimize the effect of global model aggregation and reduce the impact of the dispersion degree and information difference in client data on model aggregation. In addition, extensive experiments on intrusion detection datasets show that compared to SOTA methods, the proposed Fed-UGI method has a good detection effect on the three metrics of F1-score, G-mean and AUC, the training time of the model is reduced by 51.76%-92.58%, especially in highly class imbalance situation.

引用

页码：1262 / 1277

页数：16

共 51 条

[1] Evaluating Federated Learning for intrusion detection in Internet of Things: Review and challenges [J].

Campos, Enrique Marmol ;

Saura, Pablo Fernandez ;

Gonzalez-Vidal, Aurora ;

Hernandez-Ramos, Jose L. ;

Bernabe, Jorge Bernal ;

Baldini, Gianmarco ;

Skarmeta, Antonio .

COMPUTER NETWORKS, 2022, 203

[2] FedDef: Defense Against Gradient Leakage in Federated Learning-Based Network Intrusion Detection Systems [J].

Chen, Jiahui ;

Zhao, Yi ;

Li, Qi ;

Feng, Xuewei ;

Xu, Ke .

IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2023, 18 :4561-4576

[3] TMG-GAN: Generative Adversarial Networks-Based Imbalanced Learning for Network Intrusion Detection [J].

Ding, Hongwei ;

Sun, Yu ;

Huang, Nana ;

Shen, Zhidong ;

Cui, Xiaohui .

IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2024, 19 :1156-1167

[4] Self-Balancing Federated Learning With Global Imbalanced Data in Mobile Systems [J].

Duan, Moming ;

Liu, Duo ;

Chen, Xianzhang ;

Liu, Renping ;

Tan, Yujuan ;

Liang, Liang .

IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2021, 32 (01) :59-71

[5] Secure Aggregation is Insecure: Category Inference Attack on Federated Learning [J].

Gao, Jiqiang ;

Hou, Boyu ;

Guo, Xiaojie ;

Liu, Zheli ;

Zhang, Ying ;

Chen, Kai ;

Li, Jin .

IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING, 2023, 20 (01) :147-160

[6] Iterative Quantization: A Procrustean Approach to Learning Binary Codes for Large-Scale Image Retrieval [J].

Gong, Yunchao ;

Lazebnik, Svetlana ;

Gordo, Albert ;

Perronnin, Florent .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (12) :2916-2929

[7] Datasets are not enough: Challenges in labeling network traffic [J].

Guerra, Jorge Luis ;

Catania, Carlos ;

Veas, Eduardo .

COMPUTERS & SECURITY, 2022, 120

[8]

Guo Songyue, 2023, Database Systems for Advanced Applications: 28th International Conference, DASFAA 2023, Proceedings. Lecture Notes in Computer Science (13943), P703, DOI 10.1007/978-3-031-30637-2_47

[9] Robust and Secure Federated Learning Against Hybrid Attacks: A Generic Architecture [J].

Hao, Xiaohan ;

Lin, Chao ;

Dong, Wenhan ;

Huang, Xinyi ;

Xiong, Hui .

IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2024, 19 :1576-1588

[10] Learning from Imbalanced Data [J].

He, Haibo ;

Garcia, Edwardo A. .

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2009, 21 (09) :1263-1284

← 1 2 3 4 5 6 →