FedTKD: A Trustworthy Heterogeneous Federated Learning Based on Adaptive Knowledge Distillation

被引:6
作者
Chen, Leiming [1 ]
Zhang, Weishan [1 ]
Dong, Cihao [1 ]
Zhao, Dehai [2 ]
Zeng, Xingjie [3 ]
Qiao, Sibo [4 ]
Zhu, Yichang [1 ]
Tan, Chee Wei [5 ]
机构
[1] China Univ Petr East China, Sch Comp Sci & Technol, Qingdao 266580, Peoples R China
[2] CSIRO Data61, Sydney 2015, Australia
[3] Southwest Petr Univ, Sch Comp Sci, Chengdu 610500, Peoples R China
[4] Tiangong Univ, Sch Software, Tianjin 300387, Peoples R China
[5] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore 639798, Singapore
关键词
heterogeneous federated learning; adaptive knowledge distillation; malicious client identification; trustworthy knowledge aggregation;
D O I
10.3390/e26010096
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
Federated learning allows multiple parties to train models while jointly protecting user privacy. However, traditional federated learning requires each client to have the same model structure to fuse the global model. In real-world scenarios, each client may need to develop personalized models based on its environment, making it difficult to perform federated learning in a heterogeneous model environment. Some knowledge distillation methods address the problem of heterogeneous model fusion to some extent. However, these methods assume that each client is trustworthy. Some clients may produce malicious or low-quality knowledge, making it difficult to aggregate trustworthy knowledge in a heterogeneous environment. To address these challenges, we propose a trustworthy heterogeneous federated learning framework (FedTKD) to achieve client identification and trustworthy knowledge fusion. Firstly, we propose a malicious client identification method based on client logit features, which can exclude malicious information in fusing global logit. Then, we propose a selectivity knowledge fusion method to achieve high-quality global logit computation. Additionally, we propose an adaptive knowledge distillation method to improve the accuracy of knowledge transfer from the server side to the client side. Finally, we design different attack and data distribution scenarios to validate our method. The experiment shows that our method outperforms the baseline methods, showing stable performance in all attack scenarios and achieving an accuracy improvement of 2% to 3% in different data distributions.
引用
收藏
页数:31
相关论文
共 33 条
[1]  
Blanchard P, 2017, ADV NEUR IN, V30
[2]   FedHe: Heterogeneous Models and Communication-Efficient Federated Learning [J].
Chan, Yun Hin ;
Ngai, Edith C. H. .
2021 17TH INTERNATIONAL CONFERENCE ON MOBILITY, SENSING AND NETWORKING (MSN 2021), 2021, :207-214
[3]  
Chen HC, 2023, Arxiv, DOI arXiv:2301.08968
[4]  
Chen Leiming, 2025, IEEE Transactions on Artificial Intelligence, V6, P301, DOI 10.1109/TAI.2024.3355362
[5]  
Chen LM, 2024, Arxiv, DOI [arXiv:2307.13716, DOI 10.31577/CAI202411]
[6]  
Cheng Sijie, 2021, arXiv
[7]   Efficient Knowledge Distillation from an Ensemble of Teachers [J].
Fukuda, Takashi ;
Suzuki, Masayuki ;
Kurata, Gakuto ;
Thomas, Samuel ;
Cui, Jia ;
Ramabhadran, Bhuvana .
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, :3697-3701
[8]  
Gao LZ, 2024, Arxiv, DOI arXiv:2201.03169
[9]  
He Y., 2022, P 36 AAAI C ARTIFICI
[10]  
Hinton G, 2015, Arxiv, DOI [arXiv:1503.02531, DOI 10.48550/ARXIV.1503.02531]