Personalized Federated Learning via Deviation Tracking Representation Learning

Cited by: 0
Authors
Jang, Jaewon [1]
Choi, Bong Jun [1]
Affiliations
[1] Soongsil Univ, Comp Sci & Engn, Seoul, South Korea
Source
38th International Conference on Information Networking, ICOIN 2024 | 2024
Keywords
federated learning (FL); data heterogeneity; personalized federated learning (PFL); representation learning; meta-learning
DOI
10.1109/ICOIN59985.2024.10572208
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Federated learning preserves privacy by training on individual client devices in a decentralized manner, so that only model weights are shared with a central server. However, data heterogeneity across clients presents challenges. This paper focuses on representation learning, a variant of personalized federated learning. According to various studies, a representation learning model can be divided into two parts: the base layer, which is shared with and updated on the server, and the head layer, which is localized to individual clients. The proposed approach uses only the base layer for both local and global training, arguing that the head layer might introduce noise due to data heterogeneity. Because this noise can reduce accuracy, the head layer is used only for fine-tuning after training, to capture the unique characteristics of each client's data. We observed that prolonged base training can reduce accuracy after fine-tuning. As a countermeasure, we propose a method that determines the best round for fine-tuning by monitoring the standard deviation of test accuracy across clients. This strategy aims to generalize the global model for all clients before fine-tuning. The study highlights the downside of excessive base training on fine-tuning accuracy and introduces a novel approach to pinpoint the optimal moment for fine-tuning, thereby minimizing computational and communication overhead. We achieved an accuracy of 53.6%, higher than that of other approaches, with a minor trade-off in communication rounds.
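
The abstract does not spell out the exact selection rule, so the following is only a minimal sketch of the general idea of tracking the spread of per-client test accuracy to choose a fine-tuning round. The function name `pick_finetune_round`, the `patience` parameter, and the "lowest standard deviation so far" criterion are all assumptions made for illustration, not the authors' published algorithm.

```python
import statistics
from typing import Optional, Sequence


def pick_finetune_round(
    per_round_client_acc: Sequence[Sequence[float]],
    patience: int = 3,
) -> Optional[int]:
    """Return the 0-based round whose cross-client accuracy spread is lowest.

    Hypothetical rule (an assumption, not taken from the paper): track the
    population standard deviation of client test accuracies after each round
    and stop once it has failed to improve for `patience` consecutive rounds.
    """
    best_round: Optional[int] = None
    best_std = float("inf")
    stale = 0  # consecutive rounds without a new minimum

    for rnd, accs in enumerate(per_round_client_acc):
        std = statistics.pstdev(accs)  # spread of accuracy across clients
        if std < best_std:
            best_std = std
            best_round = rnd
            stale = 0
        else:
            stale += 1
            if stale >= patience:
                break  # spread stopped shrinking; stop base training here

    return best_round


# Illustrative (made-up) accuracies for three clients over six rounds.
history = [
    [0.20, 0.60, 0.40],
    [0.35, 0.50, 0.45],
    [0.42, 0.46, 0.44],
    [0.40, 0.50, 0.45],
    [0.30, 0.55, 0.45],
    [0.20, 0.60, 0.50],
]
chosen_round = pick_finetune_round(history)
```

In this sketch, head-layer fine-tuning would start from the base model saved at `chosen_round`. Stopping early on a patience counter is one way such a rule could avoid extra communication rounds, though the threshold would need tuning per dataset.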
Pages: 762-766
Page count: 5