Optimizing Federated Learning on Non-IID Data with Reinforcement Learning

被引：0

作者：

Wang, Hao ^{[1
]}

Kaplan, Zakhary ^{[1
]}

Niu, Di ^{[2
]}

Li, Baochun ^{[1
]}

机构：

[1] Univ Toronto, Toronto, ON, Canada

[2] Univ Alberta, Edmonton, AB, Canada

来源：

IEEE INFOCOM 2020 - IEEE CONFERENCE ON COMPUTER COMMUNICATIONS | 2020年

基金：

加拿大自然科学与工程研究理事会;

关键词：

D O I：

10.1109/infocom41043.2020.9155494

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

The widespread deployment of machine learning applications in ubiquitous environments has sparked interests in exploiting the vast amount of data stored on mobile devices. To preserve data privacy, Federated Learning has been proposed to learn a shared model by performing distributed training locally on participating devices and aggregating the local models into a global one. However, due to the limited network connectivity of mobile devices, it is not practical for federated learning to perform model updates and aggregation on all participating devices in parallel. Besides, data samples across all devices are usually not independent and identically distributed (IID), posing additional challenges to the convergence and speed of federated learning. In this paper, we propose FAVOR, an experience-driven control framework that intelligently chooses the client devices to participate in each round of federated learning to counterbalance the bias introduced by non-IID data and to speed up convergence. Through both empirical and mathematical analysis, we observe an implicit connection between the distribution of training data on a device and the model weights trained based on those data, which enables us to profile the data distribution on that device based on its uploaded model weights. We then propose a mechanism based on deep Q-learning that learns to select a subset of devices in each communication round to maximize a reward that encourages the increase of validation accuracy and penalizes the use of more communication rounds. With extensive experiments performed in PyTorch, we show that the number of communication rounds required in federated learning can be reduced by up to 49% on the MNIST dataset, 23% on FashionMNIST, and 42% on CIFAR-10, as compared to the Federated Averaging algorithm.

引用

页码：1698 / 1707

页数：10

共 50 条

[11] A Novel Approach for Federated Learning with Non-IID Data [J].

Nguyen, Hiep ;

Warrier, Harikrishna ;

Gupta, Yogesh .

2022 9TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING & MACHINE INTELLIGENCE, ISCMI, 2022, :62-67

[12] Federated Dictionary Learning from Non-IID Data [J].

Gkillas, Alexandros ;

Ampeliotis, Dimitris ;

Berberidis, Kostas .

2022 IEEE 14TH IMAGE, VIDEO, AND MULTIDIMENSIONAL SIGNAL PROCESSING WORKSHOP (IVMSP), 2022,

[13] Dual Adversarial Federated Learning on Non-IID Data [J].

Zhang, Tao ;

Yang, Shaojing ;

Song, Anxiao ;

Li, Guangxia ;

Dong, Xuewen .

KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, KSEM 2022, PT III, 2022, 13370 :233-246

[14] EFL: ELASTIC FEDERATED LEARNING ON NON-IID DATA [J].

Ma, Zichen ;

Lu, Yu ;

Li, Wenye ;

Cui, Shuguang .

CONFERENCE ON LIFELONG LEARNING AGENTS, VOL 199, 2022, 199

[15] Decoupled Federated Learning for ASR with Non-IID Data [J].

Zhu, Han ;

Wang, Jindong ;

Cheng, Gaofeng ;

Zhang, Pengyuan ;

Yan, Yonghong .

INTERSPEECH 2022, 2022, :2628-2632

[16] FedEL: Federated ensemble learning for non-iid data [J].

Wu, Xing ;

Pei, Jie ;

Han, Xian-Hua ;

Chen, Yen-Wei ;

Yao, Junfeng ;

Liu, Yang ;

Qian, Quan ;

Guo, Yike .

EXPERT SYSTEMS WITH APPLICATIONS, 2024, 237

[17] Contractible Regularization for Federated Learning on Non-IID Data [J].

Chen, Zifan ;

Wu, Zhe ;

Wu, Xian ;

Zhang, Li ;

Zhao, Jie ;

Yan, Yangtian ;

Zheng, Yefeng .

2022 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2022, :61-70

[18] Dynamic Clustering Federated Learning for Non-IID Data [J].

Chen, Ming ;

Wu, Jinze ;

Yin, Yu ;

Huang, Zhenya ;

Liu, Qi ;

Chen, Enhong .

ARTIFICIAL INTELLIGENCE, CICAI 2022, PT III, 2022, 13606 :119-131

[19] Federated Learning With Non-IID Data in Wireless Networks [J].

Zhao, Zhongyuan ;

Feng, Chenyuan ;

Hong, Wei ;

Jiang, Jiamo ;

Jia, Chao ;

Quek, Tony Q. S. ;

Peng, Mugen .

IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2022, 21 (03) :1927-1942

[20] An Optimization Method for Non-IID Federated Learning Based on Deep Reinforcement Learning [J].

Meng, Xutao ;

Li, Yong ;

Lu, Jianchao ;

Ren, Xianglin .

SENSORS, 2023, 23 (22)

← 1 2 3 4 5 →