Mutual Information Driven Federated Learning

Cited by: 29
Authors
Uddin, Md Palash [1]
Xiang, Yong [1]
Lu, Xuequan [1]
Yearwood, John [2]
Gao, Longxiang [1]
Affiliations
[1] Deakin Univ, Deakin Blockchain Innovat Lab, Sch Informat Technol, Geelong, Vic 3220, Australia
[2] Deakin Univ, Sch Informat Technol, Geelong, Vic 3220, Australia
Keywords
Data models; Training; Computational modeling; Servers; Mathematical model; Convergence; Analytical models; Distributed learning; Federated learning; Parallel optimization; Data parallelism; Information theory; Mutual information; Communication bottleneck; Data heterogeneity; Feature selection
DOI
10.1109/TPDS.2020.3040981
CLC number
TP301 [Theory and Methods]
Discipline code
081202
Abstract
Federated Learning (FL) is an emerging research field that yields a globally trained model from different local clients without violating data privacy. Existing FL techniques often ignore the effective distinction between local models and the aggregated global model during the client-side weight update, as well as the distinctions among local models during server-side aggregation. In this article, we propose a novel FL approach that resorts to mutual information (MI). Specifically, on the client side, the weight update is reformulated by minimizing the MI between the local and aggregated models and by employing a Negative Correlation Learning (NCL) strategy. On the server side, we select the top effective models for aggregation based on the MI between each individual local model and the previous aggregated model. We also theoretically prove the convergence of our algorithm. Experiments conducted on the MNIST, CIFAR-10, ImageNet, and clinical MIMIC-III datasets demonstrate that our method outperforms state-of-the-art techniques in terms of both communication and testing performance.
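The abstract names two MI-driven steps without giving their formulas. A minimal sketch of the client-side objective is shown below, assuming a PyTorch setting; the use of a negative MSE between softened outputs as a stand-in for the MI/NCL penalty, the name client_loss, and the weighting lam are illustrative assumptions, not the authors' formulation.

```python
# Hypothetical client-side objective (illustrative, not the paper's exact loss):
# task loss plus a penalty that discourages agreement with the frozen global
# model, in the spirit of minimizing local-global MI with an NCL-style term.
import torch
import torch.nn.functional as F

def client_loss(local_logits, global_logits, targets, lam=0.1):
    """Cross-entropy task loss plus a diversity penalty.

    Assumption: negative MSE between the two models' softmax outputs stands in
    as a cheap proxy for their mutual information / correlation.
    """
    task = F.cross_entropy(local_logits, targets)
    diversity = -F.mse_loss(F.softmax(local_logits, dim=1),
                            F.softmax(global_logits.detach(), dim=1))
    return task + lam * diversity
```

The server-side selection can be sketched in the same spirit. Here models are abstracted as callables returning hard predictions on a shared probe set, MI is estimated between those discrete predictions with scikit-learn, and ranking by highest MI is an assumption, since the abstract only says "top effective".

```python
# Hypothetical server-side MI-based selection plus plain FedAvg aggregation.
import numpy as np
from sklearn.metrics import mutual_info_score  # MI between discrete label arrays

def select_top_clients(local_models, prev_global_model, probe_x, top_k):
    """Score each local model by the MI between its predictions and those of
    the previous aggregated model on probe_x; return indices of the top_k."""
    global_preds = prev_global_model(probe_x)
    scores = np.array([mutual_info_score(global_preds, m(probe_x))
                       for m in local_models])
    return np.argsort(scores)[-top_k:]  # flip to [:top_k] for a lowest-MI rule

def fedavg(selected_weights):
    """Average the selected clients' weights layer by layer (equal weighting)."""
    return [np.mean(np.stack(layer), axis=0) for layer in zip(*selected_weights)]
```

Both snippets sketch the mechanism the abstract names, not the paper's algorithm; the authors' MI estimator, exact NCL term, and selection direction should be taken from the full text.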
Pages: 1526-1538
Page count: 13