Empirical Measurement of Client Contribution for Federated Learning With Data Size Diversification

被引：2

作者：

Shyn, Sung Kuk ^{[1
]}

Kim, Donghee ^{[2
]}

Kim, Kwangsu ^{[3
]}

机构：

[1] Sungkyunkwan Univ, Dept Artificial Intelligence, Suwon, Gyonggi do, South Korea

[2] Sungkyunkwan Univ, Dept Comp Sci & Engn, Suwon, Gyonggi do, South Korea

[3] Sungkyunkwan Univ, Coll Comp & Informat, Suwon, Gyonggi do, South Korea

来源：

IEEE ACCESS | 2022年 / 10卷

关键词：

Client contribution; client selection; data valuation; data heterogeneity; federated learning; incentive mechanism; shapley value;

D O I：

10.1109/ACCESS.2022.3210950

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Client contribution evaluation is crucial in federated learning(FL) to effectively select influential clients. Contrary to data valuation in centralized settings, client contribution evaluation in FL faces a lack of data accessibility and consequently challenges stable quantification of the impact of data heterogeneity. To address this instability of client contribution evaluation, we introduce an empirical method, Federated Client Contribution Evaluation through Accuracy Approximation(FedCCEA), which exploits data size as a tool for client contribution evaluation. After several FL simulations, FedCCEA approximates the test accuracy using the sampled data size and extracts the client contribution from the trained accuracy approximator. In addition, FedCCEA grants data size diversification, which reduces the massive variation in accuracy resulting from game-theoretic strategies. Several experiments have shown that FedCCEA strengthens the robustness to diverse heterogeneous data environments and the practicality of partial participation.

引用

页码：118563 / 118574

页数：12

共 47 条

[1] Barabanov V. F., 2013, WORLD APPL SCI J, V23, P1239, DOI DOI 10.5829/idosi.wasj.2013.23.09.13136
[2] Cohen G, 2017, IEEE IJCNN, P2921, DOI 10.1109/IJCNN.2017.7966217
[3] DETECTION OF INFLUENTIAL OBSERVATION IN LINEAR-REGRESSION
COOK, RD
[J]. TECHNOMETRICS, 1977, 19 (01) : 15 - 18
[4] Fallah A, 2020, ADV NEUR IN, V33
[5] An Online Outlier Identification and Removal Scheme for Improving Fault Detection Performance
Ferdowsi, Hasan
Jagannathan, Sarangapani
Zawodniok, Maciej
[J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2014, 25 (05) : 908 - 919
[6] Classification in the Presence of Label Noise: a Survey
Frenay, Benoit
Verleysen, Michel
[J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2014, 25 (05) : 845 - 869
[7] Ghorbani A, 2019, PR MACH LEARN RES, V97
[8] Graves A, 2012, STUD COMPUT INTELL, V385, P1, DOI [10.1007/978-3-642-24797-2, 10.1162/neco.1997.9.1.1]
[9] Himayat N., 2021, PROC INT C LEARN REP, P1
[10] Cho YJ, 2020, Arxiv, DOI arXiv:2010.01243

← 1 2 3 4 5 →