Empirical Measurement of Client Contribution for Federated Learning With Data Size Diversification

被引:2
作者
Shyn, Sung Kuk [1 ]
Kim, Donghee [2 ]
Kim, Kwangsu [3 ]
机构
[1] Sungkyunkwan Univ, Dept Artificial Intelligence, Suwon, Gyonggi do, South Korea
[2] Sungkyunkwan Univ, Dept Comp Sci & Engn, Suwon, Gyonggi do, South Korea
[3] Sungkyunkwan Univ, Coll Comp & Informat, Suwon, Gyonggi do, South Korea
来源
IEEE ACCESS | 2022年 / 10卷
关键词
Client contribution; client selection; data valuation; data heterogeneity; federated learning; incentive mechanism; shapley value;
D O I
10.1109/ACCESS.2022.3210950
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Client contribution evaluation is crucial in federated learning(FL) to effectively select influential clients. Contrary to data valuation in centralized settings, client contribution evaluation in FL faces a lack of data accessibility and consequently challenges stable quantification of the impact of data heterogeneity. To address this instability of client contribution evaluation, we introduce an empirical method, Federated Client Contribution Evaluation through Accuracy Approximation(FedCCEA), which exploits data size as a tool for client contribution evaluation. After several FL simulations, FedCCEA approximates the test accuracy using the sampled data size and extracts the client contribution from the trained accuracy approximator. In addition, FedCCEA grants data size diversification, which reduces the massive variation in accuracy resulting from game-theoretic strategies. Several experiments have shown that FedCCEA strengthens the robustness to diverse heterogeneous data environments and the practicality of partial participation.
引用
收藏
页码:118563 / 118574
页数:12
相关论文
共 47 条
  • [1] Barabanov V. F., 2013, WORLD APPL SCI J, V23, P1239, DOI DOI 10.5829/idosi.wasj.2013.23.09.13136
  • [2] Cohen G, 2017, IEEE IJCNN, P2921, DOI 10.1109/IJCNN.2017.7966217
  • [3] DETECTION OF INFLUENTIAL OBSERVATION IN LINEAR-REGRESSION
    COOK, RD
    [J]. TECHNOMETRICS, 1977, 19 (01) : 15 - 18
  • [4] Fallah A, 2020, ADV NEUR IN, V33
  • [5] An Online Outlier Identification and Removal Scheme for Improving Fault Detection Performance
    Ferdowsi, Hasan
    Jagannathan, Sarangapani
    Zawodniok, Maciej
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2014, 25 (05) : 908 - 919
  • [6] Classification in the Presence of Label Noise: a Survey
    Frenay, Benoit
    Verleysen, Michel
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2014, 25 (05) : 845 - 869
  • [7] Ghorbani A, 2019, PR MACH LEARN RES, V97
  • [8] Graves A, 2012, STUD COMPUT INTELL, V385, P1, DOI [10.1007/978-3-642-24797-2, 10.1162/neco.1997.9.1.1]
  • [9] Himayat N., 2021, PROC INT C LEARN REP, P1
  • [10] Cho YJ, 2020, Arxiv, DOI arXiv:2010.01243