Feddaw: Dual Adaptive Weighted Federated Learning for Non-IID Medical Data

被引:0
作者
Ren, Linan [1 ]
Li, Kaixin [1 ]
An, Ying [1 ]
Liu, Yuan [2 ]
Chen, Xianlai [1 ,3 ]
机构
[1] Cent South Univ, Big Data Inst, Changsha 410083, Peoples R China
[2] Cent South Univ, Sch Comp Sci & Engn, Changsha 410083, Peoples R China
[3] Cent South Univ, Coll Hunan Prov, Key Lab Med Informat Res, Changsha, Peoples R China
来源
BIOINFORMATICS RESEARCH AND APPLICATIONS, PT III, ISBRA 2024 | 2024年 / 14956卷
关键词
Federated Learning; Non-IID; Medical Data; Disease Diagnosis;
D O I
10.1007/978-981-97-5087-0_1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The use of deep learning methods in disease diagnosis holds great promise with the development of medical big data. However, the scale of parameters in deep learning models, which can often reach millions, requires learning from large and diverse medical datasets to achieve the accuracy required for clinical applications. The challenges of cross-domain, decentralization, and data privacy in medical data have constrained the development of this field. Federated learning (FL) addresses these challenges by exchanging model parameters between clients and servers to share the model. However, in the case of medical data, there may be significant disparities in data quality among medical institutions, leading to imbalances in data volume and labeling, which may significantly affect model performance. Traditional FL approaches typically use simple methods such as averaging or weighted averaging during the parameter aggregation process, ignoring the Non-IID (Non-Independent and Identically Distributed) problem among clients. In this paper, a novel FL approach, Feddaw, is proposed based on the characteristics of non-IID medical data distribution. Feddaw aims to reduce the negative impact of label distribution shift in medical data by limiting the probability weighting factor of the CNN classification layer during client-side local training. Additionally, it verifies the accuracy of the client-side model in each round at the server-side, using accuracy-based weight aggregation to balance the negative impact of different data sample shifts. The experimental results show that the proposed Feddaw outperforms traditional FL methods in medical disease diagnosis.
引用
收藏
页码:1 / 13
页数:13
相关论文
共 22 条
  • [1] Dhillon I., 2020, Proceedings of Machine Learning and Systems, V2, P429
  • [2] Jeong E, 2023, Arxiv, DOI arXiv:1811.11479
  • [3] Karimireddy SP, 2020, PR MACH LEARN RES, V119
  • [4] Konečny J, 2017, Arxiv, DOI arXiv:1610.05492
  • [5] Kopparapu K, 2020, Arxiv, DOI arXiv:2006.09637
  • [6] Krizhevsky A, 2010, CONVOLUTIONAL DEEP B
  • [7] Li A, 2020, Arxiv, DOI arXiv:2008.03371
  • [8] Li T, 2020, Arxiv, DOI arXiv:1812.06127
  • [9] FedRS: Federated Learning with Restricted Softmax for Label Distribution Non-IID Data
    Li, Xin-Chun
    Zhan, De-Chuan
    [J]. KDD '21: PROCEEDINGS OF THE 27TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2021, : 995 - 1005
  • [10] McMahan H.B., 2016, CoRR, DOI 10.48550/arXiv.1602.05629