Privacy-preserving federated learning for collaborative medical data mining in multi-institutional settings

被引:0
作者
Rahul Haripriya [1 ]
Nilay Khare [1 ]
Manish Pandey [1 ]
机构
[1] Department of Computer Science and Engineering, Maulana Azad National Institute of Technology, Bhopal
关键词
Artificial intelligence; Data privacy; Deep learning; Federated learning; Image classification; Machine learning; Transfer learning;
D O I
10.1038/s41598-025-97565-4
中图分类号
学科分类号
摘要
Ensuring data privacy in medical image classification is a critical challenge in healthcare, especially with the increasing reliance on AI-driven diagnostics. In fact, over 30% of healthcare organizations globally have experienced a data breach in the last year, highlighting the need for secure solutions. This study investigates the integration of transfer learning and federated learning for privacy-preserving medical image classification using GoogLeNet and VGG16 as baseline models to evaluate the generalizability of the proposed framework. Pre-trained on ImageNet and fine-tuned on three specialized medical datasets for TB chest X-rays, brain tumor MRI scans, and diabetic retinopathy images, these models achieved high classification accuracy across various aggregation methods. Additionally, the proposed dynamic aggregation method was further analyzed using modern architectures, EfficientNetV2 and ResNet-RS, to assess the scalability and robustness of the model. A key contribution is the introduction of a novel adaptive aggregation method, which dynamically alternates between Federated Averaging (FedAvg) and Federated Stochastic Gradient Descent (FedSGD), based on data divergence during communication rounds. This approach optimizes model convergence while preserving privacy in collaborative settings. The results demonstrate that transfer learning, when combined with federated learning, offers a scalable, robust, and secure solution for real-world medical diagnostics, enabling healthcare institutions to train highly accurate models without compromising sensitive patient data. © The Author(s) 2025.
引用
收藏
相关论文
共 50 条
  • [1] Liu W., Liang S., Qin X., A novel embedded kernel cnn-pcff algorithm for breast cancer pathological image classification, Sci. Rep, 14, (2024)
  • [2] Antunes R.S., Andre da Costa C., Kuderle A., Yari I.A., Eskofier B., Systematic review and architecture proposal. Federated learning for healthcare. In ACM Transactions on Intelligent Systems and Technology (TIST), Vol, 1–23, (2022)
  • [3] Nguyen D.C., Et al., Federated learning for smart healthcare: A survey, ACM Comput. Surv. (Csur), 55, pp. 1-37, (2022)
  • [4] Xu J., Et al., Federated learning for healthcare informatics, J. Healthc. Inform. Res, 5, pp. 1-19, (2021)
  • [5] Xu C., Et al., A deep image classification model based on prior feature knowledge embedding and application in medical diagnosis, Sci. Rep, 14, (2024)
  • [6] Chung H., Lee J.S., Federated influencer learning for secure and efficient collaborative learning in realistic medical database environment, Sci. Rep, 14, (2024)
  • [7] Pan Z., Et al., Efficient federated learning for pediatric pneumonia on chest x-ray classification, Sci. Rep, 14, (2024)
  • [8] Rashidi G., Et al., The potential of federated learning for self-configuring medical object detection in heterogeneous data distributions, Sci. Rep, 14, (2024)
  • [9] Darzi E., Sijtsema N.M., van Ooijen P., A comparative study of federated learning methods for COVID-19 detection, Sci. Rep, 14, (2024)
  • [10] Gao H., Et al., Swinbtc: Transfer learning to brain tumor classification for healthcare electronics using augmented MR images, IEEE Transactions on Consumer Electronics, (2025)