Vertical federated learning based on data subset representation for healthcare application

被引:0
作者
Shi, Yukun [1 ]
Zhang, Jilin [1 ]
Xue, Meiting [1 ]
Zeng, Yan [2 ]
Jia, Gangyong [2 ]
Yu, Qihong [2 ]
Li, Miaoqi [2 ]
机构
[1] Hangzhou Dianzi Univ, Sch Cyberspace, Hangzhou 310018, Peoples R China
[2] Hangzhou Dianzi Univ, Sch Comp Sci & Technol, Hangzhou 310018, Peoples R China
基金
中国国家自然科学基金;
关键词
Vertical federated learning; Latent feature representation; Smart healthcare; Privacy preservation;
D O I
10.1016/j.cmpb.2025.108623
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Background and Objective : Artificial intelligence is increasingly essential for disease classification and clinical diagnosis tasks in healthcare. Given the strict privacy needs of healthcare data, Vertical Federated Learning (VFL) has been introduced. VFL allows multiple hospitals to collaboratively train models on vertically partitioned data, where each holds only the patient's partial data features, thus maintaining patient confidentiality. However, VFL applications in healthcare scenarios with fewer samples and labels are challenging because existing methods heavily depend on labeled samples and do not consider the intrinsic connections among the data across hospitals. Methods : This paper proposes FedRL, a representation-based VFL method that enhances the performance of downstream tasks by utilizing aligned data for federated representation pretraining. The proposed method creates the same feature dimensions subsets by splitting the local data, exploiting the relationships among these subsets, constructing a bespoke loss function, and collaboratively training a representation model to these subsets across all participating hospitals. This model captures the latent representations of the global data, which are then applied to the downstream classification tasks. Results and Conclusion : The proposed FedRL method was validated through experiments on three healthcare datasets. The results demonstrate that the proposed method outperforms several existing methods across three performance metrics. Specifically, FedRL achieves average improvements of 4.7%, 5.6%, and 4.8% in accuracy, AUC, and F1-score, respectively, compared to current methods. In addition, FedRL demonstrates greater robustness and consistent performance in scenarios with limited labeled samples, thereby confirming its effectiveness and potential use in healthcare data analysis.
引用
收藏
页数:11
相关论文
共 44 条
  • [1] Interpretation of intelligence in CNN-pooling processes: a methodological survey
    Akhtar, Nadeem
    Ragavendran, U.
    [J]. NEURAL COMPUTING & APPLICATIONS, 2020, 32 (03) : 879 - 898
  • [2] Federated Learning for Healthcare: Systematic Review and Architecture Proposal
    Antunes, Rodolfo Stoffel
    da Costa, Cristiano Andre
    Kuederle, Arne
    Yari, Imrana Abdullahi
    Eskofier, Bjoern
    [J]. ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2022, 13 (04)
  • [3] Practical Secure Aggregation for Privacy-Preserving Machine Learning
    Bonawitz, Keith
    Ivanov, Vladimir
    Kreuter, Ben
    Marcedone, Antonio
    McMahan, H. Brendan
    Patel, Sarvar
    Ramage, Daniel
    Segal, Aaron
    Seth, Karn
    [J]. CCS'17: PROCEEDINGS OF THE 2017 ACM SIGSAC CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY, 2017, : 1175 - 1191
  • [4] Buitinck Lars, 2013, API DESIGN MACHINE L, P108
  • [5] Implementing Vertical Federated Learning Using Autoencoders: Practical Application, Generalizability, and Utility Study
    Cha, Dongchul
    Sung, MinDong
    Park, Yu-Rang
    [J]. JMIR MEDICAL INFORMATICS, 2021, 9 (06)
  • [6] Explainable, Domain-Adaptive, and Federated Artificial Intelligence in Medicine
    Chaddad, Ahmad
    Lu, Qizong
    Li, Jiali
    Katib, Yousef
    Kateb, Reem
    Tanougast, Camel
    Bouridane, Ahmed
    Abdulkadir, Ahmed
    [J]. IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2023, 10 (04) : 859 - 876
  • [7] Choudhury Olivia, 2019, AMIA Annu Symp Proc, V2019, P313
  • [8] Federated Learning for Smart Healthcare: A Survey
    Dinh C Nguyen
    Quoc-Viet Pham
    Pathirana, Pubudu N.
    Ding, Ming
    Seneviratne, Aruna
    Lin, Zihuai
    Dobre, Octavia
    Hwang, Won-Joo
    [J]. ACM COMPUTING SURVEYS, 2023, 55 (03)
  • [9] Vertical federated learning-based feature selection with non-overlapping sample utilization
    Feng, Siwei
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2022, 208
  • [10] Transfer Learning with Partial Observability Applied to Cervical Cancer Screening
    Fernandes, Kelwin
    Cardoso, Jaime S.
    Fernandes, Jessica
    [J]. PATTERN RECOGNITION AND IMAGE ANALYSIS (IBPRIA 2017), 2017, 10255 : 243 - 250