Vertical federated learning based on data subset representation for healthcare application

被引:1
作者
Shi, Yukun [1 ]
Zhang, Jilin [1 ]
Xue, Meiting [1 ]
Zeng, Yan [2 ]
Jia, Gangyong [2 ]
Yu, Qihong [2 ]
Li, Miaoqi [2 ]
机构
[1] Hangzhou Dianzi Univ, Sch Cyberspace, Hangzhou 310018, Peoples R China
[2] Hangzhou Dianzi Univ, Sch Comp Sci & Technol, Hangzhou 310018, Peoples R China
基金
中国国家自然科学基金;
关键词
Vertical federated learning; Latent feature representation; Smart healthcare; Privacy preservation;
D O I
10.1016/j.cmpb.2025.108623
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Background and Objective : Artificial intelligence is increasingly essential for disease classification and clinical diagnosis tasks in healthcare. Given the strict privacy needs of healthcare data, Vertical Federated Learning (VFL) has been introduced. VFL allows multiple hospitals to collaboratively train models on vertically partitioned data, where each holds only the patient's partial data features, thus maintaining patient confidentiality. However, VFL applications in healthcare scenarios with fewer samples and labels are challenging because existing methods heavily depend on labeled samples and do not consider the intrinsic connections among the data across hospitals. Methods : This paper proposes FedRL, a representation-based VFL method that enhances the performance of downstream tasks by utilizing aligned data for federated representation pretraining. The proposed method creates the same feature dimensions subsets by splitting the local data, exploiting the relationships among these subsets, constructing a bespoke loss function, and collaboratively training a representation model to these subsets across all participating hospitals. This model captures the latent representations of the global data, which are then applied to the downstream classification tasks. Results and Conclusion : The proposed FedRL method was validated through experiments on three healthcare datasets. The results demonstrate that the proposed method outperforms several existing methods across three performance metrics. Specifically, FedRL achieves average improvements of 4.7%, 5.6%, and 4.8% in accuracy, AUC, and F1-score, respectively, compared to current methods. In addition, FedRL demonstrates greater robustness and consistent performance in scenarios with limited labeled samples, thereby confirming its effectiveness and potential use in healthcare data analysis.
引用
收藏
页数:11
相关论文
共 44 条
[1]   Interpretation of intelligence in CNN-pooling processes: a methodological survey [J].
Akhtar, Nadeem ;
Ragavendran, U. .
NEURAL COMPUTING & APPLICATIONS, 2020, 32 (03) :879-898
[2]   Federated Learning for Healthcare: Systematic Review and Architecture Proposal [J].
Antunes, Rodolfo Stoffel ;
da Costa, Cristiano Andre ;
Kuederle, Arne ;
Yari, Imrana Abdullahi ;
Eskofier, Bjoern .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2022, 13 (04)
[3]   Practical Secure Aggregation for Privacy-Preserving Machine Learning [J].
Bonawitz, Keith ;
Ivanov, Vladimir ;
Kreuter, Ben ;
Marcedone, Antonio ;
McMahan, H. Brendan ;
Patel, Sarvar ;
Ramage, Daniel ;
Segal, Aaron ;
Seth, Karn .
CCS'17: PROCEEDINGS OF THE 2017 ACM SIGSAC CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY, 2017, :1175-1191
[4]  
Buitinck L., 2013, P ECML PKDD WORKSH L, P108
[5]   Implementing Vertical Federated Learning Using Autoencoders: Practical Application, Generalizability, and Utility Study [J].
Cha, Dongchul ;
Sung, MinDong ;
Park, Yu-Rang .
JMIR MEDICAL INFORMATICS, 2021, 9 (06)
[6]   Explainable, Domain-Adaptive, and Federated Artificial Intelligence in Medicine [J].
Chaddad, Ahmad ;
Lu, Qizong ;
Li, Jiali ;
Katib, Yousef ;
Kateb, Reem ;
Tanougast, Camel ;
Bouridane, Ahmed ;
Abdulkadir, Ahmed .
IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2023, 10 (04) :859-876
[7]  
Choudhury Olivia, 2019, AMIA Annu Symp Proc, V2019, P313
[8]   Federated Learning for Smart Healthcare: A Survey [J].
Dinh C Nguyen ;
Quoc-Viet Pham ;
Pathirana, Pubudu N. ;
Ding, Ming ;
Seneviratne, Aruna ;
Lin, Zihuai ;
Dobre, Octavia ;
Hwang, Won-Joo .
ACM COMPUTING SURVEYS, 2023, 55 (03)
[9]   Vertical federated learning-based feature selection with non-overlapping sample utilization [J].
Feng, Siwei .
EXPERT SYSTEMS WITH APPLICATIONS, 2022, 208
[10]   Transfer Learning with Partial Observability Applied to Cervical Cancer Screening [J].
Fernandes, Kelwin ;
Cardoso, Jaime S. ;
Fernandes, Jessica .
PATTERN RECOGNITION AND IMAGE ANALYSIS (IBPRIA 2017), 2017, 10255 :243-250