FedPseudo: Privacy-Preserving Pseudo Value-Based Deep Learning Models for Federated Survival Analysis

被引:4
作者
Rahman, Md Mahmudur [1 ]
Purushotham, Sanjay [1 ]
机构
[1] Univ Maryland Baltimore Cty, Baltimore, MD 21228 USA
来源
PROCEEDINGS OF THE 29TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2023 | 2023年
基金
美国国家科学基金会;
关键词
Survival analysis; Federated Learning; Deep neural networks; REGRESSION-MODELS; INFERENCE;
D O I
10.1145/3580305.3599348
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Survival analysis, aka time-to-event analysis, has a wide-ranging impact on patient care. Federated Survival Analysis (FSA) is an emerging Federated Learning (FL) paradigm for performing survival analysis on distributed decentralized data available at multiple medical institutions. FSA enables individual medical institutions, referred to as clients, to improve their survival predictions while ensuring privacy. However, FSA faces challenges due to non-linear and non-IID data distributions among clients, as well as bias caused by censoring. Although recent studies have adapted Cox Proportional Hazards (CoxPH) survival models for FSA, a systematic exploration of these challenges is currently lacking. In this paper, we address these critical challenges by introducing FedPseudo, a pseudo value-based deep learning framework for FSA. FedPseudo uses deep learning models to learn robust representations from non-linear survival data, leverages the power of pseudo values to handle non-uniform censoring, and employs FL algorithms such as FedAvg to learn model parameters. We propose a novel and simple approach for estimating pseudo values for FSA. We provide theoretical proof that the estimated pseudo values, referred to as Federated Pseudo Values, are consistent. Moreover, our empirical results demonstrate that they can be computed faster than traditional methods of deriving pseudo values. To ensure and enhance the privacy of both the estimated pseudo values and the shared model parameters, we systematically investigate the application of differential privacy (DP) on both the federated pseudo values and local model updates. Furthermore, we adapt V-Usable Information metric to quantify the informativeness of a client's data for training a survival model and utilize this metric to show the advantages of participating in FSA. We conducted extensive experiments on synthetic and real-world survival datasets to demonstrate that our FedPseudo framework achieves better performance than other FSA approaches and performs similarly to the best centrally trained deep survival model. Moreover, FedPseudo consistently achieves superior results across different censoring settings.
引用
收藏
页码:1999 / 2009
页数:11
相关论文
共 51 条
[1]   NONPARAMETRIC INFERENCE FOR A FAMILY OF COUNTING PROCESSES [J].
AALEN, O .
ANNALS OF STATISTICS, 1978, 6 (04) :701-726
[2]   Pseudo-observations in survival analysis [J].
Andersen, Per Kragh ;
Perme, Maja Pohar .
STATISTICAL METHODS IN MEDICAL RESEARCH, 2010, 19 (01) :71-99
[3]  
Andreux Mathieu, 2020, ARXIV200608997
[4]   A time-dependent discrimination index for survival data [J].
Antolini, L ;
Boracchi, P ;
Biganzoli, E .
STATISTICS IN MEDICINE, 2005, 24 (24) :3927-3944
[5]  
Archetti Alberto, 2023, ARXIV230202807
[6]  
Banerjee Soumya., 2022, bioRxiv
[7]  
Barrajon Enrique, 2020, ARXIV201208649
[8]   Variables with time-varying effects and the Cox model: Some statistical concepts illustrated with a prognostic factor study in breast cancer [J].
Bellera, Carine A. ;
MacGrogan, Gaetan ;
Debled, Marc ;
de lara, Christine Tunon ;
Brouste, Veronique ;
Mathoulin-Pelissier, Simone .
BMC MEDICAL RESEARCH METHODOLOGY, 2010, 10
[9]   Protecting patient privacy in survival analyses [J].
Bonomi, Luca ;
Jiang, Xiaoqian ;
Ohno-Machado, Lucila .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2020, 27 (03) :366-375
[10]   Time-dependent relevance of steroid receptors in breast cancer [J].
Coradini, D ;
Daidone, MG ;
Boracchi, P ;
Biganzoli, E ;
Oriana, S ;
Bresciani, G ;
Pellizzaro, C ;
Tomasic, G ;
Di Fronzo, G ;
Marubini, E .
JOURNAL OF CLINICAL ONCOLOGY, 2000, 18 (14) :2702-2709