Stochastic Channel-Based Federated Learning With Neural Network Pruning for Medical Data Privacy Preservation: Model Development and Experimental Validation

被引：4

作者：

Shao, Rulin ^{[1
]}

He, Hongyu ^{[2
]}

Chen, Ziwei ^{[3
]}

Liu, Hui ^{[4
]}

Liu, Dianbo ^{[5
]}

机构：

[1] Xi An Jiao Tong Univ, Dept Math & Stat, Xian, Peoples R China

[2] Xi An Jiao Tong Univ, Dept Elect Engn, Xian, Peoples R China

[3] Beijing Jiaotong Univ, Beijing, Peoples R China

[4] Mianyang Vocat Coll, Dept Math, Mianyang, Sichuan, Peoples R China

[5] MIT, Comp Sci & Artificial Intelligence Lab, 77 Massachusetts Ave, Cambridge, MA 02139 USA

来源：

JMIR FORMATIVE RESEARCH | 2020年 / 4卷 / 12期

关键词：

federated learning; differential privacy preserving; neural network pruning; health care; privacy; medical data; machine learning; neural network;

D O I：

10.2196/17265

中图分类号：

R19 [保健组织与事业（卫生事业管理）];

学科分类号：

摘要：

Background: Artificial neural networks have achieved unprecedented success in the medical domain. This success depends on the availability of massive and representative datasets. However, data collection is often prevented by privacy concerns, and people want to take control over their sensitive information during both the training and using processes. Objective: To address security and privacy issues, we propose a privacy-preserving method for the analysis of distributed medical data. The proposed method, termed stochastic channel-based federated learning (SCBFL), enables participants to train a high-performance model cooperatively and in a distributed manner without sharing their inputs. Methods: We designed, implemented, and evaluated a channel-based update algorithm for a central server in a distributed system. The update algorithm will select the channels with regard to the most active features in a training loop, and then upload them as learned information from local datasets. A pruning process, which serves as a model accelerator, was further applied to the algorithm based on the validation set. Results: We constructed a distributed system consisting of 5 clients and 1 server. Our trials showed that the SCBFL method can achieve an area under the receiver operating characteristic curve (AUC-ROC) of 0.9776 and an area under the precision-recall curve (AUC-PR) of 0.9695 with only 10% of channels shared with the server. Compared with the federated averaging algorithm, the proposed SCBFL method achieved a 0.05388 higher AUC-ROC and 0.09695 higher AUC-PR. In addition, our experiment showed that 57% of the time is saved by the pruning process with only a reduction of 0.0047 in AUC-ROC performance and a reduction of 0.0068 in AUC-PR performance. Conclusions: In this experiment, our model demonstrated better performance and a higher saturating speed than the federated averaging method, which reveals all of the parameters of local models to the server. The saturation rate of performance could be promoted by introducing a pruning process and further improvement could be achieved by tuning the pruning rate.

引用

页数：16

共 49 条

[1] Deep Learning with Differential Privacy [J].

Abadi, Martin ;

Chu, Andy ;

Goodfellow, Ian ;

McMahan, H. Brendan ;

Mironov, Ilya ;

Talwar, Kunal ;

Zhang, Li .

CCS'16: PROCEEDINGS OF THE 2016 ACM SIGSAC CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY, 2016, :308-318

[2] Big healthcare data: preserving security and privacy [J].

Abouelmehdi, Karim ;

Beni-Hessane, Abderrahim ;

Khaloufi, Hayat .

JOURNAL OF BIG DATA, 2018, 5 (01)

[3]

Adam Nabil, 2007, AMIA Annu Symp Proc, P1

[4]

[Anonymous], 2012, Proc. the 26th International Conference on Neural Information Processing Systems

[5]

Bagdasaryan E, 2020, PR MACH LEARN RES, V108, P2938

[6] Private Empirical Risk Minimization: Efficient Algorithms and Tight Error Bounds [J].

Bassily, Raef ;

Smith, Adam ;

Thakurta, Abhradeep .

2014 55TH ANNUAL IEEE SYMPOSIUM ON FOUNDATIONS OF COMPUTER SCIENCE (FOCS 2014), 2014, :464-473

[7]

Bertino E, 2005, PROC INT CONF DATA, P521

[8]

Bonawitz K., 2019, MLSYS

[9] Practical Secure Aggregation for Privacy-Preserving Machine Learning [J].

Bonawitz, Keith ;

Ivanov, Vladimir ;

Kreuter, Ben ;

Marcedone, Antonio ;

McMahan, H. Brendan ;

Patel, Sarvar ;

Ramage, Daniel ;

Segal, Aaron ;

Seth, Karn .

CCS'17: PROCEEDINGS OF THE 2017 ACM SIGSAC CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY, 2017, :1175-1191

[10]

Chilimbi Trishul, 2014, Proceedings of the 11th USENIX Symposium on Operating Systems Design and Implementation (OSDI '14). OSDI '14, P571

← 1 2 3 4 5 →