Privacy-preserving Collaborative Training for Medical Image Analysis Based on Multi-Blockchain

被引:2
作者
Zhang, Wanlu [1 ]
Wang, Qigang [1 ]
Li, Mei [1 ]
机构
[1] Lenovo, AI Lab, Beijing, Peoples R China
关键词
Blockchain; deep learning; transfer learning; personalized learning; distributed training; medical image analysis; ARCHITECTURES; MANAGEMENT;
D O I
10.2174/1386207323666201022110616
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: As artificial intelligence and big data analysis develop rapidly, data privacy, especially patient medical data privacy, is getting more and more attention. Objective: The study aims to strengthen the protection of private data while ensuring the model training process; this article introduces a multi-Blockchain-based decentralized collaborative machine learning training method for medical image analysis. In this way, researchers from different medical institutions are able to collaborate to train models without exchanging sensitive patient data. Methods: Partial parameter update method is applied to prevent indirect privacy leakage during model propagation. With the peer-to-peer communication in the multi-Blockchain system, a machine learning task can leverage auxiliary information from another similar task in another Blockchain. In addition, after the collaborative training process, personalized models of different medical institutions will be trained. Results: The experimental results show that our method achieves similar performance with the centralized model-training method by collecting data sets of all participants and prevents private data leakage at the same time. Transferring auxiliary information from similar task on another Blockchain has also been proven to effectively accelerate model convergence and improve model accuracy, especially in the scenario of absence of data. Personalization training process further improves model performance. Conclusion: Our approach can effectively help researchers from different organizations to achieve collaborative training without disclosing their private data.
引用
收藏
页码:933 / 946
页数:14
相关论文
共 68 条
[1]  
Abadi M, 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265
[2]  
Abbasi M., 2019, ARXIV PREPRINT ARXIV
[3]  
[Anonymous], 2018, ARXIV PREPRINT ARXIV
[4]  
[Anonymous], 2018, Blockchain in Healthcare Today, DOI [DOI 10.30953/BHTY.V1.10, 10.30953/bhty.v1.10]
[5]   Data Descriptor: Advancing The Cancer Genome Atlas glioma MRI collections with expert segmentation labels and radiomic features [J].
Bakas, Spyridon ;
Akbari, Hamed ;
Sotiras, Aristeidis ;
Bilello, Michel ;
Rozycki, Martin ;
Kirby, Justin S. ;
Freymann, John B. ;
Farahani, Keyvan ;
Davatzikos, Christos .
SCIENTIFIC DATA, 2017, 4
[6]   Ask the GRU: Multi-task Learning for Deep Text Recommendations [J].
Bansal, Trapit ;
Belanger, David ;
McCallum, Andrew .
PROCEEDINGS OF THE 10TH ACM CONFERENCE ON RECOMMENDER SYSTEMS (RECSYS'16), 2016, :107-114
[7]   Learning Deep Architectures for AI [J].
Bengio, Yoshua .
FOUNDATIONS AND TRENDS IN MACHINE LEARNING, 2009, 2 (01) :1-127
[8]  
Bonawitz Keith, 2019, P MACHINE LEARNING S, P374, DOI 10.48550/arXiv.1902.01046
[9]   Large-Scale Machine Learning with Stochastic Gradient Descent [J].
Bottou, Leon .
COMPSTAT'2010: 19TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL STATISTICS, 2010, :177-186
[10]  
Carroll JL, 2005, IEEE IJCNN, P803