Blockchain for Privacy Preserving and Trustworthy Distributed Machine Learning in Multicentric Medical Imaging (C-DistriM)

被引:49
作者
Zerka, Fadila [1 ,2 ]
Urovi, Visara [3 ]
Vaidyanathan, Akshayaa [1 ,2 ]
Barakat, Samir [2 ]
Leijenaar, Ralph T. H. [2 ]
Walsh, Sean [1 ,2 ]
Gabrani-Juma, Hanif [2 ]
Miraglio, Benjamin [2 ]
Woodruff, Henry C. [1 ,4 ]
Dumontier, Michel [3 ]
Lambin, Philippe [1 ,4 ]
机构
[1] Maastricht Univ, GROW Sch Oncol, Dept Precis Med, D Lab, NL-6229 GT Maastricht, Netherlands
[2] Oncoradiomics SA, B-4000 Liege, Belgium
[3] Maastricht Univ, Inst Data Sci IDS, NL-6229 GT Maastricht, Netherlands
[4] Maastricht Univ, Med Ctr, Dept Radiol & Nucl Med, NL-6202 AZ Maastricht, Netherlands
关键词
Data models; Training; Machine learning; Servers; Biomedical imaging; Blockchain; data privacy; decentralized learning; distributed learning; HEALTH-CARE; MODEL;
D O I
10.1109/ACCESS.2020.3029445
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The utility of Artificial Intelligence (AI) in healthcare strongly depends upon the quality of the data used to build models, and the confidence in the predictions they generate. Access to sufficient amounts of high-quality data to build accurate and reliable models remains problematic owing to substantive legal and ethical constraints in making clinically relevant research data available offsite. New technologies such as distributed learning offer a pathway forward, but unfortunately tend to suffer from a lack of transparency, which undermines trust in what data are used for the analysis. To address such issues, we hypothesized that, a novel distributed learning that combines sequential distributed learning with a blockchain-based platform, namely Chained Distributed Machine learning C-DistriM, would be feasible and would give a similar result as a standard centralized approach. C-DistriM enables health centers to dynamically participate in training distributed learning models. We demonstrate C-DistriM using the NSCLC-Radiomics open data to predict two-year lung-cancer survival. A comparison of the performance of this distributed solution, evaluated in six different scenarios, and the centralized approach, showed no statistically significant difference (AUCs between central and distributed models), all DeLong tests yielded p-val > 0.05. This methodology removes the need to blindly trust the computation in one specific server on a distributed learning network. This fusion of blockchain and distributed learning serves as a proof-of-concept to increase transparency, trust, and ultimately accelerate the adoption of AI in multicentric studies. We conclude that our blockchain-based model for sequential training on distributed datasets is a feasible approach, provides equivalent performance to the centralized approach.
引用
收藏
页码:183939 / 183951
页数:13
相关论文
共 51 条
[1]   Decoding tumour phenotype by noninvasive imaging using a quantitative radiomics approach [J].
Aerts, Hugo J. W. L. ;
Velazquez, Emmanuel Rios ;
Leijenaar, Ralph T. H. ;
Parmar, Chintan ;
Grossmann, Patrick ;
Cavalho, Sara ;
Bussink, Johan ;
Monshouwer, Rene ;
Haibe-Kains, Benjamin ;
Rietveld, Derek ;
Hoebers, Frank ;
Rietbergen, Michelle M. ;
Leemans, C. Rene ;
Dekker, Andre ;
Quackenbush, John ;
Gillies, Robert J. ;
Lambin, Philippe .
NATURE COMMUNICATIONS, 2014, 5
[2]  
[Anonymous], **DATA OBJECT**, DOI DOI 10.7937/K9/TCIA.2015.PF0M9REI
[3]  
[Anonymous], 2017, ICML
[4]  
[Anonymous], 1997, Machine Learning
[5]  
Barker E., 2019, NIST Special Publication 800-131A Revision 2-Transitioning the Use of Cryptographic Algorithms and Key Lengths, DOI DOI 10.6028/NIST.SP.800-131AR2
[6]  
Chen XH, 2018, IEEE INT CONF BIG DA, P1178, DOI 10.1109/BigData.2018.8622598
[7]  
Clack C.D., 2016, ARXIV160800771
[8]   The Cancer Imaging Archive (TCIA): Maintaining and Operating a Public Information Repository [J].
Clark, Kenneth ;
Vendt, Bruce ;
Smith, Kirk ;
Freymann, John ;
Kirby, Justin ;
Koppel, Paul ;
Moore, Stephen ;
Phillips, Stanley ;
Maffitt, David ;
Pringle, Michael ;
Tarbox, Lawrence ;
Prior, Fred .
JOURNAL OF DIGITAL IMAGING, 2013, 26 (06) :1045-1057
[9]   Blockchain for Internet of Things: A Survey [J].
Dai, Hong-Ning ;
Zheng, Zibin ;
Zhang, Yan .
IEEE INTERNET OF THINGS JOURNAL, 2019, 6 (05) :8076-8094
[10]   Distributed learning on 20 000+lung cancer patients - The Personal Health Train [J].
Deist, Timo M. ;
Dankers, Frank J. W. M. ;
Ojha, Priyanka ;
Marshall, M. Scott ;
Janssen, Tomas ;
Faivre-Finn, Corinne ;
Masciocchi, Carlotta ;
Valentini, Vincenzo ;
Wang, Jiazhou ;
Chen, Jiayan ;
Zhang, Zhen ;
Spezi, Emiliano ;
Button, Mick ;
Nuyttens, Joost Jan ;
Vernhout, Rene ;
van Soest, Johan ;
Jochems, Arthur ;
Monshouwer, Rene ;
Bussink, Johan ;
Price, Gareth ;
Lambin, Philippe ;
Dekker, Andre .
RADIOTHERAPY AND ONCOLOGY, 2020, 144 :189-200