Blockchain for Privacy Preserving and Trustworthy Distributed Machine Learning in Multicentric Medical Imaging (C-DistriM)

被引:49
作者
Zerka, Fadila [1 ,2 ]
Urovi, Visara [3 ]
Vaidyanathan, Akshayaa [1 ,2 ]
Barakat, Samir [2 ]
Leijenaar, Ralph T. H. [2 ]
Walsh, Sean [1 ,2 ]
Gabrani-Juma, Hanif [2 ]
Miraglio, Benjamin [2 ]
Woodruff, Henry C. [1 ,4 ]
Dumontier, Michel [3 ]
Lambin, Philippe [1 ,4 ]
机构
[1] Maastricht Univ, GROW Sch Oncol, Dept Precis Med, D Lab, NL-6229 GT Maastricht, Netherlands
[2] Oncoradiomics SA, B-4000 Liege, Belgium
[3] Maastricht Univ, Inst Data Sci IDS, NL-6229 GT Maastricht, Netherlands
[4] Maastricht Univ, Med Ctr, Dept Radiol & Nucl Med, NL-6202 AZ Maastricht, Netherlands
关键词
Data models; Training; Machine learning; Servers; Biomedical imaging; Blockchain; data privacy; decentralized learning; distributed learning; HEALTH-CARE; MODEL;
D O I
10.1109/ACCESS.2020.3029445
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The utility of Artificial Intelligence (AI) in healthcare strongly depends upon the quality of the data used to build models, and the confidence in the predictions they generate. Access to sufficient amounts of high-quality data to build accurate and reliable models remains problematic owing to substantive legal and ethical constraints in making clinically relevant research data available offsite. New technologies such as distributed learning offer a pathway forward, but unfortunately tend to suffer from a lack of transparency, which undermines trust in what data are used for the analysis. To address such issues, we hypothesized that, a novel distributed learning that combines sequential distributed learning with a blockchain-based platform, namely Chained Distributed Machine learning C-DistriM, would be feasible and would give a similar result as a standard centralized approach. C-DistriM enables health centers to dynamically participate in training distributed learning models. We demonstrate C-DistriM using the NSCLC-Radiomics open data to predict two-year lung-cancer survival. A comparison of the performance of this distributed solution, evaluated in six different scenarios, and the centralized approach, showed no statistically significant difference (AUCs between central and distributed models), all DeLong tests yielded p-val > 0.05. This methodology removes the need to blindly trust the computation in one specific server on a distributed learning network. This fusion of blockchain and distributed learning serves as a proof-of-concept to increase transparency, trust, and ultimately accelerate the adoption of AI in multicentric studies. We conclude that our blockchain-based model for sequential training on distributed datasets is a feasible approach, provides equivalent performance to the centralized approach.
引用
收藏
页码:183939 / 183951
页数:13
相关论文
共 51 条
[21]   Distributed learning: Developing a predictive model based on data from multiple hospitals without data leaving the hospital - A real life proof of concept [J].
Jochems, Arthur ;
Deist, Timo M. ;
Van Soest, Johan ;
Eble, Michael ;
Bulens, Paul ;
Coucke, Philippe ;
Dries, Wim ;
Lambin, Philippe ;
Dekker, Andre .
RADIOTHERAPY AND ONCOLOGY, 2016, 121 (03) :459-467
[22]   Machine learning applications in cancer prognosis and prediction [J].
Kourou, Konstantina ;
Exarchos, Themis P. ;
Exarchos, Konstantinos P. ;
Karamouzis, Michalis V. ;
Fotiadis, Dimitrios I. .
COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2015, 13 :8-17
[23]   Artificial Intelligence in Medicine: Where Are We Now? [J].
Kulkarni, Sagar ;
Seneviratne, Nuran ;
Baig, Mirza Shaheer ;
Khan, Ameer Hamid Ahmed .
ACADEMIC RADIOLOGY, 2020, 27 (01) :62-70
[24]   Privacy-preserving model learning on a blockchain network-of-networks [J].
Kuo, Tsung-Ting ;
Kim, Jihoon ;
Gabriel, Rodney A. .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2020, 27 (03) :343-354
[25]   Fair compute loads enabled by blockchain: sharing models by alternating client and server roles [J].
Kuo, Tsung-Ting ;
Gabriel, Rodney A. ;
Ohno-Machado, Lucila .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2019, 26 (05) :392-403
[26]   Blockchain distributed ledger technologies for biomedical and health care applications [J].
Kuo, Tsung-Ting ;
Kim, Hyeon-Eui ;
Ohno-Machado, Lucila .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2017, 24 (06) :1211-1220
[27]   Radiomics: the bridge between medical imaging and personalized medicine [J].
Lambin, Philippe ;
Leijenaar, Ralph T. H. ;
Deist, Timo M. ;
Peerlings, Jurgen ;
de Jong, Evelyn E. C. ;
van Timmeren, Janita ;
Sanduleanu, Sebastian ;
Larue, Ruben T. H. M. ;
Even, Aniek J. G. ;
Jochems, Arthur ;
van Wijk, Yvonka ;
Woodruff, Henry ;
van Soest, Johan ;
Lustberg, Tim ;
Roelofs, Erik ;
van Elmpt, Wouter ;
Dekker, Andre ;
Mottaghy, Felix M. ;
Wildberger, Joachim E. ;
Walsh, Sean .
NATURE REVIEWS CLINICAL ONCOLOGY, 2017, 14 (12) :749-762
[28]   'Rapid Learning health care in oncology' - An approach towards decision support systems enabling customised radiotherapy' [J].
Lambin, Philippe ;
Roelofs, Erik ;
Reymen, Bart ;
Velazquez, Emmanuel Rios ;
Buijsen, Jeroen ;
Zegers, Catharina M. L. ;
Carvalho, Sara ;
Leijenaar, Ralph T. H. ;
Nalbantov, Georgi ;
Oberije, Cary ;
Marshall, M. Scott ;
Hoebers, Frank ;
Troost, Esther G. C. ;
van Stiphout, Ruud G. P. M. ;
van Elmpt, Wouter ;
van der Weijden, Trudy ;
Boersma, Liesbeth ;
Valentini, Vincenzo ;
Dekker, Andre .
RADIOTHERAPY AND ONCOLOGY, 2013, 109 (01) :159-164
[29]   Multi-View Mammographic Density Classification by Dilated and Attention-Guided Residual Learning [J].
Li, Cheng ;
Xu, Jingxu ;
Liu, Qiegen ;
Zhou, Yongjin ;
Mou, Lisha ;
Pu, Zuhui ;
Xia, Yong ;
Zheng, Hairong ;
Wang, Shanshan .
IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2021, 18 (03) :1003-1013
[30]   Deep learning in bioinformatics: Introduction, application, and perspective in the big data era [J].
Li, Yu ;
Huang, Chao ;
Ding, Lizhong ;
Li, Zhongxiao ;
Pan, Yijie ;
Gao, Xin .
METHODS, 2019, 166 :4-21