NeuroCrypt: Machine Learning Over Encrypted Distributed Neuroimaging Data

Cited by: 0
Authors
Nipuna Senanayake
Robert Podschwadt
Daniel Takabi
Vince D. Calhoun
Sergey M. Plis
Affiliations
[1] Georgia State University
Source
Neuroinformatics | 2022 / Vol. 20
Keywords
Neuroimaging; Machine learning; Privacy; Secure multiparty computation; Logistic regression; Convolutional neural networks
DOI
Not available
Abstract
The field of neuroimaging can greatly benefit from building machine learning models to detect and predict diseases and to discover novel biomarkers, but much of the data collected at various organizations and research centers cannot be shared due to privacy or regulatory concerns (especially for clinical data or rare disorders). In addition, aggregating data across multiple large studies incurs a large amount of duplicated technical debt, and the required resources can be challenging or impossible for an individual site to build. Training on data distributed across organizations can result in models that generalize much better than models trained on data from any single organization alone. While there are approaches for decentralized sharing, these often do not provide the highest possible guarantees of sample privacy that only cryptography can provide. In addition, such approaches are often focused on probabilistic solutions. In this paper, we propose an approach that leverages the potential of datasets spread among a number of data-collecting organizations by performing joint analyses in a secure and deterministic manner in which only encrypted data is shared and manipulated. The approach is based on secure multiparty computation, which refers to cryptographic protocols that enable distributed computation of a function over distributed inputs without revealing additional information about the inputs. It enables multiple organizations to train machine learning models on their joint data and apply the trained models to encrypted data without revealing their sensitive data to the other parties. In our proposed approach, organizations (or sites) securely collaborate to build a machine learning model as if it had been trained on the aggregated data of all the organizations combined. Importantly, the approach does not require a trusted party (i.e., an aggregator); each contributing site plays an equal role in the process, and no site can learn the individual data of any other site. We demonstrate the effectiveness of the proposed approach in a range of empirical evaluations using different machine learning algorithms, including logistic regression and convolutional neural network models, on human structural and functional magnetic resonance imaging datasets.
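To make the idea of computing over distributed inputs without revealing them more concrete, the following is a minimal sketch of additive secret sharing, one common building block of secure multiparty computation. It is not the protocol used in the paper; the modulus `PRIME`, the function names `share`, `reconstruct`, and `secure_sum`, and the example values are all assumptions chosen for illustration.

```python
import secrets

# Field modulus; an assumption for this sketch, not a value from the paper.
PRIME = 2**61 - 1


def share(value, n_parties):
    """Split an integer into n additive shares that sum to value mod PRIME."""
    shares = [secrets.randbelow(PRIME) for _ in range(n_parties - 1)]
    # The last share is chosen so that all shares add up to the secret.
    shares.append((value - sum(shares)) % PRIME)
    return shares


def reconstruct(shares):
    """Recover the secret by adding all shares mod PRIME."""
    return sum(shares) % PRIME


def secure_sum(private_values):
    """Simulate sites jointly computing a sum without a trusted aggregator."""
    n = len(private_values)
    # Site i splits its private value and sends share j to site j.
    distributed = [share(v, n) for v in private_values]
    # Site j adds the shares it received; each partial sum on its own
    # is uniformly random and does not reveal any single input.
    partial_sums = [
        sum(distributed[i][j] for i in range(n)) % PRIME for j in range(n)
    ]
    # Only the combination of all partial sums reveals the total.
    return reconstruct(partial_sums)


# Hypothetical per-site values, e.g. local sample counts.
site_values = [12, 30, 7]
total = secure_sum(site_values)
```

In this simulation all sites live in one process; in a real deployment each share would travel over a network to a different site, and training a model would require secure multiplication and fixed-point encoding in addition to the secure addition shown here.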
Pages: 91-108
Page count: 17