Privacy-preserving model learning on a blockchain network-of-networks

被引:36
作者
Kuo, Tsung-Ting [1 ]
Kim, Jihoon [1 ]
Gabriel, Rodney A. [1 ,2 ]
机构
[1] Univ Calif San Diego, UCSD Hlth Dept Biomed Informat, La Jolla, CA 92093 USA
[2] Univ Calif San Diego, Dept Anesthesiol, San Diego, CA 92103 USA
基金
美国国家卫生研究院;
关键词
blockchain distributed ledger technology; privacy-preserving predictive modeling; hierarchical network; clinical information systems; decision support systems; PATIENT PRIVACY; ONLINE; HIPAA; SNOW; MYTH;
D O I
10.1093/jamia/ocz214
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Objective: To facilitate clinical/genomic/biomedical research, constructing generalizable predictive models using cross-institutional methods while protecting privacy is imperative. However, state-of-the-art methods assume a "flattened" topology, while real-world research networks may consist of "network-of-networks" which can imply practical issues including training on small data for rare diseases/conditions, prioritizing locally trained models, and maintaining models for each level of the hierarchy. In this study, we focus on developing a hierarchical approach to inherit the benefits of the privacy-preserving methods, retain the advantages of adopting blockchain, and address practical concerns on a research network-of-networks. Materials and Methods: We propose a framework to combine level-wise model learning, blockchain-based model dissemination, and a novel hierarchical consensus algorithm for model ensemble. We developed an example implementation HierarchicalChain (hierarchical privacy-preserving modeling on blockchain), evaluated it on 3 healthcare/genomic datasets, as well as compared its predictive correctness, learning iteration, and execution time with a state-of-the-art method designed for flattened network topology. Results: HierarchicalChain improves the predictive correctness for small training datasets and provides comparable correctness results with the competing method with higher learning iteration and similar per-iteration execution time, inherits the benefits of the privacy-preserving learning and advantages of blockchain technology, and immutable records models for each level. Discussion: HierarchicalChain is independent of the core privacy-preserving learning method, as well as of the underlying blockchain platform. Further studies are warranted for various types of network topology, complex data, and privacy concerns. C Conclusion: We demonstrated the potential of utilizing the information from the hierarchical network-ofnetworks topology to improve prediction.
引用
收藏
页码:343 / 354
页数:12
相关论文
共 58 条
[1]  
[Anonymous], 2014, Handbook of Biological Statistics
[2]  
[Anonymous], 2014, Anonymous byzantine consensus from moderatelyhard puzzles: a model for bitcoin
[3]  
[Anonymous], MULTICHAIN OP PLATF
[4]  
[Anonymous], ACM SIGKDD EXPLOR NE
[5]   Commentary: Edmund Alexander Parkes, John Snow and the miasma controversy [J].
Bergman, Beverly P. .
INTERNATIONAL JOURNAL OF EPIDEMIOLOGY, 2013, 42 (06) :1562-1565
[6]  
Bissias G., 2014, Proc. the 13th Workshop on Privacy in the Electronic Society, P149
[7]  
Boyd S, 2005, IEEE INFOCOM SER, P1653
[8]   Randomized gossip algorithms [J].
Boyd, Stephen ;
Ghosh, Arpita ;
Prabhakar, Balaji ;
Shah, Devavrat .
IEEE TRANSACTIONS ON INFORMATION THEORY, 2006, 52 (06) :2508-2530
[9]   Map-making and myth-making in Broad Street: the London cholera epidemic, 1854 [J].
Brody, H ;
Rip, MR ;
Vinten-Johansen, P ;
Paneth, N ;
Rachman, S .
LANCET, 2000, 356 (9223) :64-68
[10]  
Bruce Julie, 2003, Ann Health Law, V12, P75