A distributed decision support algorithm that preserves personal privacy

Cited by: 4
Authors
Mathew, George [1]
Obradovic, Zoran [1]
Affiliations
[1] Temple Univ, Ctr Data Analyt & Biomed Informat, Philadelphia, PA 19122 USA
Funding
National Science Foundation (USA);
Keywords
Data privacy; Privacy-preserving framework; Distributed decision support systems; TREES;
DOI
10.1007/s10844-014-0331-6
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Assuring confidentiality of personal information and preserving privacy are vital when data is harvested from multiple institutions for business decision-making. We present an algorithm that builds knowledge using statistics based on subject data, drawn from distributed sites, that satisfy specified selection criteria. The algorithm maintains complete fidelity of information structures in the distributed data compared to the centralized equivalent. Heterogeneous data schemas across sites can be accommodated, and thresholds can be set for the global minimum saturation that attributes must reach to participate in building the prediction model. Policies for inclusion and exclusion of non-exhaustive attributes among sites are introduced. Unification of attributes is introduced for homogenizing attribute values globally. Results of experiments using data from medical, higher education, and social domains elucidate the value of our algorithm in regulated industries, where shipping raw data outside the parent institution is not practical.
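The saturation threshold mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' algorithm: it assumes that "saturation" means the fraction of all subjects, across sites, with a non-missing value for an attribute, and that each site shares only aggregate counts. The names `site_summary` and `eligible_attributes`, and the sample records, are hypothetical.

```python
from collections import Counter


def site_summary(records):
    """Summarize one site as aggregate counts; raw records stay local.

    Assumption: a value of None marks a missing attribute value.
    """
    filled = Counter()
    for rec in records:
        for attr, value in rec.items():
            if value is not None:
                filled[attr] += 1
    return {"n": len(records), "filled": filled}


def eligible_attributes(summaries, min_saturation):
    """Return attributes whose global saturation meets the threshold.

    Saturation is assumed here to be (non-missing count) / (total subjects),
    computed from the per-site summaries only.
    """
    total = sum(s["n"] for s in summaries)
    if total == 0:
        return []
    filled = Counter()
    for s in summaries:
        filled.update(s["filled"])
    return sorted(a for a, c in filled.items() if c / total >= min_saturation)


# Two hypothetical sites with different schemas.
site_a = [{"age": 34, "bmi": 22.1}, {"age": 51, "bmi": None}]
site_b = [{"age": 47, "smoker": True}, {"age": 29, "smoker": False}]

summaries = [site_summary(site_a), site_summary(site_b)]
print(eligible_attributes(summaries, 0.5))  # ['age', 'smoker']
```

Because the coordinator sums counts rather than pooling records, the saturation it computes equals what a centralized copy of the data would give, which is the kind of fidelity the abstract describes, under the assumptions stated above.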
Pages: 107-132
Number of pages: 26