Privacy preservation for data cubes

Cited by: 20
Authors
Sung, SY [1]
Liu, Y
Xiong, H
Ng, PA
Affiliations
[1] Natl Univ Singapore, Dept Comp Sci, Singapore 117548, Singapore
[2] Rutgers State Univ, MSIS Dept, Piscataway, NJ 08855 USA
[3] Univ Texas, Dept Comp Sci, Edinburg, TX USA
Keywords
privacy preservation; OLAP; random data distortion; range query
DOI
10.1007/s10115-004-0193-2
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
A range query computes an aggregate over all selected cells of an online analytical processing (OLAP) data cube, where the selection is specified by a range of contiguous values for each dimension. An important practical issue is how to keep the information in individual data cells confidential while still providing accurate estimates of the original aggregated values for range queries. In this paper, we propose an effective solution to this problem, called the zero-sum method. We derive theoretical formulas to analyse the performance of our method. Empirical experiments are also carried out using the Analytical Processing Benchmark (APB) dataset from the OLAP Council. Various parameters, such as the privacy factor and the accuracy factor, are considered and tested in the experiments. Our experimental results show that there is a trade-off between privacy preservation and range query accuracy, and that the zero-sum method fulfils three design goals: security, accuracy, and accessibility.
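As a rough illustration of the range query described in the abstract, the sketch below sums a contiguous block of cells in a small two-dimensional cube and then repeats the query on a distorted copy. The abstract does not spell out the zero-sum algorithm, so the distortion here is only an assumption suggested by the method's name: integer noise whose rows and columns each sum to zero. The function names (`range_sum`, `zero_sum_noise`, `distort`) and the `spread` parameter are hypothetical and do not come from the paper.

```python
import random


def range_sum(cube, row_range, col_range):
    """Sum the cells whose indices fall inside the inclusive ranges."""
    r0, r1 = row_range
    c0, c1 = col_range
    return sum(cube[r][c]
               for r in range(r0, r1 + 1)
               for c in range(c0, c1 + 1))


def zero_sum_noise(n_rows, n_cols, spread, rng):
    """Integer noise in which every row and every column sums to zero.

    Assumption for illustration only; not the paper's published procedure.
    """
    # Fill the top-left (n_rows - 1) x (n_cols - 1) block at random.
    noise = [[rng.randint(-spread, spread) for _ in range(n_cols - 1)]
             for _ in range(n_rows - 1)]
    # The last entry of each row cancels that row's sum.
    for row in noise:
        row.append(-sum(row))
    # The last row cancels every column's sum (and so sums to zero itself).
    noise.append([-sum(noise[r][c] for r in range(n_rows - 1))
                  for c in range(n_cols)])
    return noise


def distort(cube, noise):
    """Return a copy of the cube with the noise added cell by cell."""
    return [[value + delta for value, delta in zip(cube_row, noise_row)]
            for cube_row, noise_row in zip(cube, noise)]


cube = [[5, 3, 8, 1],
        [2, 7, 4, 6],
        [9, 0, 3, 2]]
rng = random.Random(0)  # fixed seed so the run is repeatable
published = distort(cube, zero_sum_noise(3, 4, spread=5, rng=rng))

# A query over a full row or column is exact on the distorted cube;
# a query over a partial range is only an estimate.
exact = range_sum(cube, (0, 1), (1, 2))
estimate = range_sum(published, (0, 1), (1, 2))
```

Under this assumed scheme individual cells are masked, while any query that spans whole rows or whole columns returns the original aggregate. Partial ranges carry an error that grows with `spread`, which mirrors the privacy versus accuracy trade-off the abstract reports.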
Pages: 38-61
Page count: 24