An efficient quasi-identifier index based approach for privacy preservation over incremental data sets on cloud

被引:48
作者
Zhang, Xuyun [1 ]
Liu, Chang [1 ]
Nepal, Surya [2 ]
Chen, Jinjun [1 ]
机构
[1] Univ Technol Sydney, Fac Engn & Informat Technol, Broadway, NSW 2007, Australia
[2] CSIRO, Ctr Informat & Commun Technol, N Ryde, NSW 2122, Australia
关键词
Cloud computing; Privacy preservation; Incremental data set; Anonymization; Quasi-identifier index; FULLY HOMOMORPHIC ENCRYPTION; MAPREDUCE; SEARCH;
D O I
10.1016/j.jcss.2012.11.008
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Cloud computing provides massive computation power and storage capacity which enable users to deploy applications without infrastructure investment. Many privacy-sensitive applications like health services are built on cloud for economic benefits and operational convenience. Usually, data sets in these applications are anonymized to ensure data owners' privacy, but the privacy requirements can be potentially violated when new data join over time. Most existing approaches address this problem via re-anonymizing all data sets from scratch after update or via anonymizing the new data incrementally according to the already anonymized data sets. However, privacy preservation over incremental data sets is still challenging in the context of cloud because most data sets are of huge volume and distributed across multiple storage nodes. Existing approaches suffer from poor scalability and inefficiency because they are centralized and access all data frequently when update occurs. In this paper, we propose an efficient quasi-identifier index based approach to ensure privacy preservation and achieve high data utility over incremental and distributed data sets on cloud. Quasi-identifiers, which represent the groups of anonymized data, are indexed for efficiency. An algorithm is designed to fulfil our approach accordingly. Evaluation results demonstrate that with our approach, the efficiency of privacy preservation on large-volume incremental data sets can be improved significantly over existing approaches. (C) 2012 Elsevier Inc. All rights reserved.
引用
收藏
页码:542 / 555
页数:14
相关论文
共 43 条
[1]  
Andoni A, 2006, ANN IEEE SYMP FOUND, P459
[2]  
[Anonymous], 2006, P 32 INT C VER LARG
[3]  
[Anonymous], 5 INT WORKSH PERS AC
[4]  
[Anonymous], 2005, P 2005 ACM SIGMOD IN
[5]  
[Anonymous], 2010, 2010 IEEE International Symposium on Parallel Distributed Processing (IPDPS), DOI DOI 10.1109/INFCOM.2010.5462196
[6]  
[Anonymous], IEEE ICDE INT WORKSH
[7]  
[Anonymous], 2011, P 2011 ACM SIGMOD IN
[8]   A View of Cloud Computing [J].
Armbrust, Michael ;
Fox, Armando ;
Griffith, Rean ;
Joseph, Anthony D. ;
Katz, Randy ;
Konwinski, Andy ;
Lee, Gunho ;
Patterson, David ;
Rabkin, Ariel ;
Stoica, Ion ;
Zaharia, Matei .
COMMUNICATIONS OF THE ACM, 2010, 53 (04) :50-58
[9]  
Bhatotia P., 2011, Proceedings of the 2nd ACM Symposium on Cloud Computing - SOCC '11, P1, DOI [10.1145/2038916.2038923, DOI 10.1145/2038916.2038923]
[10]   Cloud computing and emerging IT platforms: Vision, hype, and reality for delivering computing as the 5th utility [J].
Buyya, Rajkumar ;
Yeo, Chee Shin ;
Venugopal, Srikumar ;
Broberg, James ;
Brandic, Ivona .
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2009, 25 (06) :599-616