Private Search Over Big Data Leveraging Distributed File System and Parallel Processing

被引:0
作者
Selcuk, Ayse [1 ]
Orencik, Cengiz [1 ]
Savas, Erkay [1 ]
机构
[1] Sabanci Univ, Fac Engn & Nat Sci, Istanbul, Turkey
来源
CLOUD COMPUTING 2015: THE SIXTH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, GRIDS, AND VIRTUALIZATION | 2015年
关键词
Cloud computing; Big Data; Keyword Search; Privacy; Hadoop; ENCRYPTION; MAPREDUCE;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this work, we identify the security and privacy problems associated with a certain Big Data application, namely secure keyword-based search over encrypted cloud data and emphasize the actual challenges and technical difficulties in the Big Data setting. More specifically, we provide definitions from which privacy requirements can be derived. In addition, we adapt an existing work on privacy-preserving keyword-based search method to the Big Data setting, in which, not only data is huge but also changing and accumulating very fast. Our proposal is scalable in the sense that it can leverage distributed file systems and parallel programming techniques such as the Hadoop Distributed File System (HDFS) and the MapReduce programming model, to work with very large data sets. We also propose a lazy idf-updating method that can efficiently handle the relevancy scores of the documents in a dynamically changing, large data set. We empirically show the efficiency and accuracy of the method through an extensive set of experiments on real data.
引用
收藏
页码:116 / 121
页数:6
相关论文
共 23 条
[1]  
Amazon Web Services, WHAT IS CLOUD COMP
[2]  
[Anonymous], 2012, ENRON EMAIL DATASET
[3]  
[Anonymous], 2011, Mining of Massive Datasets
[4]  
[Anonymous], 2003, P 19 ACM S OP SYST P, DOI [10.1145/1165389.945450, DOI 10.1145/1165389.945450]
[5]  
Cao N., 2011, IEEE INFOCOM
[6]  
Cash D, 2013, LECT NOTES COMPUT SC, V8042, P353, DOI 10.1007/978-3-642-40041-4_20
[7]  
Christopher H. S., 2008, INTRO INFORM RETRIEV
[8]  
Dean J, 2004, USENIX ASSOCIATION PROCEEDINGS OF THE SIXTH SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION (OSDE '04), P137
[9]  
DeCandia Giuseppe, 2007, Operating Systems Review, V41, P205, DOI 10.1145/1323293.1294281
[10]  
Hacigumus H., 2002, P 2002 ACM SIGMOD IN, P216, DOI DOI 10.1145/564691.564717