Big Data Anonymization with Spark

Cited by: 0
Authors
Canbay, Yavuz [1]
Sagiroglu, Seref [1]
Affiliations
[1] Gazi Univ, Fac Engn, Dept Comp Engn, Ankara, Turkey
Source
2017 INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ENGINEERING (UBMK) | 2017
Keywords
big data; anonymization; privacy preserving; Hadoop; Spark; model; review; privacy
DOI
Not available
Chinese Library Classification
TP31 [Computer Software]
Discipline classification codes
081202; 0835
Abstract
Privacy is an important issue for big data that include sensitive attributes. When such data are shared or published directly, privacy breaches occur. To overcome this problem, previous studies focused on developing big data anonymization techniques in the Hadoop environment. Compared to Hadoop, Spark makes it possible to develop faster applications by keeping data in memory instead of on hard disk. Although a number of projects were developed on Hadoop, the trend is now shifting to Spark. In addition, the problem of anonymizing big data streams for real-time applications can be solved with Spark technology. In sum, Spark is the main technology that facilitates developing both faster anonymization applications and big data stream anonymization solutions. In this study, anonymization techniques, big data technologies, and privacy-preserving big data publishing were reviewed, and a big data anonymization model based on Spark was proposed for the first time. The proposed model is expected to help researchers solve big data privacy issues and to provide solutions for new-generation privacy violation problems.
Pages: 833-838
Number of pages: 6