Privacy-preserving governmental data publishing: A fog-computing-based differential privacy approach

被引:36
作者
Piao, Chunhui [1 ]
Shi, Yajuan [1 ]
Yan, Jiaqi [2 ]
Zhang, Changyou [3 ]
Liu, Liping [1 ]
机构
[1] Shijiazhuang Tiedao Univ, Sch Informat Sci & Technol, Shijiazhuang, Hebei, Peoples R China
[2] Nanjing Univ, Sch Informat Management, Nanjing, Jiangsu, Peoples R China
[3] Chinese Acad Sci, Inst Software, Lab Parallel Software & Computat Sci, Beijing 100190, Peoples R China
来源
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE | 2019年 / 90卷
基金
中国国家自然科学基金;
关键词
Governmental statistical data publishing; Privacy-preserving; Fog computing; Differential privacy; MaxDiff histogram; SECURITY;
D O I
10.1016/j.future.2018.07.038
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
With the growing availability of public open data, the protection of citizens' privacy has become a vital issue for governmental data publishing. However, there are a large number of operational risks in the current government cloud platforms. When the cloud platform is attacked, most existing privacy protection models for data publishing cannot resist the attacks if the attacker has prior background knowledge. Potential attackers may gain access to the published statistical data, and identify specific individual's background information, which may cause the disclosure of citizens' private information. To address this problem, we propose a fog-computing-based differential privacy approach for privacy-preserving data publishing in this paper. We discuss the risk of citizens' privacy disclosure related to governmental data publishing, and present a differential privacy framework for publishing governmental statistical data based on fog computing. Based on the framework, a data publishing algorithm using a MaxDiff histogram is developed, which can be used to realize the function of preserving user privacy based on fog computing. Applying the differential method, Laplace noises are added to the original data set, which prevents citizens' privacy from disclosure even if attackers get strong background knowledge. According to the maximum frequency difference, the adjacent data bins are grouped, then the differential privacy histogram with minimum average error can be constructed. We evaluate the proposed approach by computational experiments based on the real data set of Philippine families' income and expenditures provided by Kaggle. It shows that the proposed data publishing approach can not only effectively protect citizens' privacy, but also reduce the query sensitivity and improve the utility of the data published. (C) 2018 Published by Elsevier B.V.
引用
收藏
页码:158 / 174
页数:17
相关论文
共 39 条
[1]   The impact of security and its antecedents in behaviour intention of using e-government services [J].
Alharbi, Nawaf ;
Papadaki, Maria ;
Dowland, Paul .
BEHAVIOUR & INFORMATION TECHNOLOGY, 2017, 36 (06) :620-636
[2]  
Ali O., 2015, INVESTIGATION MAIN F, V21, P72
[3]  
[Anonymous], CORR
[4]  
Bi Jianxin, 2015, E GOVT, P56
[5]  
Blum A., 2013, J ACM, P60
[6]  
Cao Z.F., 2016, COMPUTER RES DEV, V53, P2137
[7]  
Chi Yaping, 2016, COMPUT APPL, V36, P402
[8]  
Cynthia Dwork, 2006, THEOR CRYPT C
[9]  
Dwork C., 2010, Innovations in Computer Science (ICS), P66
[10]   Differential privacy: A survey of results [J].
Dwork, Cynthia .
THEORY AND APPLICATIONS OF MODELS OF COMPUTATION, PROCEEDINGS, 2008, 4978 :1-19