A Differentially Private Big Data Nonparametric Bayesian Clustering Algorithm in Smart Grid

被引:16
作者
Guan, Zhitao [1 ]
Lv, Zefang [1 ]
Sun, Xianwen [1 ]
Wu, Longfei [2 ]
Wu, Jun [3 ]
Du, Xiaojiang [4 ]
Guizani, Mohsen [5 ]
机构
[1] North China Elect Power Univ, Sch Control & Comp Engn, Beijing 102206, Peoples R China
[2] Fayetteville State Univ, Dept Math & Comp Sci, Fayetteville, NC 28301 USA
[3] Shanghai Jiao Tong Univ, Shanghai 200240, Peoples R China
[4] Temple Univ, Dept Comp & Informat Sci, Philadelphia, PA 19122 USA
[5] Qatar Univ, Dept Comp Sci & Engn, Doha 2713, Qatar
来源
IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING | 2020年 / 7卷 / 04期
基金
北京市自然科学基金; 中国国家自然科学基金;
关键词
Smart grids; Clustering algorithms; Bayes methods; Privacy; Data analysis; Differential privacy; Nonparametric Bayesian Method; Clustering; Big data; Smart grid; EFFICIENT;
D O I
10.1109/TNSE.2020.2985096
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Smart systems, including smart grid (SG) and Internet of Things (IoT), have been playing a critical role in addressing contemporary issues. Taking full advantage of the big data generated by the smart grid can enhance the system stability and reliability, increase asset utilization, and offer better customer experience. To better support the data-driven smart grid, the machine learning technologies such as cluster analysis can be applied to process the massive data generated in smart grid. However, the process of cluster analysis may cause the disclosure of personal private information. In this paper, to achieve privacy-preserving cluster analysis in smart grid, we propose IDPC, a Differentially Private Clustering algorithm based on the Infinite Gaussian mixture model (IGMM). IDPC uses a combination of nonparametric Bayesian method and differential privacy. The nonparametric Bayesian method allows certain parameters to change along with the data and it is usually adopted in a clustering algorithm without a fixed number of clusters. The Laplace mechanism is used in data releasing process to make IDPC differentially private. We present how to make the nonparametric Bayesian clustering algorithm differentially private by adding Laplace noise. By security analysis and performance evaluation, IDPC is proved to be privacy-preserving as well as efficient.
引用
收藏
页码:2631 / 2641
页数:11
相关论文
共 40 条
[1]   Smart Electricity Meter Data Intelligence for Future Energy Systems: A Survey [J].
Alahakoon, Damminda ;
Yu, Xinghuo .
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2016, 12 (01) :425-436
[2]  
Bilal A, 2016, INT WIREL COMMUN, P541, DOI 10.1109/IWCMC.2016.7577115
[3]   Big Data Analytics for Dynamic Energy Management in Smart Grids [J].
Diamantoulakis, Panagiotis D. ;
Kapinas, Vasileios M. ;
Karagiannidis, George K. .
BIG DATA RESEARCH, 2015, 2 (03) :94-101
[4]  
Du X., 2001, 2001 IEEE/IFIP International Symposium on Integrated Network Management Proceedings. Integrated Network Management VII. Integrated Management Strategies for the New Millennium (Cat. No.01EX470), P453, DOI 10.1109/INM.2001.918059
[5]   A routing-driven key management scheme for Heterogeneous Sensor Networks [J].
Du, Xiaojiang ;
Xiao, Yang ;
Ci, Song ;
Guizani, Mohsen ;
Chen, Hsiao-Hwa .
2007 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, VOLS 1-14, 2007, :3407-+
[6]   Defending DoS attacks on broadcast authentication in wireless sensor networks [J].
Du, Xiaojiang ;
Xiao, Yang ;
Guizani, Mohsen ;
Chen, Hsiao-Hwa .
2008 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, PROCEEDINGS, VOLS 1-13, 2008, :1653-+
[7]  
Dwork C, 2006, LECT NOTES COMPUT SC, V4052, P1
[8]   Calibrating noise to sensitivity in private data analysis [J].
Dwork, Cynthia ;
McSherry, Frank ;
Nissim, Kobbi ;
Smith, Adam .
THEORY OF CRYPTOGRAPHY, PROCEEDINGS, 2006, 3876 :265-284
[9]   A Firm Foundation for Private Data Analysis [J].
Dwork, Cynthia .
COMMUNICATIONS OF THE ACM, 2011, 54 (01) :86-95
[10]   The Path of the Smart Grid [J].
Farhangi, Hassan .
IEEE POWER & ENERGY MAGAZINE, 2010, 8 (01) :18-28