Differentially Private K-Means Clustering Applied to Meter Data Analysis and Synthesis

被引:11
作者
Ravi, Nikhil [1 ]
Scaglione, Anna [1 ]
Kadam, Sachin [2 ,3 ]
Gentz, Reinhard [4 ,5 ]
Peisert, Sean [4 ]
Lunghino, Brent [6 ]
Levijarvi, Emmanuel [7 ]
Shumavon, Aram [8 ]
机构
[1] Cornell Tech, Dept Elect & Comp Engn, New York, NY 10044 USA
[2] Arizona State Univ, Sch Elect Comp & Energy Engn, Tempe, AZ 85281 USA
[3] Sungkyunkwan Univ, Suwon 16419, Gyeonggi, South Korea
[4] Lawrence Berkeley Natl Lab, Computat Res, Berkeley, CA 94720 USA
[5] Amazon, Networking Dept, Seattle, WA 98170 USA
[6] Kevala Inc, Data Sci & Methodol Implementat, San Francisco, CA 94133 USA
[7] Kevala Inc, Software Engn Dept, San Francisco, CA 94133 USA
[8] Kevala Inc, San Francisco, CA 94133 USA
关键词
Differential privacy; clustering; smart grids; summary statistics; synthetic load generation; NOISE;
D O I
10.1109/TSG.2022.3184252
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The proliferation of smart meters has resulted in a large amount of data being generated. It is increasingly apparent that methods are required for allowing a variety of stakeholders to leverage the data in a manner that preserves the privacy of the consumers. The sector is scrambling to define policies, such as the so called '15/15 rule', to respond to the need. However, the current policies fail to adequately guarantee privacy. In this paper, we address the problem of allowing third parties to apply K-means clustering, obtaining customer labels and centroids for a set of load time series by applying the framework of differential privacy. We leverage the method to design an algorithm that generates differentially private synthetic load data consistent with the labeled data. We test our algorithm's utility by answering summary statistics such as average daily load profiles for a 2-dimensional synthetic dataset and a real-world power load dataset.
引用
收藏
页码:4801 / 4814
页数:14
相关论文
共 50 条
  • [31] Deep k-Means: Jointly clustering with k-Means and learning representations
    Fard, Maziar Moradi
    Thonet, Thibaut
    Gaussier, Eric
    PATTERN RECOGNITION LETTERS, 2020, 138 : 185 - 192
  • [32] k-POD: A Method for k-Means Clustering of Missing Data
    Chi, Jocelyn T.
    Chi, Eric C.
    Baraniuk, Richard G.
    AMERICAN STATISTICIAN, 2016, 70 (01) : 91 - 99
  • [33] K-means clustering of electricity consumers using time-domain features from smart meter data
    George Emeka Okereke
    Mohamed Chaker Bali
    Chisom Nneoma Okwueze
    Emmanuel Chukwudi Ukekwe
    Stephenson Chukwukanedu Echezona
    Celestine Ikechukwu Ugwu
    Journal of Electrical Systems and Information Technology, 10 (1)
  • [34] A Survey on Various K-Means algorithms for Clustering
    Singh, Malwinder
    Bansal, Meenakshi
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2015, 15 (06): : 60 - 65
  • [35] On the Added Value of Bootstrap Analysis for K-Means Clustering
    Hofmans, Joeri
    Ceulemans, Eva
    Steinley, Douglas
    Van Mechelen, Iven
    JOURNAL OF CLASSIFICATION, 2015, 32 (02) : 268 - 284
  • [36] On the Added Value of Bootstrap Analysis for K-Means Clustering
    Joeri Hofmans
    Eva Ceulemans
    Douglas Steinley
    Iven Van Mechelen
    Journal of Classification, 2015, 32 : 268 - 284
  • [37] Analysis of K-means clustering for Human Capital Trends
    Sharma, Gamini
    Sharma, Manish Kumar
    Sharma, Dakshata
    PROCEEDINGS OF 2016 INTERNATIONAL CONFERENCE ON ICT IN BUSINESS INDUSTRY & GOVERNMENT (ICTBIG), 2016,
  • [38] Analysis and Study of Incremental K-Means Clustering Algorithm
    Chakraborty, Sanjay
    Nagwani, N. K.
    HIGH PERFORMANCE ARCHITECTURE AND GRID COMPUTING, 2011, 169 : 338 - 341
  • [39] Smart Meter Data Analytics based on Modified Streaming k-Means
    Zhu, Wendong
    Yu, Weiqing
    Kan, Bowen
    Liu, Guangyi
    2017 3RD INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING AND COMMUNICATIONS (BIGCOM), 2017, : 328 - 333
  • [40] Diffusing-CRN k-means: an improved k-means clustering algorithm applied in cognitive radio ad hoc networks
    Benmammar, Badr
    Taleb, Mohammed Housseyn
    Krief, Francine
    WIRELESS NETWORKS, 2017, 23 (06) : 1849 - 1861