Differentially Private K-Means Clustering Applied to Meter Data Analysis and Synthesis

Cited by: 11
Authors
Ravi, Nikhil [1]
Scaglione, Anna [1]
Kadam, Sachin [2, 3]
Gentz, Reinhard [4, 5]
Peisert, Sean [4]
Lunghino, Brent [6]
Levijarvi, Emmanuel [7]
Shumavon, Aram [8]
Affiliations
[1] Cornell Tech, Dept Elect & Comp Engn, New York, NY 10044 USA
[2] Arizona State Univ, Sch Elect Comp & Energy Engn, Tempe, AZ 85281 USA
[3] Sungkyunkwan Univ, Suwon 16419, Gyeonggi, South Korea
[4] Lawrence Berkeley Natl Lab, Computat Res, Berkeley, CA 94720 USA
[5] Amazon, Networking Dept, Seattle, WA 98170 USA
[6] Kevala Inc, Data Sci & Methodol Implementat, San Francisco, CA 94133 USA
[7] Kevala Inc, Software Engn Dept, San Francisco, CA 94133 USA
[8] Kevala Inc, San Francisco, CA 94133 USA
Keywords
Differential privacy; clustering; smart grids; summary statistics; synthetic load generation; NOISE
DOI
10.1109/TSG.2022.3184252
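Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Subject classification code
0808; 0809
Abstract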
The proliferation of smart meters has resulted in a large amount of data being generated. It is increasingly apparent that methods are required to allow a variety of stakeholders to leverage the data in a manner that preserves the privacy of consumers. The sector is scrambling to define policies, such as the so-called '15/15 rule', to respond to this need. However, the current policies fail to adequately guarantee privacy. In this paper, we address the problem of allowing third parties to apply K-means clustering, obtaining customer labels and centroids for a set of load time series, by applying the framework of differential privacy. We leverage the method to design an algorithm that generates differentially private synthetic load data consistent with the labeled data. We test our algorithm's utility by answering summary-statistics queries, such as average daily load profiles, for a 2-dimensional synthetic dataset and a real-world power load dataset.
Pages: 4801-4814
Page count: 14