Hierarchical K-means Method for Clustering Large-Scale Advanced Metering Infrastructure Data

被引:78
作者
Xu, Tian-Shi [1 ]
Chiang, Hsiao-Dong [2 ]
Liu, Guang-Yi [3 ,4 ]
Tan, Chin-Woo [5 ]
机构
[1] Tianjin Univ, Sch Elect Engn & Automat, Tianjin 300072, Peoples R China
[2] Cornell Univ, Sch Elect & Comp Engn, Ithaca, NY 14853 USA
[3] State Grid Corp China, Beijing 100031, Peoples R China
[4] SGRI North Amer, Beijing 100031, Peoples R China
[5] Stanford Univ, Stanford Sustainabil Syst Lab, Stanford, CA 94305 USA
基金
美国国家科学基金会;
关键词
Advanced metering infrastructure (AMI); big data problems; clustering; hierarchical K-means (H-K-means); load patterns; PATTERN-RECOGNITION; LOAD PROFILES; CLASSIFICATION; CUSTOMER;
D O I
10.1109/TPWRD.2015.2479941
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Clustering of the load patterns from distribution network customers is of vital importance for several applications. However, the growing number of advanced metering infrastructures (AMI) and a variety of customer behaviors make the clustering task quite challenging due to the increasing amount of load data. K-means is a widely used clustering algorithm in processing a large dataset with acceptable computational efficiency, but it suffers from local optimal solutions. To address this issue, this paper presents a hierarchical K-means (H-K-means) method for better clustering performance for big data problems. The proposed method is applied to a large-scale AMI dataset and its effectiveness is evaluated by benchmarking with several existing clustering methods in terms of five common adequacy indices, outliers detection, and computation time.
引用
收藏
页码:609 / 616
页数:8
相关论文
共 34 条
[1]  
[Anonymous], 2001, ADAP COMP MACH LEARN
[2]  
Arthur D, 2007, PROCEEDINGS OF THE EIGHTEENTH ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, P1027
[3]  
Chang R. F., 2003, IEEE POW ENG SOC GEN
[4]  
Cheng BW, 2005, 2005 2ND IEEE INTERNATIONAL CONFERENCE ON GROUP IV PHOTONICS, P105
[5]   Comparisons among clustering techniques for electricity customer classification [J].
Chicco, G ;
Napoli, R ;
Piglione, F .
IEEE TRANSACTIONS ON POWER SYSTEMS, 2006, 21 (02) :933-940
[6]   Load pattern-based classification of electricity customers [J].
Chicco, G ;
Napoli, R ;
Piglione, F ;
Postolache, P ;
Scutariu, M ;
Toader, C .
IEEE TRANSACTIONS ON POWER SYSTEMS, 2004, 19 (02) :1232-1239
[7]   Customer characterization options for improving the tariff offer [J].
Chicco, G ;
Napoli, R ;
Postolache, P ;
Scutariu, M ;
Toader, CM .
IEEE TRANSACTIONS ON POWER SYSTEMS, 2003, 18 (01) :381-387
[8]   Renyi entropy-based classification of daily electrical load patterns [J].
Chicco, G. ;
Akilimali, J. Sumaili .
IET GENERATION TRANSMISSION & DISTRIBUTION, 2010, 4 (06) :736-745
[9]  
Chicco G., 2003, P IEEE BOL POWERTECH
[10]   Support Vector Clustering of Electrical Load Pattern Data [J].
Chicco, Gianfranco ;
Ilie, Irinel-Sorin .
IEEE TRANSACTIONS ON POWER SYSTEMS, 2009, 24 (03) :1619-1628