Clustering district heat exchange stations using smart meter consumption data

被引:28
作者
Tureczek, Alexander Martin [1 ]
Nielsen, Per Sieverts [1 ]
Madsen, Henrik [2 ]
Brun, Adam [3 ]
机构
[1] Tech Univ Denmark, Management Engn, Syst Anal, DK-2800 Lyngby, Denmark
[2] Tech Univ Denmark, Dynam Syst, Compute, DK-2800 Lyngby, Denmark
[3] Affald Varme Aarhus, Business Dev, DK-8210 Aarhus V, Denmark
关键词
Clustering; Feature extraction; Autocorrelation; Wavelet analysis; Smart meter data; Load pattern; ELECTRICITY CONSUMPTION; CLASSIFICATION; HOUSEHOLDS;
D O I
10.1016/j.enbuild.2018.10.009
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Contrary to electricity smart meter data analysis, little research regarding district heat smart meter data has been published. Previous papers on smart meter data analytics have not investigated autocorrelation in smart meter data. This paper examines district heat smart meter data from the largest district heat supplier in Denmark and autocorrelation is identified in the data. The K-Means algorithm is not able to take autocorrelation into account when clustering. We propose different data transformation methods to enable K-Means to account for this autocorrelation information in the data by using wavelet transformation and autocorrelation features. Our results show that the K-Means yield acceptable clustering results for district heat data when clustering normalized data, inclusion of autocorrelation improves the clustering. The clusters on normalized data are similar to the wavelet transformed clusters, where the autocorrelation has been accounted for. The clustering achieved with the autocorrelation transformation yields finer clusters through accounting for autocorrelation. We are not able to statistically show a difference between the transformations. All transformations result in shadowing clusters, but the autocorrelation transformation generates fewer shadow clusters and reduce the number of dimensions from 744 to 24, resulting in a dramatic reduction in K-Means runtime. (C) 2018 Elsevier B.V. All rights reserved.
引用
收藏
页码:144 / 158
页数:15
相关论文
共 42 条
[1]  
[Anonymous], 2008, The Elements of Statistical Learning
[2]  
Arthur D., 2006, Proceedings of the Twenty-Second Annual Symposium on Computational Geometry (SCG'06), P144, DOI 10.1145/1137856.1137880
[3]   Time series distance-based methods for non-intrusive load monitoring in residential buildings [J].
Basu, Kaustav ;
Debusschere, Vincent ;
Douzal-Chouakria, Ahlame ;
Bacha, Seddik .
ENERGY AND BUILDINGS, 2015, 96 :109-117
[4]  
BISHOP C. M., 2006, Pattern recognition and machine learning, DOI [DOI 10.1117/1.2819119, 10.1007/978-0-387-45528-0]
[5]   Electricity customer classification using frequency-domain load pattern data [J].
Carpaneto, E ;
Chicco, G ;
Napoli, R ;
Scutariu, M .
INTERNATIONAL JOURNAL OF ELECTRICAL POWER & ENERGY SYSTEMS, 2006, 28 (01) :13-20
[6]   Comparisons among clustering techniques for electricity customer classification [J].
Chicco, G ;
Napoli, R ;
Piglione, F .
IEEE TRANSACTIONS ON POWER SYSTEMS, 2006, 21 (02) :933-940
[7]   Load pattern-based classification of electricity customers [J].
Chicco, G ;
Napoli, R ;
Piglione, F ;
Postolache, P ;
Scutariu, M ;
Toader, C .
IEEE TRANSACTIONS ON POWER SYSTEMS, 2004, 19 (02) :1232-1239
[8]   Renyi entropy-based classification of daily electrical load patterns [J].
Chicco, G. ;
Akilimali, J. Sumaili .
IET GENERATION TRANSMISSION & DISTRIBUTION, 2010, 4 (06) :736-745
[9]  
Chui C.K., 1992, Construction, V2, P1, DOI DOI 10.1109/99.388960.TECH
[10]  
Cormen Thomas H., 2001, Introduction to Algorithms