A time-series clustering methodology for knowledge extraction in energy consumption data

被引:49
|
作者
Ruiz, L. G. B. [1 ,2 ]
Pegalajar, M. C. [1 ]
Arcucci, R. [2 ]
Molina-Solana, M. [1 ,2 ]
机构
[1] Univ Granada, Dept Comp Sci & Artificial Intelligence, Granada, Spain
[2] Imperial Coll London, Data Sci Inst, London, England
关键词
Time-series clustering; Energy efficiency; Knowledge extraction; Data mining; DECISION-MAKING; NEURAL-NETWORKS; ALGORITHM; IDENTIFICATION; SEGMENTATION; MANAGEMENT; BUILDINGS; DISTANCE; SYSTEM; MODEL;
D O I
10.1016/j.eswa.2020.113731
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the Energy Efficiency field, the incorporation of intelligent systems in cities and buildings is motivated by the energy savings and pollution reduction that can be attained. To achieve this goal, energy modelling and a better understanding of how energy is consumed are fundamental factors. As a result, this study proposes a methodology for knowledge acquisition in energy-related data through Time-Series Clustering (TSC) techniques. In our experimentation, we utilize data from the buildings at the University of Granada (Spain) and compare several clustering methods to get the optimum model, in particular, we tested k-Means, k-Medoids, Hierarchical clustering and Gaussian Mixtures; as well as several algorithms to obtain the best grouping, such as PAM, CLARA, and two variants of Lloyd's method, Small and Large. Thus, our methodology can provide non-trivial knowledge from raw energy data. In contrast to previous studies in this field, not only do we propose a clustering methodology to group time series straightforwardly, but we also present an automatic strategy to search and analyse energy periodicity in these series recursively so that we can deepen granularity and extract information at different levels of detail. The results show that k-Medoids with PAM is the best approach in virtually all cases, and the Squared Euclidean distance outperforms the rest of the metrics. (c) 2020 Elsevier Ltd. All rights reserved.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] A time-series clustering methodology for knowledge extraction in energy consumption data
    Ruiz, L.G.B.
    Pegalajar, M.C.
    Arcucci, R.
    Molina-Solana, M.
    Expert Systems with Applications, 2020, 160
  • [2] A novel pattern based clustering methodology for time-series microarray data
    Phan, Sieu
    Famili, Fazel
    Tang, Zoujian
    Pan, Youlian
    Liu, Ziying
    Ouyang, Junjun
    Lenferink, Anne
    O'Connor, Maureen Mc-Court
    INTERNATIONAL JOURNAL OF COMPUTER MATHEMATICS, 2007, 84 (05) : 585 - 597
  • [3] Clustering time-series energy data from smart meters
    Lavin, Alexander
    Klabjan, Diego
    ENERGY EFFICIENCY, 2015, 8 (04) : 681 - 689
  • [4] Clustering time-series energy data from smart meters
    Alexander Lavin
    Diego Klabjan
    Energy Efficiency, 2015, 8 : 681 - 689
  • [5] Clustering of multivariate time-series data
    Singhal, A
    Seborg, DE
    PROCEEDINGS OF THE 2002 AMERICAN CONTROL CONFERENCE, VOLS 1-6, 2002, 1-6 : 3931 - 3936
  • [6] Clustering multivariate time-series data
    Singhal, A
    Seborg, DE
    JOURNAL OF CHEMOMETRICS, 2005, 19 (08) : 427 - 438
  • [7] Time-series data clustering with load-shape preservation for identifying residential energy consumption behaviors
    Kim, Jinwoo
    Song, Kwonsik
    Lee, Gaang
    Lee, SangHyun
    ENERGY AND BUILDINGS, 2024, 311
  • [8] A methodology for index tracking based on time-series clustering
    Focardi, SM
    Fabozzi, FJ
    QUANTITATIVE FINANCE, 2004, 4 (04) : 417 - 425
  • [9] Alcohol consumption, time-series methodology and disease outcomes
    Rehm, J
    Gmel, G
    Her, M
    ADDICTION, 2000, 95 (03) : 352 - 354
  • [10] Clustering to Forecast Sparse Time-Series Data
    Jha, Abhay
    Ray, Shubhankar
    Seaman, Brian
    Dhillon, Inderjit S.
    2015 IEEE 31ST INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2015, : 1388 - 1399