A time-series clustering methodology for knowledge extraction in energy consumption data

Cited by: 49
Authors
Ruiz, L. G. B. [1 ,2 ]
Pegalajar, M. C. [1 ]
Arcucci, R. [2 ]
Molina-Solana, M. [1 ,2 ]
Affiliations
[1] Univ Granada, Dept Comp Sci & Artificial Intelligence, Granada, Spain
[2] Imperial Coll London, Data Sci Inst, London, England
Keywords
Time-series clustering; Energy efficiency; Knowledge extraction; Data mining; DECISION-MAKING; NEURAL-NETWORKS; ALGORITHM; IDENTIFICATION; SEGMENTATION; MANAGEMENT; BUILDINGS; DISTANCE; SYSTEM; MODEL;
DOI
10.1016/j.eswa.2020.113731
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In the Energy Efficiency field, the incorporation of intelligent systems in cities and buildings is motivated by the energy savings and pollution reduction that can be attained. To achieve this goal, energy modelling and a better understanding of how energy is consumed are fundamental factors. As a result, this study proposes a methodology for knowledge acquisition in energy-related data through Time-Series Clustering (TSC) techniques. In our experimentation, we utilize data from the buildings at the University of Granada (Spain) and compare several clustering methods to get the optimum model. In particular, we tested k-Means, k-Medoids, Hierarchical clustering and Gaussian Mixtures, as well as several algorithms to obtain the best grouping, such as PAM, CLARA, and two variants of Lloyd's method, Small and Large. Thus, our methodology can provide non-trivial knowledge from raw energy data. In contrast to previous studies in this field, not only do we propose a clustering methodology to group time series straightforwardly, but we also present an automatic strategy to search and analyse energy periodicity in these series recursively so that we can deepen granularity and extract information at different levels of detail. The results show that k-Medoids with PAM is the best approach in virtually all cases, and the Squared Euclidean distance outperforms the rest of the metrics. (c) 2020 Elsevier Ltd. All rights reserved.
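To make the combination highlighted in the abstract more concrete, the following is a minimal sketch of k-Medoids with a PAM-style BUILD and SWAP search over a Squared Euclidean distance matrix. It is not the authors' implementation: the function names (`squared_euclidean`, `total_cost`, `pam`) and the synthetic 24-hour load profiles are assumptions made purely for illustration, and the data is invented rather than taken from the University of Granada buildings.

```python
import numpy as np

def squared_euclidean(X):
    """Pairwise squared Euclidean distances between the rows of X."""
    diff = X[:, None, :] - X[None, :, :]
    return np.sum(diff ** 2, axis=-1)

def total_cost(D, medoids):
    """Sum of distances from every series to its nearest medoid."""
    return D[:, medoids].min(axis=1).sum()

def pam(D, k):
    """Illustrative PAM: greedy BUILD, then best-improvement SWAP."""
    n = D.shape[0]
    # BUILD: start from the most central series, then greedily add
    # the series that lowers the total cost the most.
    medoids = [int(np.argmin(D.sum(axis=1)))]
    while len(medoids) < k:
        candidates = [i for i in range(n) if i not in medoids]
        costs = [total_cost(D, medoids + [c]) for c in candidates]
        medoids.append(candidates[int(np.argmin(costs))])
    # SWAP: apply the best cost-reducing medoid/non-medoid exchange
    # until no exchange improves the cost.
    best = total_cost(D, medoids)
    while True:
        best_swap = None
        for pos in range(k):
            for c in range(n):
                if c in medoids:
                    continue
                trial = medoids.copy()
                trial[pos] = c
                cost = total_cost(D, trial)
                if cost < best - 1e-12:
                    best, best_swap = cost, trial
        if best_swap is None:
            break
        medoids = best_swap
    labels = D[:, medoids].argmin(axis=1)
    return medoids, labels, best

# Synthetic daily profiles (hypothetical data): 10 "daytime" and
# 10 "night-time" consumers, 24 hourly readings each.
rng = np.random.default_rng(0)
hours = np.arange(24)
day_shape = 1.0 + 4.0 * ((hours >= 8) & (hours < 18))
night_shape = 1.0 + 4.0 * ((hours < 6) | (hours >= 20))
X = np.vstack([
    day_shape + rng.normal(0.0, 0.1, size=(10, 24)),
    night_shape + rng.normal(0.0, 0.1, size=(10, 24)),
])

D = squared_euclidean(X)
medoids, labels, cost = pam(D, k=2)
print("medoid indices:", medoids)
print("cluster labels:", labels.tolist())
```

Because each medoid is an actual observed series, a cluster's representative can be read directly as a real consumption profile, which is one common reason for preferring k-Medoids over k-Means in this kind of analysis.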
Pages: 16
Related papers
50 in total
  • [21] Time-Series Clustering Methodology for Estimating Atmospheric Phase Screen in Ground-Based InSAR Data
    Izumi, Yuta
    Nico, Giovanni
    Sato, Motoyuki
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [22] Utilizing Mixture Regression Models for Clustering Time-Series Energy Consumption of a Plastic Injection Molding Process
    Pacella, Massimo
    Mangini, Matteo
    Papadia, Gabriele
    ALGORITHMS, 2023, 16 (11)
  • [23] Clustering of unevenly sampled gene expression time-series data
    Möller-Levet, CS
    Klawonn, F
    Cho, KH
    Yin, H
    Wolkenhauer, O
    FUZZY SETS AND SYSTEMS, 2005, 152 (01) : 49 - 66
  • [24] Clustering Individuals Based on Multivariate EMA Time-Series Data
    Ntekouli, Mandani
    Spanakis, Gerasimos
    Waldorp, Lourens
    Roefs, Anne
    QUANTITATIVE PSYCHOLOGY, 2023, 422 : 161 - 171
  • [25] Incremental Clustering of Time-Series by Fuzzy Clustering
    Aghabozorgi, Saeed
    Saybani, Mahmoud Reza
    Teh, Ying Wah
    JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2012, 28 (04) : 671 - 688
  • [26] Time Series Clustering of Energy Meter Data
    Majumder, Patrali
    Richter, Marc
    Gotze, Jens
    2022 IEEE INTERNATIONAL CONFERENCE ON ENVIRONMENT AND ELECTRICAL ENGINEERING AND 2022 IEEE INDUSTRIAL AND COMMERCIAL POWER SYSTEMS EUROPE (EEEIC / I&CPS EUROPE), 2022,
  • [27] Deep Green: modelling time-series of software energy consumption
    Romansky, Stephen
    Borle, Neil C.
    Chowdhury, Shaiful
    Hindle, Abram
    Greiner, Russ
    2017 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE MAINTENANCE AND EVOLUTION (ICSME), 2017, : 273 - 283
  • [28] A Workflow Investigating the Information behind the Time-Series Energy Consumption Condition via Data Mining
    Liu, Xiaodong
    Zhang, Shuming
    Cui, Weiwen
    Zhang, Hong
    Wu, Rui
    Huang, Jie
    Li, Zhixin
    Wang, Xiaohan
    Wu, Jianing
    Yang, Junqi
    BUILDINGS, 2023, 13 (09)
  • [29] THE EFFECT OF SAMPLING ERROR ON THE TIME-SERIES BEHAVIOR OF CONSUMPTION DATA
    BELL, WR
    WILCOX, DW
    JOURNAL OF ECONOMETRICS, 1993, 55 (1-2) : 235 - 265
  • [30] Methodology of Data Popularity Forecasting in High-Energy Physics Experiments on Unbalanced and Irregular Time-series Data
    Grigorieva, M. A.
    Popova, N. N.
    Vartanov, D. A.
    Shubin, M. V.
    LOBACHEVSKII JOURNAL OF MATHEMATICS, 2024, 45 (07) : 3072 - 3084