A time-series clustering methodology for knowledge extraction in energy consumption data

被引:49
作者
Ruiz, L. G. B. [1 ,2 ]
Pegalajar, M. C. [1 ]
Arcucci, R. [2 ]
Molina-Solana, M. [1 ,2 ]
机构
[1] Univ Granada, Dept Comp Sci & Artificial Intelligence, Granada, Spain
[2] Imperial Coll London, Data Sci Inst, London, England
关键词
Time-series clustering; Energy efficiency; Knowledge extraction; Data mining; DECISION-MAKING; NEURAL-NETWORKS; ALGORITHM; IDENTIFICATION; SEGMENTATION; MANAGEMENT; BUILDINGS; DISTANCE; SYSTEM; MODEL;
D O I
10.1016/j.eswa.2020.113731
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the Energy Efficiency field, the incorporation of intelligent systems in cities and buildings is motivated by the energy savings and pollution reduction that can be attained. To achieve this goal, energy modelling and a better understanding of how energy is consumed are fundamental factors. As a result, this study proposes a methodology for knowledge acquisition in energy-related data through Time-Series Clustering (TSC) techniques. In our experimentation, we utilize data from the buildings at the University of Granada (Spain) and compare several clustering methods to get the optimum model, in particular, we tested k-Means, k-Medoids, Hierarchical clustering and Gaussian Mixtures; as well as several algorithms to obtain the best grouping, such as PAM, CLARA, and two variants of Lloyd's method, Small and Large. Thus, our methodology can provide non-trivial knowledge from raw energy data. In contrast to previous studies in this field, not only do we propose a clustering methodology to group time series straightforwardly, but we also present an automatic strategy to search and analyse energy periodicity in these series recursively so that we can deepen granularity and extract information at different levels of detail. The results show that k-Medoids with PAM is the best approach in virtually all cases, and the Squared Euclidean distance outperforms the rest of the metrics. (c) 2020 Elsevier Ltd. All rights reserved.
引用
收藏
页数:16
相关论文
共 50 条
  • [31] Time-Series Data Mining
    Esling, Philippe
    Agon, Carlos
    ACM COMPUTING SURVEYS, 2012, 45 (01)
  • [32] Multiple gene expression profile alignment for microarray time-series data clustering
    Subhani, Numanul
    Rueda, Luis
    Ngom, Alioune
    Burden, Conrad J.
    BIOINFORMATICS, 2010, 26 (18) : 2281 - 2288
  • [33] Knowledge Extraction from Time Series of Electric Energy Demand using Temporal Data Mining
    Saraiva de Queiroz, Alynne C.
    Costa, Jose Alfredo F.
    2017 IEEE LATIN AMERICAN CONFERENCE ON COMPUTATIONAL INTELLIGENCE (LA-CCI), 2017,
  • [34] A time-series approach for clustering farms based on slaughterhouse health aberration data
    Hulsegge, B.
    de Greef, K. H.
    PREVENTIVE VETERINARY MEDICINE, 2018, 153 : 64 - 70
  • [35] A density-based time-series data analysis methodology for shadow detection in rooftop photovoltaic systems
    Tsafarakis, Odysseas
    van Sark, Wilfried G. J. H. M.
    PROGRESS IN PHOTOVOLTAICS, 2023, 31 (05): : 506 - 523
  • [36] Clustering of Spatial Data for Knowledge Extraction
    Martins, Eduardo S.
    Ribeiro, Marcos
    Lisboa-Filho, Jugurta
    Reinaldo, Francisco
    Freddo, Ademir
    Reis, Luis Paulo
    2016 11TH IBERIAN CONFERENCE ON INFORMATION SYSTEMS AND TECHNOLOGIES (CISTI), 2016,
  • [37] VARTOOLS: A program for analyzing astronomical time-series data
    Hartman, J. D.
    Bakos, G. A.
    ASTRONOMY AND COMPUTING, 2016, 17 : 1 - 72
  • [38] Dealing with Time-Series Data in Predictive Maintenance Problems
    Susto, Gian Antonio
    Beghi, Alessandro
    2016 IEEE 21ST INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGIES AND FACTORY AUTOMATION (ETFA), 2016,
  • [39] Unsupervised Time-Series Clustering Over Lab Data for Automatic Identification of Uncontrolled Diabetes
    Rusanov, Alexander
    Prado, Patric V.
    Weng, Chunhua
    2016 IEEE INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS (ICHI), 2016, : 72 - 80
  • [40] Quantification and clustering of phenotypic screening data using time-series analysis for chemotherapy of schistosomiasis
    Lee, Hyokyeong
    Moody-Davis, Asher
    Saha, Utsab
    Suzuki, Brian M.
    Asarnow, Daniel
    Chen, Steven
    Arkin, Michelle
    Caffrey, Conor R.
    Singh, Rahul
    BMC GENOMICS, 2012, 13