DeepHYDRA: Resource-Efficient Time-Series Anomaly Detection in Dynamically-Configured Systems

被引:0
|
作者
Stehle, Franz Kevin [1 ,2 ]
Vandelli, Wainer [2 ]
Avolio, Giuseppe [2 ]
Zahn, Felix [2 ]
Froening, Holger [1 ]
机构
[1] Heidelberg Univ, Heidelberg, Germany
[2] CERN, Geneva, Switzerland
关键词
TRANSFORMER;
D O I
10.1145/3650200.3656637
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Anomaly detection in distributed systems such as High-Performance Computing (HPC) clusters is vital for early fault detection, performance optimisation, security monitoring, reliability in general but also operational insights. It enables proactive measures to address issues, ensuring system reliability, resource efficiency, and protection against potential threats. Deep Neural Networks have seen successful use in detecting long-term anomalies in multidimensional data, originating for instance from industrial or medical systems, or weather prediction. A downside of such methods is that they require a static input size, or lose data through cropping, sampling, or other dimensionality reduction methods, making deployment on systems with variability on monitored data channels, such as computing clusters difficult. To address these problems, we present DeepHYDRA (Deep Hybrid DBSCAN/Reduction-Based Anomaly Detection) which combines DBSCAN and learning-based anomaly detection. DBSCAN clustering is used to find point anomalies in time-series data, mitigating the risk of missing outliers through loss of information when reducing input data to a fixed number of channels. A deep learning-based time-series anomaly detection method is then applied to the reduced data in order to identify long-term outliers. This hybrid approach reduces the chances of missing anomalies that might be made indistinguishable from normal data by the reduction process, and likewise enables the algorithm to be scalable and tolerate partial system failures while retaining its detection capabilities. Using a subset of the well-known SMD dataset family, a modified variant of the Eclipse dataset, as well as an in-house dataset with a large variability in active data channels, made publicly available with this work, we furthermore analyse computational intensity, memory footprint, and activation counts. DeepHYDRA is shown to reliably detect different types of anomalies in both large and complex datasets. At the same time, the applied reduction approach is shown to enable real-time anomaly detection of a whole computing cluster while occupying proportionally miniscule compute resources, enabling its usage on existing systems without the need for hardware changes.
引用
收藏
页码:272 / 285
页数:14
相关论文
共 50 条
  • [31] Reconstructive reservoir computing for anomaly detection in time-series signals
    Kato, Junya
    Tanaka, Gouhei
    Nakane, Ryosho
    Hirose, Akira
    IEICE NONLINEAR THEORY AND ITS APPLICATIONS, 2024, 15 (01): : 183 - 204
  • [32] A Modified DBSCAN Algorithm for Anomaly Detection in Time-series Data with
    Jain, Praphula
    Bajpai, Mani Shankar
    Pamula, Rajendra
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2022, 19 (01) : 23 - 28
  • [33] Deep Quantile Regression for Unsupervised Anomaly Detection in Time-Series
    Tambuwal A.I.
    Neagu D.
    SN Computer Science, 2021, 2 (6)
  • [34] Anomaly Detection from Multivariate Time-Series with Sparse Representation
    Takeishi, Naoya
    Yairi, Takehisa
    2014 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), 2014, : 2651 - 2656
  • [35] Generic and Scalable Framework for Automated Time-series Anomaly Detection
    Laptev, Nikolay
    Amizadeh, Saeed
    Flint, Ian
    KDD'15: PROCEEDINGS OF THE 21ST ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2015, : 1939 - 1947
  • [36] Anomaly Detection in COVID-19 Time-Series Data
    Homayouni H.
    Ray I.
    Ghosh S.
    Gondalia S.
    Kahn M.G.
    SN Computer Science, 2021, 2 (4)
  • [37] Non-Pattern-Based Anomaly Detection in Time-Series
    Tkach, Volodymyr
    Kudin, Anton
    Kebande, Victor R. R.
    Baranovskyi, Oleksii
    Kudin, Ivan
    ELECTRONICS, 2023, 12 (03)
  • [38] Time-Series Deep Learning Anomaly Detection for Particle Accelerators
    Marcato, Davide
    Bortolato, Damiano
    Martinelli, Valentina
    Savarese, Giovanni
    Susto, Gian Antonio
    IFAC PAPERSONLINE, 2023, 56 (02): : 1566 - 1571
  • [39] Contrastive time-series reconstruction method for satellite anomaly detection
    Li, Zhenyu
    Song, Yuchen
    Peng, Xiyuan
    Liu, Datong
    Yi Qi Yi Biao Xue Bao/Chinese Journal of Scientific Instrument, 2024, 45 (04): : 17 - 26
  • [40] Resource-Efficient Parametric Recovery of Linear Time-Varying Systems
    Harms, Andrew
    Bajwa, Waheed U.
    Calderbank, Robert
    2013 IEEE 5TH INTERNATIONAL WORKSHOP ON COMPUTATIONAL ADVANCES IN MULTI-SENSOR ADAPTIVE PROCESSING (CAMSAP 2013), 2013, : 200 - +