Anomaly pattern detection for streaming data

Cited by: 19
Authors
Kim, Taegong [1]
Park, Cheong Hee [1]
Affiliation
[1] Chungnam Natl Univ, Dept Comp Sci & Engn, 220 Gung Dong, Daejeon 305763, South Korea
Funding
National Research Foundation of Singapore
Keywords
Anomaly pattern detection; Control charts; Hypothesis testing; Outlier detection; Streaming data; OUTLIER;
DOI
10.1016/j.eswa.2020.113252
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Outlier detection aims to find data samples that differ from most other data samples. While outlier detection operates at the level of individual instances, anomaly pattern detection on a data stream means detecting a time point at which the data-generating pattern is unusual and significantly different from normal behavior. Beyond predicting the outlierness of individual samples in a data stream, it can be very useful to detect the occurrence of anomalous patterns in real time. In this paper, we propose a method for anomaly pattern detection in a data stream that combines binary outlier classification with statistical tests on the resulting stream of binary labels (normal or outlier). In the first step, a clustering-based outlier detection method transforms the data stream into a stream of binary values, where 0 denotes a sample predicted as normal and 1 denotes a sample predicted as an outlier. In the second step, anomaly pattern detection is performed on the stream of binary values by two approaches: testing the equality of the parameters of the binomial distributions of a reference window and a detection window, and using control charts for the fraction defective. The proposed method achieved an average true positive detection rate of 94% in simulated experiments using real and artificial data. The experimental results also show that the occurrence of an anomaly pattern can be detected reliably even when outlier detection performance is relatively low. (C) 2020 Elsevier Ltd. All rights reserved.
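The second step described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the pooled two-proportion z-test, the 3-sigma upper control limit of a p-chart, the sliding detection window, the threshold `z_crit`, and all function names are assumptions chosen for illustration. The binary labels are taken as given, so the clustering-based outlier detector of the first step is not shown.

```python
import math


def two_proportion_z(ref, det):
    """z statistic for H0: p_ref == p_det against H1: p_det > p_ref.

    Uses the pooled-proportion standard error (an assumed, textbook choice).
    """
    n1, n2 = len(ref), len(det)
    p1, p2 = sum(ref) / n1, sum(det) / n2
    pooled = (sum(ref) + sum(det)) / (n1 + n2)
    se = math.sqrt(pooled * (1 - pooled) * (1 / n1 + 1 / n2))
    return 0.0 if se == 0 else (p2 - p1) / se


def p_chart_ucl(p_bar, n, k=3.0):
    """Upper control limit of a p-chart (fraction defective) for samples of size n."""
    return p_bar + k * math.sqrt(p_bar * (1 - p_bar) / n)


def detect_anomaly_pattern(labels, ref_size, det_size, z_crit=2.326):
    """Return the index of the first label at which an alarm is raised, or None.

    labels   : sequence of 0 (predicted normal) / 1 (predicted outlier)
    ref_size : length of the fixed reference window at the start of the stream
    det_size : length of the sliding detection window
    z_crit   : one-sided critical value (2.326 is roughly the 1% level; assumed)
    """
    ref = labels[:ref_size]
    p_ref = sum(ref) / ref_size
    ucl = p_chart_ucl(p_ref, det_size)
    for end in range(ref_size + det_size, len(labels) + 1):
        det = labels[end - det_size:end]
        p_det = sum(det) / det_size
        # Alarm if either the hypothesis test or the control chart fires.
        if two_proportion_z(ref, det) > z_crit or p_det > ucl:
            return end - 1
    return None
```

For example, with a stream whose first 160 labels contain 5% outliers and whose later labels alternate between 1 and 0, `detect_anomaly_pattern(stream, ref_size=100, det_size=40)` raises an alarm a few samples after the change, while a stream that keeps the 5% rate raises none. Combining the two criteria with a logical OR is a simplification; the abstract presents them as two separate approaches.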
Pages: 8