Real-time Outlier Detection over Streaming Data

被引:7
|
作者
Yu, Kangqing [1 ]
Shi, Wei [2 ]
Santoro, Nicola [1 ]
Ma, Xiangyu [2 ]
机构
[1] Carleton Univ, Sch Comp Sci, Ottawa, ON, Canada
[2] Carleton Univ, Sch Informat Technol, Ottawa, ON, Canada
关键词
outlier detections; streaming data; parallel processing; sliding-window; CUDA;
D O I
10.1109/SmartWorld-UIC-ATC-SCALCOM-IOP-SCI.2019.00063
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Designing outlier detection algorithms over streaming data involves several issues such as concept drift, temporal context, transience, uncertainty, etc. Moreover, to produce results in real-time with limited memory resources, the processing of such data must occur in an online fashion. Therefore, real time detection of outliers on streaming data faces more challenges than performing the same task on batches of data. Several methods have been proposed to detect outliers over streaming data, among which a sliding window technique is frequently used. In this technique, only a chunk of data is kept in memory at each point in time and used to build predictive models. The size of the data in memory simultaneously is referred to as the size of a sliding window. The correctness of the outlier detection results depends largely on the choice of window size. Other similar techniques exist but most of them fail to address the properties of streaming data, and thus produce results exhibiting poor accuracy. In this paper, we present an online outlier detection algorithm, that addresses the aforementioned challenges. The proposed algorithm adopts the sliding window technique, however efficiently mines in memory a statistical summary of previous observed data, which contributes to the prediction of incoming data. It further addresses the concept drift problem that exists in streaming data. We evaluated the accuracy of our algorithm on both synthetic and real-world datasets. Results show that the proposed method detects outliers over streaming data with higher accuracy than SOD GPU algorithm proposed in [9], even when concept drifts occur. The algorithm does not require a secondary memory for processing and is further accelerated using CUDA GPU.
引用
收藏
页码:125 / 132
页数:8
相关论文
共 50 条
  • [1] Research on real-time outlier detection over big data streams
    Chen, Liangchen
    Gao, Shu
    Cao, Xiufeng
    International Journal of Computers and Applications, 2020, 42 (01): : 93 - 101
  • [2] RUTOD: real-time urban traffic outlier detection on streaming trajectory
    Shi, Juntian
    Pan, Zhicheng
    Fang, Junhua
    Chao, Pingfu
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (05): : 3625 - 3637
  • [3] RUTOD: real-time urban traffic outlier detection on streaming trajectory
    Juntian Shi
    Zhicheng Pan
    Junhua Fang
    Pingfu Chao
    Neural Computing and Applications, 2023, 35 : 3625 - 3637
  • [4] Real-Time Spread Burst Detection in Data Streaming
    Wang H.
    Melissourgos D.
    Ma C.
    Chen S.
    Performance Evaluation Review, 2023, 51 (01): : 51 - 52
  • [5] Real-time Spread Burst Detection in Data Streaming
    Wang, Haibo
    Melissourgos, Dimitrios
    Ma, Chaoyi
    Chen, Shigang
    PROCEEDINGS OF THE ACM ON MEASUREMENT AND ANALYSIS OF COMPUTING SYSTEMS, 2023, 7 (02) : 1 - 31
  • [6] Unsupervised real-time anomaly detection for streaming data
    Ahmad, Subutai
    Lavin, Alexander
    Purdy, Scott
    Agha, Zuha
    NEUROCOMPUTING, 2017, 262 : 134 - 147
  • [7] Real-Time Streaming Data Delivery over Named Data Networking
    Gusev, Peter
    Wang, Zhehao
    Burke, Jeff
    Zhang, Lixia
    Yoneda, Takahiro
    Ohnishi, Ryota
    Muramoto, Eiichi
    IEICE TRANSACTIONS ON COMMUNICATIONS, 2016, E99B (05) : 974 - 991
  • [8] Real-time streaming of multichannel audio data over Internet
    Xu, AX
    Woszczyk, W
    Settel, Z
    Pennycook, B
    Rowe, R
    Galanter, P
    Bary, J
    Martin, G
    Corey, J
    Cooperstock, JR
    JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 2000, 48 (7-8): : 627 - 641
  • [9] Real-time Bayesian anomaly detection in streaming environmental data
    Hill, David J.
    Minsker, Barbara S.
    Amir, Eyal
    WATER RESOURCES RESEARCH, 2009, 45
  • [10] Real-time anomaly detection in gas sensor streaming data
    Wu, Haibo
    Shi, Shiliang
    INTERNATIONAL JOURNAL OF EMBEDDED SYSTEMS, 2021, 14 (01) : 81 - 88