Online Self-Evolving Anomaly Detection for Reliable Cloud Computing

被引:0
|
作者
Bai, Tianyu [1 ]
Wang, Haili [1 ]
Guo, Jingda [1 ]
Ma, Xu [2 ]
Talasila, Mahendra [1 ]
Tang, Sihai [1 ]
Fu, Song [1 ]
Yang, Qing [1 ]
机构
[1] Univ North Texas, Dept Comp Sci & Engn, Denton, TX 76203 USA
[2] Northeastern Univ, Dept Elect & Comp Engn, Boston, MA 02115 USA
来源
2022 IEEE/ACM 15TH INTERNATIONAL CONFERENCE ON UTILITY AND CLOUD COMPUTING, UCC | 2022年
基金
美国国家科学基金会;
关键词
Cloud Computing; Reliability; Anomaly Detection; Online Learning;
D O I
10.1109/UCC56403.2022.00014
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Production cloud computing systems consist of hundreds to thousands of computing and storage nodes. Such a scale, combined with ever-growing system complexity, is causing a key challenge to failure and resource management for dependable cloud computing. Efficient system monitoring and failure detection are crucial for understanding emergent, cloudwide phenomena and intelligently managing cloud resources for system-level dependability assurance and application-level performance assurance. To detect failures, we need to monitor the cloud execution and collect runtime performance data. These data are usually unlabeled at runtime in real-world systems, and thus a prior failure history is not always available. In this paper, we present a self-evolving anomaly detection framework for cloud dependability assurance. Our framework does not require any prior failure history, and it self-evolves by continuously exploring newly verified anomaly records and continuously updating the anomaly detector at runtime without expensive model retraining. A distinct advantage of our framework is that cloud system operators only need to check a small number of detected anomalies (compared with thousands-millions of system/application event records) and their decisions are leveraged to update the detector. Thus, the detector evolves following the upgrade of system hardware, update of software stack, and change of user workloads. Moreover, we design two types of detectors, one for general anomaly detection and the other for type-specific anomaly detection. Leveraging self-evolution and online learning techniques, our detectors can achieve 88.94% of sensitivity and 94.60% of specificity on average, which makes them suitable for real-world deployment.
引用
收藏
页码:31 / 40
页数:10
相关论文
共 50 条
  • [1] The vision of self-evolving computing systems
    Weyns, Danny
    Back, Thomas
    Vidal, Rene
    Yao, Xin
    Belbachir, Ahmed Nabil
    JOURNAL OF INTEGRATED DESIGN & PROCESS SCIENCE, 2022, 26 (3-4) : 351 - 367
  • [2] A Self-Evolving Anomaly Detection Framework for Developing Highly Dependable Utility Clouds
    Pannu, Husanbir S.
    Liu, Jianguo
    Fu, Song
    2012 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2012, : 1605 - 1610
  • [3] Online self-evolving fuzzy controller with global learning capabilities
    Cara A.B.
    Pomares H.
    Rojas I.
    Lendek Z.
    Babuška R.
    Evolving Systems, 2010, 1 (04) : 225 - 239
  • [4] Self-adaptive cloud monitoring with online anomaly detection
    Wang, Tao
    Xu, Jiwei
    Zhang, Wenbo
    Gu, Zeyu
    Zhong, Hua
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2018, 80 : 89 - 101
  • [5] DroidEvolver: Self-Evolving Android Malware Detection System
    Xu, Ke
    Li, Yingjiu
    Deng, Robert
    Chen, Kai
    Xu, Jiayun
    2019 4TH IEEE EUROPEAN SYMPOSIUM ON SECURITY AND PRIVACY (EUROS&P), 2019, : 47 - 62
  • [6] A Self-Evolving Agent System for Power System Online Corrective Control
    Gao, Tianlu
    Zhang, Tianyun
    Si, Ruiqi
    Xu, Peidong
    Lv, Chen
    Zhang, Jun
    IEEE JOURNAL OF RADIO FREQUENCY IDENTIFICATION, 2022, 6 : 876 - 880
  • [7] Self-evolving ghost imaging
    Liu, Baolei
    Wang, Fan
    Chen, Chaohao
    Dong, Fei
    Mcgloin, David
    OPTICA, 2021, 8 (10): : 1340 - 1349
  • [8] Self-evolving Petri Nets
    Capra, Lorenzo
    Cazzola, Walter
    JOURNAL OF UNIVERSAL COMPUTER SCIENCE, 2007, 13 (13) : 2002 - 2034
  • [9] A Deep Learning Framework for Self-evolving Hierarchical Community Detection
    Ding, Daizong
    Zhang, Mi
    Wang, Hanrui
    Pan, Xudong
    Yang, Min
    He, Xiangnan
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, : 372 - 381
  • [10] Self-Evolving Software Architectures
    Spinellis, Diomidis
    IEEE SOFTWARE, 2018, 35 (03) : 4 - 7