Detecting performance anomalies in large-scale software systems using entropy

Cited by: 1
Authors
Malik, Haroon [1]
Shakshuki, Elhadi M. [2]
Affiliations
[1] Marshall Univ, Weisberg Div Comp Sci, Huntington, WV 25755 USA
[2] Acadia Univ, Jodrey Sch Comp Sci, Wolfville, NS, Canada
Keywords
Performance counters; Large-scale systems; Data center; Performance; Load test
DOI
10.1007/s00779-017-1036-y
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Large-scale software systems (LSSs) are composed of hundreds of subsystems that interact with each other in unforeseen and complex ways. The operators of these LSSs closely monitor thousands of metrics (performance counters) to quickly identify performance anomalies before a catastrophe occurs. Existing monitoring tools and methodologies have not kept pace with the rapid growth and inherent complexity of these LSSs and are therefore ineffective in helping practitioners pinpoint performance anomalies. We propose two methodologies that use an entropy measure to assist practitioners and operators of LSSs in quickly detecting both system-wide and underlying localized subsystem anomalies. Our performance tests conducted on an open-source benchmark system reveal that the proposed methodologies are robust in pinpointing anomalies, do not require any domain knowledge to operate, and avoid information overload on practitioners.
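The abstract does not spell out how the entropy measure is computed, so the following is only a minimal sketch of one common way entropy can be applied to performance counters, not the authors' two methodologies. It assumes Shannon entropy over an equal-width histogram of a single counter and flags a window whose entropy departs from a baseline window by more than a threshold. The function names, the bin count, and the threshold are all hypothetical choices made for illustration.

```python
import math


def histogram(values, lo, hi, bins):
    """Count values into `bins` equal-width buckets spanning [lo, hi]."""
    counts = [0] * bins
    width = (hi - lo) / bins
    for v in values:
        # A zero-width range means every value is identical: use bucket 0.
        idx = int((v - lo) / width) if width > 0 else 0
        counts[min(max(idx, 0), bins - 1)] += 1
    return counts


def shannon_entropy(counts):
    """Shannon entropy in bits of a histogram; empty buckets contribute 0."""
    total = sum(counts)
    return -sum((c / total) * math.log2(c / total) for c in counts if c > 0)


def is_anomalous(baseline, observed, bins=8, threshold=0.5):
    """Flag `observed` when its entropy differs from `baseline` by more
    than `threshold` bits. Bins and threshold are illustrative assumptions."""
    # Shared bucket edges keep the two histograms comparable.
    lo = min(min(baseline), min(observed))
    hi = max(max(baseline), max(observed))
    h_base = shannon_entropy(histogram(baseline, lo, hi, bins))
    h_obs = shannon_entropy(histogram(observed, lo, hi, bins))
    return abs(h_obs - h_base) > threshold


if __name__ == "__main__":
    steady_cpu = [50] * 20                       # hypothetical baseline samples
    erratic_cpu = [10, 20, 30, 40, 50, 60, 70, 80, 90]
    print(is_anomalous(steady_cpu, erratic_cpu))
    print(is_anomalous(steady_cpu, steady_cpu))
```

A single scalar per counter is what makes an entropy-style summary attractive when thousands of counters are monitored, but the choice of binning and threshold here is arbitrary and would need tuning against real load-test data.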
Pages: 1127-1137
Page count: 11
Related papers
50 in total
  • [21] Automated Testing for Large-Scale Critical Software Systems
    Liu, Zheng
    Mei, Paul
    2014 5TH IEEE INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS), 2014, : 200 - 203
  • [22] A Survey on Load Testing of Large-Scale Software Systems
    Jiang, Zhen Ming
    Hassan, Ahmed E.
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2015, 41 (11) : 1091 - 1118
  • [23] Visual software analytics for the build optimization of large-scale software systems
    Alexandru Telea
    Lucian Voinea
    Computational Statistics, 2011, 26 : 635 - 654
  • [24] Visual software analytics for the build optimization of large-scale software systems
    Telea, Alexandru
    Voinea, Lucian
    COMPUTATIONAL STATISTICS, 2011, 26 (04) : 635 - 654
  • [25] Classification of Post-Deployment Performance Diagnostic Techniques for Large-Scale Software Systems
    Malik, Haroon
    Shakshuki, Elhadi M.
    5TH INTERNATIONAL CONFERENCE ON EMERGING UBIQUITOUS SYSTEMS AND PERVASIVE NETWORKS / THE 4TH INTERNATIONAL CONFERENCE ON CURRENT AND FUTURE TRENDS OF INFORMATION AND COMMUNICATION TECHNOLOGIES IN HEALTHCARE / AFFILIATED WORKSHOPS, 2014, 37 : 244 - +
  • [26] PERFORMANCE OF SRF SYSTEMS IN LARGE-SCALE APPLICATIONS
    HOVATER, JC
    PARTICLE ACCELERATORS, 1994, 46 (1-3) : 19 - 33
  • [27] Multicast Performance of Large-Scale Antenna Systems
    Yang, Hong
    Marzetta, Thomas L.
    Ashikhmin, Alexei
    2013 IEEE 14TH WORKSHOP ON SIGNAL PROCESSING ADVANCES IN WIRELESS COMMUNICATIONS (SPAWC), 2013, : 604 - 608
  • [28] PERFORMANCE PROPERTIES OF LARGE-SCALE PARALLEL SYSTEMS
    GUPTA, A
    KUMAR, V
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 1993, 19 (03) : 234 - 244
  • [29] Performance virtualization for large-scale storage systems
    Chambliss, DD
    Alvarez, GA
    Pandey, P
    Jadav, D
    Xu, J
    Menon, R
    Lee, TP
    22ND INTERNATIONAL SYMPOSIUM ON RELIABLE DISTRIBUTED SYSTEMS, PROCEEDINGS, 2003, : 109 - 118
  • [30] SOFTWARE AS A LARGE-SCALE SYSTEM
    SAGE, AP
    LARGE SCALE SYSTEMS IN INFORMATION AND DECISION TECHNOLOGIES, 1987, 12 (03) : 185 - 188