Detecting performance anomalies in large-scale software systems using entropy

被引:1
|
作者
Malik, Haroon [1 ]
Shakshuki, Elhadi M. [2 ]
机构
[1] Marshall Univ, Weisberg Div Comp Sci, Huntington, WV 25755 USA
[2] Acadia Univ, Jodrey Sch Comp Sci, Wolfville, NS, Canada
关键词
Performance counters; Large-scale systems; Data center; Performance; Load test;
D O I
10.1007/s00779-017-1036-y
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Large-scale software systems (LSSs) are composed of hundreds of subsystems that interact with each other in an unforeseen and complex ways. The operators of these LSSs strictly monitor thousands of metrics (performance counters) to quickly identify performance anomalies before a catastrophe. The existing monitoring tools and methodologies have not kept in pace with the rapid growth and inherit complexity of these LSSs; hence are ineffective in assisting practitioners to effectively pinpoint performance anomalies. We propose two methodologies that use entropy measure to assist practitioners/operators of LSSs in quickly detecting both system-wide and underlying localized subsystem anomalies. Our performance tests conducted on an open-source benchmark system reveal that the proposed methodologies are robust in pinpointing anomalies, do not require any domain knowledge to operate, and avoid information overload on practitioners.
引用
收藏
页码:1127 / 1137
页数:11
相关论文
共 50 条
  • [31] LARGE-SCALE HCN HYPERFINE ANOMALIES
    CERNICHARO, J
    CASTETS, A
    DUVERT, G
    GUILLOTEAU, S
    ASTRONOMY & ASTROPHYSICS, 1984, 139 (01): : L13 - L15
  • [32] Roundtable: Research Opportunities and Challenges for Large-Scale Software Systems
    Xiao, Xusheng
    Lou, Jian-Guang
    Lu, Shan
    Shepherd, David C.
    Peng, Xin
    Wang, Qian-Xiang
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2016, 31 (05) : 851 - 860
  • [33] Roundtable: Research Opportunities and Challenges for Large-Scale Software Systems
    Xusheng Xiao
    Jian-Guang Lou
    Shan Lu
    David C. Shepherd
    Xin Peng
    Qian-Xiang Wang
    Journal of Computer Science and Technology, 2016, 31 : 851 - 860
  • [34] Architectural integration styles for large-scale enterprise software systems
    Andersson, J
    Johnson, P
    FIFTH IEEE INTERNATIONAL ENTERPRISE DISTRIBUTED OBJECT COMPUTING CONFERENCE, PROCEEDINGS, 2001, : 224 - 236
  • [35] Achieving software robustness via large-scale multiagent systems
    Huhns, MN
    Holderfield, VT
    Gutierrez, RLZ
    SOFTWARE ENGINEERING FOR LARGE-SCALE MULTI-AGENT SYSTEMS: RESEARCH ISSUES AND PRACTICAL APPLICATIONS, 2003, 2603 : 199 - 215
  • [36] New Software Engineering Requirements in Clouds and Large-Scale Systems
    Schubert, Lutz
    Jeffery, Keith
    IEEE CLOUD COMPUTING, 2015, 2 (01): : 48 - 58
  • [37] Employing multiple views to separate large-scale software systems
    Deubler, HH
    JOURNAL OF SYSTEMS AND SOFTWARE, 2001, 56 (02) : 105 - 113
  • [38] Efficient and effective exploratory testing of large-scale software systems
    Martensson, Torvald
    Stahl, Daniel
    Martini, Antonio
    Bosch, Jan
    JOURNAL OF SYSTEMS AND SOFTWARE, 2021, 174 (174)
  • [39] A Fast Clustering Algorithm for Modularization of Large-Scale Software Systems
    Teymourian, Navid
    Izadkhah, Habib
    Isazadeh, Ayaz
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2022, 48 (04) : 1451 - 1462
  • [40] On the challenges of maintaining large-scale software systems at Lockheed Martin
    Alexander, Lawrence D.
    ICSM 2006: 22nd IEEE International Conference on Software Maintenance, Proceedings, 2006, : 2 - 2