Outlier detection in performance data of parallel applications

被引:0
|
作者
Benkert, Katharina [1 ]
Gabriel, Edgar [1 ]
Resch, Michael M. [2 ]
机构
[1] Univ Houston, Dept Comp Sci, Parallel Software Technol Lab, Houston, TX 77204 USA
[2] High Performance Comp Ctr Stuttgart, HLRS, D-70569 Stuttgart, Germany
来源
2008 IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL & DISTRIBUTED PROCESSING, VOLS 1-8 | 2008年
关键词
outlier detection; performance analysis; adaptive communication libraries;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
When an adaptive software component is employed to select the best-performing implementation for a communication operation at runtime, the correctness of the decision taken strongly depends on detecting and removing outliers in the data used for the comparison. This automatic decision is greatly complicated by the fact that the types and quantities of outliers depend on the network interconnect and the nodes assigned to the job by the batch scheduler. This paper evaluates four different statistical methods used for handling outliers, namely a standard interquartile range method, a heuristic derived from the trimmed mean value, cluster analysis and a method using robust statistics. Using performance data from the Abstract Data and Communication Library (ADCL) we evaluate the correctness of the decisions made with each statistical approach over three fundamentally different network interconnects, namely a highly reliable InfiniBand network, a Gigabit Ethernet network having a larger variance in the performance, and a hierarchical Gigabit Ethernet network.
引用
收藏
页码:2929 / +
页数:2
相关论文
共 50 条
  • [41] In Situ Visualization of Performance-Related Data in Parallel CFD Applications
    Alves, Rigel F. C.
    Knuepfer, Andreas
    EURO-PAR 2019: PARALLEL PROCESSING WORKSHOPS, 2020, 11997 : 400 - 412
  • [42] Further enhancing the in situ visualization of performance data in parallel CFD applications
    Alves, Rigel F. C.
    Knuepfer, Andreas
    PEERJ COMPUTER SCIENCE, 2021, 7
  • [43] A grouped spatial-temporal model for PM2.5 data and its applications on outlier detection
    Guo, Baishan
    Wang, Lei
    Pan, Rui
    Zhu, Xuening
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2024, 53 (05) : 2565 - 2577
  • [44] Outlier detection for multinomial data with a large number of categories
    Yang, Xiaona
    Wang, Zhaojun
    Zi, Xuemin
    RANDOM MATRICES-THEORY AND APPLICATIONS, 2020, 9 (03)
  • [45] Outlier Detection and Elimination in Stream Data - An Experimental Approach
    Kalisch, Mateusz
    Michalak, Marcin
    Przystalka, Piotr
    Sikora, Marek
    Wrobel, Lukasz
    ROUGH SETS, (IJCRS 2016), 2016, 9920 : 416 - 426
  • [46] Outlier detection in satellite data using spatial coherence
    Alvera-Azcarate, A.
    Sirjacobs, D.
    Barth, A.
    Beckers, J. -M.
    REMOTE SENSING OF ENVIRONMENT, 2012, 119 : 84 - 91
  • [47] SVDD-based outlier detection on uncertain data
    Bo Liu
    Yanshan Xiao
    Longbing Cao
    Zhifeng Hao
    Feiqi Deng
    Knowledge and Information Systems, 2013, 34 : 597 - 618
  • [48] Minimum distance method for directional data and outlier detection
    Sau, Mercedes Fernandez
    Rodriguez, Daniela
    ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2018, 12 (03) : 587 - 603
  • [49] A Survey for Different Approaches of Outlier Detection in Data Mining
    Chandarana, Dhaval R.
    Dhamecha, Maulik V.
    2015 INTERNATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS, SIGNALS, COMMUNICATION AND OPTIMIZATION (EESCO), 2015,
  • [50] Density kernel depth for outlier detection in functional data
    Hernandez, Nicolas
    Munoz, Alberto
    Martos, Gabriel
    INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS, 2023, 16 (04) : 481 - 488