Outlier detection in performance data of parallel applications

Cited by: 0
Authors
Benkert, Katharina [1 ]
Gabriel, Edgar [1 ]
Resch, Michael M. [2 ]
Affiliations
[1] Univ Houston, Dept Comp Sci, Parallel Software Technol Lab, Houston, TX 77204 USA
[2] High Performance Comp Ctr Stuttgart, HLRS, D-70569 Stuttgart, Germany
Source
2008 IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL & DISTRIBUTED PROCESSING, VOLS 1-8 | 2008
Keywords
outlier detection; performance analysis; adaptive communication libraries;
DOI
Not available
Chinese Library Classification (CLC) number
TP301 [Theory, Methods];
Discipline classification code
081202 ;
Abstract
When an adaptive software component is employed to select the best-performing implementation for a communication operation at runtime, the correctness of the decision taken strongly depends on detecting and removing outliers in the data used for the comparison. This automatic decision is greatly complicated by the fact that the types and quantities of outliers depend on the network interconnect and the nodes assigned to the job by the batch scheduler. This paper evaluates four different statistical methods used for handling outliers, namely a standard interquartile range method, a heuristic derived from the trimmed mean value, cluster analysis and a method using robust statistics. Using performance data from the Abstract Data and Communication Library (ADCL) we evaluate the correctness of the decisions made with each statistical approach over three fundamentally different network interconnects, namely a highly reliable InfiniBand network, a Gigabit Ethernet network having a larger variance in the performance, and a hierarchical Gigabit Ethernet network.
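As a rough illustration of the first of the four methods named in the abstract, the sketch below applies a standard interquartile range filter to timing samples before comparing two implementations. It is not the ADCL code or the paper's exact procedure: the function names `iqr_filter` and `pick_fastest`, the fence factor `k = 1.5`, the use of the mean as the comparison statistic, and the sample timings are all assumptions made for this example.

```python
import statistics


def iqr_filter(samples, k=1.5):
    """Keep only samples inside [Q1 - k*IQR, Q3 + k*IQR].

    k = 1.5 is the conventional fence factor, assumed here.
    """
    q1, _, q3 = statistics.quantiles(samples, n=4, method="inclusive")
    iqr = q3 - q1
    lower, upper = q1 - k * iqr, q3 + k * iqr
    return [x for x in samples if lower <= x <= upper]


def pick_fastest(timings):
    """Return the name whose filtered samples have the lowest mean."""
    means = {name: statistics.mean(iqr_filter(t)) for name, t in timings.items()}
    return min(means, key=means.get)


# Hypothetical execution times for two implementations of one operation.
# impl_a is faster in general but has a single large outlier.
timings = {
    "impl_a": [10.1, 10.3, 9.9, 10.2, 10.0, 10.1, 55.0],
    "impl_b": [11.0, 11.2, 10.9, 11.1, 11.0, 11.3, 11.1],
}

winner = pick_fastest(timings)
raw_winner = min(timings, key=lambda name: statistics.mean(timings[name]))
print("with IQR filtering:", winner)
print("without filtering: ", raw_winner)
```

In this made-up data set the single outlier is enough to flip the decision when the raw means are compared, which is the kind of wrong selection the abstract says outlier handling is meant to prevent.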
Pages: 2929 / +
Page count: 2