Outlier detection in performance data of parallel applications

Authors
Benkert, Katharina [1]
Gabriel, Edgar [1]
Resch, Michael M. [2]
Affiliations
[1] Univ Houston, Dept Comp Sci, Parallel Software Technol Lab, Houston, TX 77204 USA
[2] High Performance Comp Ctr Stuttgart, HLRS, D-70569 Stuttgart, Germany
Source
2008 IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL & DISTRIBUTED PROCESSING, VOLS 1-8 | 2008
Keywords
outlier detection; performance analysis; adaptive communication libraries
DOI
Not available
Chinese Library Classification
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
When an adaptive software component is employed to select the best-performing implementation of a communication operation at runtime, the correctness of its decision strongly depends on detecting and removing outliers in the data used for the comparison. This automatic decision is greatly complicated by the fact that the types and quantities of outliers depend on the network interconnect and on the nodes assigned to the job by the batch scheduler. This paper evaluates four statistical methods for handling outliers: a standard interquartile range method, a heuristic derived from the trimmed mean value, cluster analysis, and a method using robust statistics. Using performance data from the Abstract Data and Communication Library (ADCL), we evaluate the correctness of the decisions made with each statistical approach over three fundamentally different network interconnects: a highly reliable InfiniBand network, a Gigabit Ethernet network with larger performance variance, and a hierarchical Gigabit Ethernet network.
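As an illustration of the first method named in the abstract, the following is a minimal sketch of a standard interquartile range filter applied before comparing implementations. It is not the paper's or ADCL's code: the function names `iqr_filter` and `pick_fastest`, the fence factor `k = 1.5`, the use of the mean of the remaining samples, and the sample timings are all assumptions made for this example.

```python
from statistics import mean, quantiles


def iqr_filter(samples, k=1.5):
    """Keep only samples inside [Q1 - k*IQR, Q3 + k*IQR].

    k = 1.5 is the conventional fence factor (an assumption here,
    not a value taken from the paper).
    """
    q1, _, q3 = quantiles(samples, n=4, method="inclusive")
    iqr = q3 - q1
    lo, hi = q1 - k * iqr, q3 + k * iqr
    return [x for x in samples if lo <= x <= hi]


def pick_fastest(timings, k=1.5):
    """Pick the implementation with the lowest mean time after filtering.

    timings: dict mapping a (hypothetical) implementation name to a
    list of measured execution times.
    """
    cleaned = {name: mean(iqr_filter(t, k)) for name, t in timings.items()}
    return min(cleaned, key=cleaned.get)


# Made-up measurements: impl_a is faster but has one extreme outlier.
timings = {
    "impl_a": [1.0, 1.1, 0.9, 1.0, 1.05, 9.0],
    "impl_b": [1.2, 1.2, 1.3, 1.25, 1.2, 1.22],
}
print(pick_fastest(timings))
```

With the raw means, the single 9.0 sample would make `impl_a` look slower than `impl_b`; after filtering, `impl_a` is selected. This shows why the decision depends on outlier handling, though the other three methods in the abstract may treat the same data differently.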
Pages: 2929 / +
Page count: 2