I/O-signature-based feature analysis and classification of high-performance computing applications

被引:1
|
作者
Park, Ju-Won [1 ]
Huang, Xin [2 ]
Lee, Jae-Kook [1 ]
Hong, Taeyoung [1 ]
机构
[1] Korea Inst Sci & Technol Informat, 245 Daehak Ro, Daejeon 34141, South Korea
[2] Texas State Univ, Dept Comp Sci, San Marcos, TX 78666 USA
关键词
I/O patterns analysis; Key features; High performance computing;
D O I
10.1007/s10586-023-04139-y
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The demand for high-performance computing (HPC) resources in computing fields such as machine learning has increased significantly in recent years. Computing power has been growing exponentially to keep up with this demand. However, these gains have not been able to translate to performance improvement in real-world applications. One of the biggest reasons for this is performance degradation in terms of input/output (I/O) due to the increased storage latency and complex parallel I/O architecture of accessing data in distributed storage systems. In this study, we analyze application-specific I/O patterns to gain a deeper understanding of I/O throughput and the interaction between applications and the I/O system. Specifically, we analyze the importance of each feature of I/O patterns through feature analysis based on the collected monitoring information. We also investigate the feasibility of identifying the application based on the analyzed key features. To this end, we present the analysis accuracy and confusion matrix of four algorithms, including random forest, which are widely used as classification algorithms in the experimental results. The experiment results confirm that we can distinguish applications with an accuracy of more than 90% by using application-specific I/O patterns.
引用
收藏
页码:3219 / 3231
页数:13
相关论文
共 50 条
  • [1] Hierarchical Collective I/O Scheduling for High-Performance Computing
    Liu, Jialin
    Zhuang, Yu
    Chen, Yong
    BIG DATA RESEARCH, 2015, 2 (03) : 117 - 126
  • [2] A Checkpoint of Research on Parallel I/O for High-Performance Computing
    Boito, Francieli Zanon
    Inacio, Eduardo C.
    Bez, Jean Luca
    Navaux, Philippe O. A.
    Dantas, Mario A. R.
    Denneulin, Yves
    ACM COMPUTING SURVEYS, 2018, 51 (02)
  • [3] Efficient I/O Performance-Focused Scheduling in High-Performance Computing
    Kim, Soeun
    Kim, Sunggon
    Kim, Hwajung
    APPLIED SCIENCES-BASEL, 2024, 14 (21):
  • [4] Autotuning in High-Performance Computing Applications
    Balaprakash, Prasanna
    Dongarra, Jack
    Gamblin, Todd
    Hall, Mary
    Hollingsworth, Jeffrey K.
    Norris, Boyana
    Vuduc, Richard
    PROCEEDINGS OF THE IEEE, 2018, 106 (11) : 2068 - 2083
  • [5] COTS-Based High-Performance Computing for Space Applications
    Esposito, S.
    Albanese, C.
    Alderighi, M.
    Casini, F.
    Giganti, L.
    Esposti, M. L.
    Monteleone, C.
    Violante, M.
    IEEE TRANSACTIONS ON NUCLEAR SCIENCE, 2015, 62 (06) : 2687 - 2694
  • [6] Scalable I/O Forwarding Framework for High-Performance Computing Systems
    Ali, Nawab
    Carns, Philip
    Iskra, Kamil
    Kimpe, Dries
    Lang, Samuel
    Latham, Robert
    Ross, Robert
    Ward, Lee
    Sadayappan, P.
    2009 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING AND WORKSHOPS, 2009, : 86 - +
  • [7] Packaging Materials in High-Performance Computing Applications
    Markondeya Raj Pulugurtha
    Himani Sharma
    Raghuram Pucha
    Mohanalingam Kathaperumal
    Rao Tummala
    Journal of the Indian Institute of Science, 2022, 102 : 461 - 487
  • [8] High-Performance Computing and Engineering Applications in Electromagnetics
    Yuan, Ning
    Li, Joshua Le-Wei
    Hu, Jun
    Bhardwaj, Ashutosh
    INTERNATIONAL JOURNAL OF ANTENNAS AND PROPAGATION, 2012, 2012
  • [9] MULTIMEDIA APPLICATIONS AND HIGH-PERFORMANCE COMPUTING - INTRODUCTION
    CHOUDHARY, A
    FOSTER, I
    STEVENS, R
    IEEE PARALLEL & DISTRIBUTED TECHNOLOGY, 1995, 3 (02): : 2 - 3
  • [10] Measuring high-performance computing with real applications
    Sayeed, Mohamed
    Bae, Hansang
    Zheng, Yili
    Armstrong, Brian
    Eigenmann, Rudolf
    Saied, Faisal
    COMPUTING IN SCIENCE & ENGINEERING, 2008, 10 (04) : 60 - 70