I/O Bottleneck Detection and Tuning: Connecting the Dots using Interactive Log Analysis

Cited by: 11
Authors
Bez, Jean Luca [1]
Tang, Houjun [1]
Xie, Bing [2]
Williams-Young, David [1]
Latham, Rob [3]
Ross, Rob [3]
Oral, Sarp [2]
Byna, Suren [1]
Affiliations
[1] Lawrence Berkeley Natl Lab, Berkeley, CA 94720 USA
[2] Oak Ridge Natl Lab, Oak Ridge, TN USA
[3] Argonne Natl Lab, Lemont, IL USA
Source
PROCEEDINGS OF IEEE/ACM SIXTH INTERNATIONAL PARALLEL DATA SYSTEMS WORKSHOP (PDSW 2021) | 2021
DOI
10.1109/PDSW54622.2021.00008
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Using parallel file systems efficiently is a tricky problem due to inter-dependencies among multiple layers of I/O software, including high-level I/O libraries (HDF5, netCDF, etc.), MPI-IO, POSIX, and file systems (GPFS, Lustre, etc.). Profiling tools such as Darshan collect traces to help understand the I/O performance behavior. However, there are significant gaps in analyzing the collected traces and then applying tuning options offered by various layers of I/O software. Seeking to connect the dots between I/O bottleneck detection and tuning, we propose DXT Explorer, an interactive log analysis tool. In this paper, we present a case study using our interactive log analysis tool to identify and apply various I/O optimizations. We report an evaluation of performance improvement achieved for four I/O kernels extracted from science applications.
Pages: 15-22
Number of pages: 8