Hierarchical Collective I/O Scheduling for High-Performance Computing

被引:9
|
作者
Liu, Jialin [1 ]
Zhuang, Yu [1 ]
Chen, Yong [1 ]
机构
[1] Texas Tech Univ, Dept Comp Sci, Lubbock, TX 79409 USA
基金
美国国家科学基金会;
关键词
Collective I/O; Scheduling; High-performance computing; Big data; Data intensive computing;
D O I
10.1016/j.bdr.2015.01.007
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The non-contiguous access pattern of many scientific applications results in a large number of I/O requests, which can seriously limit the data-access performance. Collective I/O has been widely used to address this issue. However, the performance of collective I/O could be dramatically degraded in today's high-performance computing systems due to the increasing shuffle cost caused by highly concurrent data accesses. This situation tends to be even worse as many applications become more and more data intensive. Previous research has primarily focused on optimizing I/O access cost in collective I/O but largely ignored the shuffle cost involved. Previous works assume that the lowest average response time leads to the best QoS and performance, while that is not always true for collective requests when considering the additional shuffle cost. In this study, we propose a new hierarchical I/O scheduling (HIO) algorithm to address the increasing shuffle cost in collective I/O. The fundamental idea is to schedule applications' I/O requests based on a shuffle cost analysis to achieve the optimal overall performance, instead of achieving optimal I/O accesses only. The algorithm is currently evaluated with the MPICH3 and PVFS2. Both theoretical analysis and experimental tests show that the proposed hierarchical I/O scheduling has a potential in addressing the degraded performance issue of collective I/O with highly concurrent accesses. (C) 2015 Elsevier Inc. All rights reserved.
引用
收藏
页码:117 / 126
页数:10
相关论文
共 50 条
  • [31] Teaching high-performance computing on a high-performance cluster
    Bernreuther, M
    Brenk, M
    Bungartz, HJ
    Mundani, RP
    Muntean, IL
    COMPUTATIONAL SCIENCE - ICCS 2005, PT 2, 2005, 3515 : 1 - 9
  • [32] HIGH-PERFORMANCE COMPUTING MEETS HIGH-PERFORMANCE MEDICINE
    Verma, Anurag
    Huffman, Jennifer
    Torkamani, Ali
    Madduri, Ravi
    BIOCOMPUTING 2023, PSB 2023, 2023, : 541 - 545
  • [33] Early Experience with Optimizing I/O Performance Using High-Performance SSDs for In-Memory Cluster Computing
    Choi, I. Stephen
    Yang, Weiqing
    Kee, Yang-Suk
    PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2015, : 1073 - 1083
  • [34] Making Resonance a Common Case: A High-Performance Implementation of Collective I/O on Parallel File Systems
    Zhang, Xuechen
    Jiang, Song
    Davis, Kei
    2009 IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL & DISTRIBUTED PROCESSING, VOLS 1-5, 2009, : 975 - +
  • [35] Performance analysis of parallel I/O scheduling approaches on cluster computing systems
    Abawajy, JH
    CCGRID 2003: 3RD IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER COMPUTING AND THE GRID, PROCEEDINGS, 2003, : 724 - 729
  • [36] GPGPU based job scheduling simulator for hybrid high-performance computing systems
    Skrinarova, Jarmila
    Povinsky, Michal
    2015 IEEE 13TH INTERNATIONAL SCIENTIFIC CONFERENCE ON INFORMATICS, 2015, : 269 - 274
  • [37] Optimization of High-Performance Computing Job Scheduling Based on Offline Reinforcement Learning
    Li, Shihao
    Dai, Wei
    Chen, Yongyan
    Liang, Bo
    APPLIED SCIENCES-BASEL, 2024, 14 (23):
  • [38] Deep Reinforcement Learning with Different Rewards for Scheduling in High-Performance Computing Systems
    Reza, Md Farhadur
    Zhao, Bo
    2021 IEEE INTERNATIONAL MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS (MWSCAS), 2021, : 183 - 186
  • [39] Reconfiguration and Communication-Aware Task Scheduling for High-Performance Reconfigurable Computing
    Huang, Miaoqing
    Narayana, Vikram K.
    Simmler, Harald
    Serres, Olivier
    El-Ghazawi, Tarek
    ACM TRANSACTIONS ON RECONFIGURABLE TECHNOLOGY AND SYSTEMS, 2010, 3 (04)
  • [40] TRENDS IN HIGH-PERFORMANCE COMPUTING
    Kindratenko, Volodymyr
    Trancoso, Pedro
    COMPUTING IN SCIENCE & ENGINEERING, 2011, 13 (03) : 92 - 95