Hierarchical Collective I/O Scheduling for High-Performance Computing

Cited by: 9
Authors
Liu, Jialin [1]
Zhuang, Yu [1]
Chen, Yong [1]
Affiliations
[1] Texas Tech Univ, Dept Comp Sci, Lubbock, TX 79409 USA
Funding
National Science Foundation (USA);
Keywords
Collective I/O; Scheduling; High-performance computing; Big data; Data intensive computing;
DOI
10.1016/j.bdr.2015.01.007
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The non-contiguous access pattern of many scientific applications results in a large number of I/O requests, which can seriously limit the data-access performance. Collective I/O has been widely used to address this issue. However, the performance of collective I/O could be dramatically degraded in today's high-performance computing systems due to the increasing shuffle cost caused by highly concurrent data accesses. This situation tends to be even worse as many applications become more and more data intensive. Previous research has primarily focused on optimizing I/O access cost in collective I/O but largely ignored the shuffle cost involved. Previous works assume that the lowest average response time leads to the best QoS and performance, while that is not always true for collective requests when considering the additional shuffle cost. In this study, we propose a new hierarchical I/O scheduling (HIO) algorithm to address the increasing shuffle cost in collective I/O. The fundamental idea is to schedule applications' I/O requests based on a shuffle cost analysis to achieve the optimal overall performance, instead of achieving optimal I/O accesses only. The algorithm is currently evaluated with the MPICH3 and PVFS2. Both theoretical analysis and experimental tests show that the proposed hierarchical I/O scheduling has a potential in addressing the degraded performance issue of collective I/O with highly concurrent accesses. (C) 2015 Elsevier Inc. All rights reserved.
Pages: 117 - 126
Number of pages: 10
Related papers
50 in total
  • [21] Group Based Job Scheduling to Increase the High-Performance Computing Efficiency
    Lyakhovets, D. S.
    Baranov, A. V.
    LOBACHEVSKII JOURNAL OF MATHEMATICS, 2020, 41 (12) : 2558 - 2565
  • [22] A Dynamic Job Scheduling Method for Reliable and High-Performance Volunteer Computing
    Yasuda, Shinya
    Nogami, Yasuyuki
    Fukushi, Masaru
    2015 2ND INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND SECURITY (ICISS), 2015 : 100 - 103
  • [23] Energy-Aware Scheduling for High-Performance Computing Systems: A Survey
    Kocot, Bartlomiej
    Czarnul, Pawel
    Proficz, Jerzy
    ENERGIES, 2023, 16 (02)
  • [24] Group Based Job Scheduling to Increase the High-Performance Computing Efficiency
    D. S. Lyakhovets
    A. V. Baranov
    Lobachevskii Journal of Mathematics, 2020, 41 : 2558 - 2565
  • [25] High-Performance Computing
    Bungartz, Hans-Joachim
    IT-INFORMATION TECHNOLOGY, 2013, 55 (03) : 83 - 85
  • [27] High-performance computing
    Holland, CJ
    Peterkin, RE
    COMPUTING IN SCIENCE & ENGINEERING, 2004, 6 (06) : 8 - 11
  • [28] HIGH-PERFORMANCE COMPUTING
    KOCHER, B
    COMMUNICATIONS OF THE ACM, 1990, 33 (01) : 3 - 3
  • [29] HIGH-PERFORMANCE COMPUTING
    Unknown
    I-S ANALYZER, 1991, 29 (05) : 1 - 12
  • [30] I/O-signature-based feature analysis and classification of high-performance computing applications
    Park, Ju-Won
    Huang, Xin
    Lee, Jae-Kook
    Hong, Taeyoung
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2024, 27 (03) : 3219 - 3231