Scalable I/O Forwarding Framework for High-Performance Computing Systems

被引:0
|
作者
Ali, Nawab [1 ]
Carns, Philip [2 ]
Iskra, Kamil [2 ]
Kimpe, Dries [3 ]
Lang, Samuel [2 ]
Latham, Robert [2 ]
Ross, Robert [2 ]
Ward, Lee [4 ]
Sadayappan, P. [1 ]
机构
[1] Ohio State Univ, Columbus, OH 43210 USA
[2] Argonne Natl Lab, Argonne, IL 60439 USA
[3] Univ Chicago, Chicago, IL 60637 USA
[4] Sandia Natl Labs, Albuquerque, NM 87185 USA
来源
2009 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING AND WORKSHOPS | 2009年
关键词
I/O forwarding; Parallel file systems; Leadership-class machines;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Current leadership-class machines suffer from a significant imbalance between their computational power and their I/O bandwidth. While Moore's law ensures that the computational power of high-performance computing systems increases with every generation, the same is not true for their I/O subsystems. The scalability challenges faced by existing parallel file systems with respect to the increasing number of clients, coupled with the minimalistic compute node kernels running on these machines, call for a new I/O paradigm to meet the requirements of data-intensive scientific applications. I/O forwarding is a technique that attempts to bridge the increasing performance and scalability gap between the compute and I/O components of leadership-class machines by shipping I/O calls from compute nodes to dedicated I/O nodes. The I/O nodes perform operations on behalf of the compute nodes and can reduce file system traffic by aggregating, rescheduling, and caching I/O requests. This paper presents an open, scalable I/O forwarding framework for high-performance computing systems. We describe an I/O protocol and API for shipping function calls from compute nodes to I/O nodes, and we present a quantitative analysis of the overhead associated with I/O forwarding.
引用
收藏
页码:86 / +
页数:3
相关论文
共 50 条
  • [1] IKAROS: A scalable I/O framework for high-performance computing systems.
    Filippidis, Christos
    Tsanakas, Panayiotis
    Cotronis, Yiannis
    JOURNAL OF SYSTEMS AND SOFTWARE, 2016, 118 : 277 - 287
  • [2] A Scalable Runtime Fault Localization Framework for High-Performance Computing Systems
    Gao, Jian
    Wei, Hongmei
    Yu, Kang
    Qing, Peng
    INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2018, 46 (04) : 749 - 761
  • [3] A Scalable Runtime Fault Localization Framework for High-Performance Computing Systems
    Jian Gao
    Hongmei Wei
    Kang Yu
    Peng Qing
    International Journal of Parallel Programming, 2018, 46 : 749 - 761
  • [4] Scalable Approach to Failure Analysis of High-Performance Computing Systems
    Shawky, Doaa
    ETRI JOURNAL, 2014, 36 (06) : 1023 - 1031
  • [5] Scalable Embedded Systems: Towards the Convergence of High-Performance and Embedded Computing
    Giorgi, Roberto
    PROCEEDINGS IEEE/IFIP 13TH INTERNATIONAL CONFERENCE ON EMBEDDED AND UBIQUITOUS COMPUTING 2015, 2015, : 148 - 153
  • [6] A scalable framework for online power modelling of high-performance computing nodes in production
    Pittino, Federico
    Beneventi, Francesco
    Bartolini, Andrea
    Benini, Luca
    PROCEEDINGS 2018 INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING & SIMULATION (HPCS), 2018, : 300 - 307
  • [7] An Extended IMS Framework With a High-Performance and Scalable Distributed Storage and Computing System
    Seraoui, Youssef
    Raouyane, Brahim
    Bellafkih, Mostafa
    2017 INTERNATIONAL SYMPOSIUM ON NETWORKS, COMPUTERS AND COMMUNICATIONS (ISNCC), 2017,
  • [8] Modeling I/O performance variability in high-performance computing systems using mixture distributions
    Xu, Li
    Wang, Yueyao
    Lux, Thomas
    Chang, Tyler
    Bernard, Jon
    Li, Bo
    Hong, Yili
    Cameron, Kirk
    Watson, Layne
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2020, 139 : 87 - 98
  • [9] Hierarchical Collective I/O Scheduling for High-Performance Computing
    Liu, Jialin
    Zhuang, Yu
    Chen, Yong
    BIG DATA RESEARCH, 2015, 2 (03) : 117 - 126
  • [10] A Checkpoint of Research on Parallel I/O for High-Performance Computing
    Boito, Francieli Zanon
    Inacio, Eduardo C.
    Bez, Jean Luca
    Navaux, Philippe O. A.
    Dantas, Mario A. R.
    Denneulin, Yves
    ACM COMPUTING SURVEYS, 2018, 51 (02)