A Checkpoint of Research on Parallel I/O for High-Performance Computing

被引:28
作者
Boito, Francieli Zanon [1 ]
Inacio, Eduardo C. [2 ]
Bez, Jean Luca [3 ]
Navaux, Philippe O. A. [3 ]
Dantas, Mario A. R. [2 ]
Denneulin, Yves
机构
[1] Univ Fed Rio Grande do Sul, Inst Informat, Inria GIANT, Minatec Campus,17 Ave Martyrs, F-38000 Grenoble, France
[2] Univ Fed Santa Catarina, Dept Informat & Stat, INE, Campus Reitor Joao,DF Lima, BR-88040900 Florianopolis, SC, Brazil
[3] Univ Fed Rio Grande do Sul, Inst Informat, Av Bento Goncalves 9500, BR-90650001 Porto Alegre, RS, Brazil
基金
欧盟地平线“2020”;
关键词
Parallel file systems; high-performance computing; storage systems; MANAGEMENT; STRATEGY; DESIGN; SYSTEM; SSD;
D O I
10.1145/3152891
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We present a comprehensive survey on parallel I/O in the high-performance computing (HPC) context. This is an important field for HPC because of the historic gap between processing power and storage latency, which causes application performance to be impaired when accessing or generating large amounts of data. As the available processing power and amount of data increase, I/O remains a central issue for the scientific community. In this survey article, we focus on a traditional I/O stack, with a POSIX parallel file system. We present background concepts everyone could benefit from. Moreover, through the comprehensive study of publications from the most important conferences and journals in a 5-year time window, we discuss the state of the art in I/O optimization approaches, access pattern extraction techniques, and performance modeling, in addition to general aspects of parallel I/O research. With this approach, we aim at identifying the general characteristics of the field and the main current and future research topics.
引用
收藏
页数:35
相关论文
共 119 条
  • [31] He Jun, 2013, P 22 INT S HIGHPERFO, P25, DOI 10.1145/2493123.2462909
  • [32] He S., 2013, CLUSTER, 2013 IEEE International Conference on, P1
  • [33] Henschel R., 2012, Proceedings of the 1st Conference of the Extreme Science and Engineering Discovery Environment: Bridging from the eXtreme to the campus and beyond
  • [34] Supporting Scalable and Adaptive Metadata Management in Ultralarge-Scale File Systems
    Hua, Yu
    Zhu, Yifeng
    Jiang, Hong
    Feng, Dan
    Tian, Lei
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2011, 22 (04) : 580 - 593
  • [35] Huaiming Song, 2011, 2011 Proceedings of 11th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid 2011), P414, DOI 10.1109/CCGrid.2011.26
  • [36] Design and Evaluation of Multiple-Level Data Staging for Blue Gene Systems
    Isaila, Florin
    Blas, Javier Garcia
    Carretero, Jesus
    Latham, Robert
    Ross, Robert
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2011, 22 (06) : 946 - 959
  • [37] Islam TZ, 2013, SCI PROGRAMMING-NETH, V21, P149, DOI [10.3233/SPR-130371, 10.1155/2013/341672, 10.5402/2012/472586]
  • [38] Jenkins J., 2012, Proceedings of the international conference on high performance computing, networking, storage and analysis (supercomputing), P1
  • [39] Triple-A: A Non-SSD Based Autonomic All-Flash Array for High Performance Storage Systems
    Jung, Myoungsoo
    Choi, Wonil
    Shalf, John
    Kandemir, Mahmut Taylan
    [J]. ACM SIGPLAN NOTICES, 2014, 49 (04) : 441 - 454
  • [40] Kandemir M., 2012, Proceedings of the 2012 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid 2012), P188, DOI 10.1109/CCGrid.2012.40