Input/Output APIs and Data Organization for High Performance Scientific Computing

被引:0
|
作者
Lofstead, Jay [1 ]
Zheng, Fang [1 ]
Klasky, Scott [2 ]
Schwan, Karsten [1 ]
机构
[1] Georgia Inst Technol, Coll Comp, Atlanta, GA 30332 USA
[2] Oak Ridge Natl Lab, Oak Ridge, TN 37831 USA
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Scientific Data Management has become essential to the productivity of scientists using ever larger machines and running applications that produce ever more data. There are several specific issues when running on petascale (and beyond) machines. One is the need for massively parallel data output, which in part, depends on the data formats and semantics being used. Here, the inhibition of parallelism by file system notions of strict and immediate consistency can be addressed with 'delayed data consistency' methods. Such methods can also be used to remove the runtime coordination steps required for immediate consistency from machine resources like Bluegene's separate networks for barrier calls and its dedicated IO nodes, thereby freeing them to instead, perform alternate tasks that enhance data output performance and/or richness. Second, once data is generated, it is important to be able to efficiently access it, which implies the need for rapid data characterization and indexing. This can be achieved by adding small amounts of metadata to the output process, thereby permitting scientists to quickly make informed decisions about which files to process from large-scale science runs. Third, failure probabilities increase with an increasing number of nodes, which suggests the need for organizing output data to be resilient to failures in which the output from a single or from a small number of nodes is lost or corrupted. This paper demonstrates the utility of using delayed consistency methods for the process of data output from the compute nodes of petascale machines. It also demonstrates the advantages derived from resilient data organization coupled with lightweight methods for data indexing. An implementation of these techniques is realized in ADIOS, the Adaptable IO System, and its BP intermediate file format. The implementation is designed to be compatible with existing, well-known file formats like HDF-5 and NetCDF, thereby permitting end users to exploit the rich tool chains for these formats. Initial performance evaluations of the approach exhibit substantial performance advantages over using native parallel HDF-5 in the Chimera supernova code.
引用
收藏
页码:1 / +
页数:3
相关论文
共 50 条
  • [21] High performance scientific computing by a parallel cellular environment
    DiGregorio, S
    Rongo, R
    Spataro, W
    Spezzano, G
    Talia, D
    FUTURE GENERATION COMPUTER SYSTEMS, 1997, 12 (05) : 357 - 369
  • [22] The PlayStation 3 for high-performance scientific computing
    Kurzak, Jakub
    Buttari, Alfredo
    Luszczek, Piotr
    Dongarra, Jack
    COMPUTING IN SCIENCE & ENGINEERING, 2008, 10 (03) : 84 - 87
  • [23] A component architecture for high-performance scientific computing
    Allan, Benjamin A.
    Armstrong, Robert
    Bernholdt, David E.
    Bertrand, Felipe
    Chiu, Kenneth
    Dahlgren, Tamara L.
    Damevski, Kostadin
    Elwasif, Wael R.
    Epperly, Thomas G. W.
    Govindaraju, Madhusudhan
    Katz, Daniel S.
    Kohl, James A.
    Krishnan, Manoj
    Kumfert, Gary
    Larson, J. Walter
    Lefantzi, Sophia
    Lewis, Michael J.
    Malony, Allen D.
    McInnes, Lois C.
    Nieplocha, Jarek
    Norris, Boyana
    Parker, Steven G.
    Ray, Jaideep
    Shende, Sameer
    Windus, Theresa L.
    Zhou, Shujia
    INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2006, 20 (02): : 163 - 202
  • [24] Multicore challenges and benefits for high performance scientific computing
    Nielsen, Ida M. B.
    Janssen, Curtis L.
    SCIENTIFIC PROGRAMMING, 2008, 16 (04) : 277 - 285
  • [25] High Performance Computing Data Centers
    Beaty, Donald L.
    ASHRAE JOURNAL, 2013, 55 (12) : 142 - 144
  • [26] MagmaDNN: Towards High-Performance Data Analytics and Machine Learning for Data-Driven Scientific Computing
    Nichols, Daniel
    Tomov, Nathalie-Sofia
    Betancourt, Frank
    Tomov, Stanimire
    Wong, Kwai
    Dongarra, Jack
    HIGH PERFORMANCE COMPUTING: ISC HIGH PERFORMANCE 2019 INTERNATIONAL WORKSHOPS, 2020, 11887 : 490 - 503
  • [27] High performance computing with geographical data
    Clematis, A
    Mineter, M
    Marciano, R
    PARALLEL COMPUTING, 2003, 29 (10) : 1275 - 1279
  • [28] Enabling interoperation of high performance, scientific computing applications: Modeling scientific data with the Sets & Fields (SAF) modeling system
    Miller, MC
    Reus, JF
    Matzke, RP
    Arrighi, WJ
    Schoof, LA
    Hitt, RT
    Espen, PK
    COMPUTATIONAL SCIENCE -- ICCS 2001, PROCEEDINGS PT 2, 2001, 2074 : 158 - 167
  • [29] High Performance Computing needs high performance data management
    Kleese, K
    PROCEEDINGS OF THE HIGH PERFORMANCE COMPUTING SYMPOSIUM - HPC '99, 1999, : 331 - 336
  • [30] Optimizing stream organization to improve the performance of scientific computing applications on the stream processor
    Zhang, Ying
    Li, Gen
    Yang, Xuejun
    Zeng, Kun
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, PROCEEDINGS, 2007, 4494 : 198 - +