NoaSci: A Numerical Object Array Library for I/O of Scientific Applications on Object Storage

被引:0
|
作者
Chien, Steven W. D. [1 ]
Podobas, Artur [1 ]
Svedin, Martin [1 ]
Tkachuk, Andriy [2 ]
El Sayed, Salem [3 ]
Herman, Pawel [1 ]
Umanesan, Ganesan [2 ]
Narasimhamurthy, Sai [2 ]
Markidis, Stefano [1 ]
机构
[1] KTH Royal Inst Technol, Stockholm, Sweden
[2] Seagate Syst UK, Portsmouth, Hants, England
[3] Forschungszentrum Julich, Julich Supercomp Ctr, Julich, Germany
来源
30TH EUROMICRO INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND NETWORK-BASED PROCESSING (PDP 2022) | 2022年
基金
瑞典研究理事会; 欧盟地平线“2020”;
关键词
Object-stores; Parallel I/O for Object Stores;
D O I
10.1109/PDP55904.2022.00034
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The strong consistency and stateful workflow are seen as the major factors for limiting parallel I/O performance because of the need for locking and state management. While the POSIX-based I/O model dominates modern HPC storage infrastructure, emerging object storage technology can potentially improve I/O performance by eliminating these bottlenecks. Despite a wide deployment on the cloud, its adoption in HPC remains low. We argue one reason is the lack of a suitable programming interface for parallel I/O in scientific applications. In this work, we introduce NoaSci, a Numerical Object Array library for scientific applications. NoaSci supports different data formats (e.g. HDF5, binary), and focuses on supporting node-local burst buffers and object stores. We demonstrate for the first time how scientific applications can perform parallel I/O on Seagate's Motr object store through NoaSci. We evaluate NoaSci's preliminary performance using the iPIC3D space weather application and position against existing I/O methods.
引用
收藏
页码:172 / 176
页数:5
相关论文
共 50 条
  • [1] Porting the clean object I/O library to Haskell
    Achten, P
    Jones, SP
    IMPLEMENTATION OF FUNCTIONAL LANGUAGES, 2001, 2011 : 194 - 213
  • [2] Parallel array object I/O support on distributed environments
    Lee, JK
    Tsaur, IK
    Hwang, SY
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 1997, 40 (02) : 227 - 241
  • [3] Object oriented design for library applications
    Raggad, BG
    Belhajali, M
    LIBRARY SOFTWARE REVIEW, 1995, 14 (01): : 9 - 13
  • [4] Evaluation of HPC Application I/O on Object Storage Systems
    Liu, Jialin
    Koziol, Quincey
    Butler, Gregory F.
    Fortner, Neil
    Chaarawi, Mohamad
    Tang, Houjun
    Byna, Suren
    Lockwood, Glenn K.
    Cheema, Ravi
    Kallback-Rose, Kristy A.
    Hazen, Damian
    Prabhat
    PROCEEDINGS OF 2018 IEEE/ACM 3RD JOINT INTERNATIONAL WORKSHOP ON PARALLEL DATA STORAGE & DATA INTENSIVE SCALABLE COMPUTING SYSTEMS (PDSW-DISCS), 2018, : 24 - 34
  • [5] PARADEIS: An object library for parallel sparse array computation
    Delaplace, F
    Remy, D
    PARALLEL COMPUTATION, 1999, 1557 : 153 - 162
  • [6] Object-Oriented Design for Library Applications
    Libr Software Rev, 1 (09):
  • [7] I/O scheduling in huge object-based storage system
    Feng, Dan
    Liu, Hanbo
    FCST 2006: JAPAN-CHINA JOINT WORKSHOP ON FRONTIER OF COMPUTER SCIENCE AND TECHNOLOGY, PROCEEDINGS, 2006, : 39 - +
  • [8] Parallel I/O in an object-oriented message-passing library
    Pinkenburg, S
    Rosenstiel, W
    RECENT ADVANCES IN PARALLEL VIRTUAL MACHINE AND MESSAGE PASSING INTERFACE, PROCEEDINGS, 2004, 3241 : 251 - 258
  • [9] Provenance-Based Object Storage Prediction Scheme for Scientific Big Data Applications
    Dai, Dong
    Chen, Yong
    Kimpe, Dries
    Ross, Rob
    2014 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2014, : 271 - 280
  • [10] Persistent Memory Object Storage and Indexing for Scientific Computing
    Khan, Awais
    Sim, Hyogi
    Vazhkudai, Sudharshan S.
    Ma, Jinsuk
    Oh, Myeong-Hoon
    Kim, Youngjae
    PROCEEDINGS OF 2020 IEEE/ACM WORKSHOP ON MEMORY CENTRIC HIGH PERFORMANCE COMPUTING (MCHPC 2020), 2020, : 1 - 9