A flexible I/O arbitration framework for netCDF-based big data processing workflows on high-end supercomputers

Cited by: 10
Authors
Liao, Jianwei [1, 2]
Gerofi, Balazs [3]
Lien, Guo-Yuan [3]
Miyoshi, Takemasa [3]
Nishizawa, Seiya [3]
Tomita, Hirofumi [3]
Liao, Wei-Keng [4]
Choudhary, Alok [4]
Ishikawa, Yutaka [3]
Affiliations
[1] Southwest Univ China, Coll Comp & Informat Sci, Tianshen Rd 2, Chongqing, Peoples R China
[2] Nanjing Univ, State Key Lab Novel Software Technol, Nanjing, Jiangsu, Peoples R China
[3] RIKEN Adv Inst Computat Sci, Kobe, Hyogo, Japan
[4] Northwestern Univ, Dept Elect Engn & Comp Sci, Evanston, IL USA
Funding
National Natural Science Foundation of China
Keywords
asynchronous transfer; big data processing; customizability; netCDF; parallel direct data transfer; real time; ensemble data assimilation; system; model; coupler
DOI
10.1002/cpe.4161
Chinese Library Classification number
TP31 [Computer software]
Subject classification codes
081202; 0835
Abstract
On the verge of the convergence between high-performance computing and Big Data processing, it has become increasingly prevalent to deploy large-scale data analytics workloads on high-end supercomputers. Such applications often come in the form of complex workflows with various components, assimilating data from scientific simulations as well as from measurements streamed from sensor networks, such as radars and satellites. For example, as part of the Flagship 2020 (post-K) supercomputer project of Japan, RIKEN is investigating the feasibility of a highly accurate weather forecasting system that would provide a real-time outlook for severe guerrilla rainstorms. One of the main performance bottlenecks of this application is the lack of efficient communication among workflow components, which currently takes place over the parallel file system. In this paper, we present an initial study of a direct communication framework designed for complex workflows that eliminates unnecessary file I/O among components. Specifically, we propose an I/O arbitration layer that provides direct parallel data transfer (both synchronous and asynchronous) among job components that rely on the netCDF interface for performing I/O operations. Our solution requires only minimal modifications to application code. Moreover, we propose a configuration file-based approach that allows users to specify the desired data transfer pattern among workflow components, offering a general solution for different application contexts. We present a preliminary evaluation of the proposed framework on the K Computer (running on up to 4800 compute nodes) using RIKEN's experimental weather forecasting workflow as a case study.
Pages: 12