High performance threaded data streaming for large scale simulations

Cited by: 13
Authors
Bhat, V [1]
Klasky, S [1]
Atchley, S [1]
Beck, M [1]
McCune, D [1]
Parashar, M [1]
Affiliation
[1] Princeton Univ, Plasma Phys Lab, Princeton, NJ 08544 USA
Source
FIFTH IEEE/ACM INTERNATIONAL WORKSHOP ON GRID COMPUTING, PROCEEDINGS | 2004
DOI
10.1109/GRID.2004.36
Chinese Library Classification
TP301 [Theory, methods]
Discipline classification code
081202
Abstract
We have developed a threaded, parallel data streaming approach that uses Logistical Networking (LN) to transfer multi-terabyte simulation data from computers at NERSC to our local analysis and visualization cluster while the simulation executes, with negligible overhead. Data transfer experiments show that this concurrent approach compares favorably with writing data to local disk and transferring it later for post-processing. Our algorithms are network-aware and can stream data at up to 97 Mb/s on a 100 Mb/s link from California to New Jersey during a live simulation, using less than 5% CPU overhead at NERSC. This method is the first step in setting up a pipeline for simulation workflow and data management.
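The idea of streaming output while the simulation keeps computing can be illustrated with a minimal sketch. This is not the authors' implementation and does not use any Logistical Networking API; it only assumes a bounded in-memory buffer drained by one background thread. The names `stream_while_simulating`, `step_fn`, `send_fn`, and `max_buffered` are hypothetical and chosen for illustration.

```python
import queue
import threading


def stream_while_simulating(num_steps, step_fn, send_fn, max_buffered=4):
    """Run step_fn for each step while a background thread ships results.

    step_fn(step) -> data   stands in for one simulation time step (hypothetical).
    send_fn(data)           stands in for the network transfer (hypothetical).
    """
    buf = queue.Queue(maxsize=max_buffered)  # bounded, so memory use stays capped
    done = object()  # sentinel telling the sender thread to stop

    def sender():
        # Drain the buffer in FIFO order until the sentinel arrives.
        while True:
            item = buf.get()
            if item is done:
                break
            send_fn(item)

    worker = threading.Thread(target=sender, daemon=True)
    worker.start()

    for step in range(num_steps):
        # put() blocks only when the sender is max_buffered items behind,
        # so the compute loop is otherwise not held up by the transfer.
        buf.put(step_fn(step))

    buf.put(done)
    worker.join()  # wait until every buffered item has been sent
```

A caller would pass its own time-step and transfer functions, for example `stream_while_simulating(100, run_step, upload)`. The sketch omits error handling and any adaptation to network conditions, which the abstract attributes to the authors' network-aware algorithms.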
Pages: 243-250
Page count: 8