Data Grid tools: enabling science on big distributed data

被引:4
作者
Allcock, B [1 ]
Chervenak, A [1 ]
Foster, I [1 ]
Kesselman, C [1 ]
Livny, M [1 ]
机构
[1] Argonne Natl Lab, Argonne, IL 60439 USA
来源
SCIDAC 2005: SCIENTIFIC DISCOVERY THROUGH ADVANCED COMPUTING | 2005年 / 16卷
关键词
D O I
10.1088/1742-6596/16/1/079
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
A particularly demanding and important challenge that we face as we attempt to construct the distributed computing machinery required to support SciDAC goals is the efficient, high-performance, reliable, secure, and policy-aware management of large-scale data movement. This problem is fundamental to diverse application domains including experimental physics (high energy physics, nuclear physics, light sources), simulation science (climate, computational chemistry, fusion, astrophysics), and large-scale collaboration. In each case, highly distributed user communities require high-speed access to valuable data, whether for visualization or analysis. The quantities of data involved (terabytes to petabytes), the scale of the demand (hundreds or thousands of users, data-intensive analyses, real-time constraints), and the complexity of the infrastructure that must be managed (networks, tertiary storage systems, network caches, computers, visualization systems) make the problem extremely challenging. Data management tools developed under the auspices of the SciDAC Data Grid Middleware project have become the de facto standard for data management in projects worldwide. Day in and day out, these tools provide the "plumbing" that allows scientists to do more science on an unprecedented scale in production environments.
引用
收藏
页码:571 / 575
页数:5
相关论文
共 11 条
  • [1] ALLCOCK W, 2005, JOINT WORKSH HIGH PE
  • [2] [Anonymous], 2005, SC 2005
  • [3] BENT J, 2004, GRID RESOURCE MANAGE
  • [4] The Earth System Grid: Supporting the next generation of climate modeling research
    Bernholdt, D
    Bharathi, S
    Brown, D
    Chanchio, K
    Chen, ML
    Chervenak, A
    Cinquini, L
    Drach, B
    Foster, I
    Fox, P
    Garcia, J
    Kesselman, C
    Markel, R
    Middleton, D
    Nefedova, V
    Pouchard, L
    Shoshani, A
    Sim, A
    Strand, G
    Williams, D
    [J]. PROCEEDINGS OF THE IEEE, 2005, 93 (03) : 485 - 495
  • [5] The data grid: Towards an architecture for the distributed management and analysis of large scientific datasets
    Chervenak, A
    Foster, I
    Kesselman, C
    Salisbury, C
    Tuecke, S
    [J]. JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2000, 23 (03) : 187 - 200
  • [6] CHERVENAK A, 2002, SC 02 HIGH PERFORMAN
  • [7] CHERVENAK AL, 2004, IEEE INT S HIGH PERF
  • [8] EEROLA P, 2003, 4 INT WORKSH GRID CO
  • [9] Modeling and managing state in distributed systems: The role of OGSI and WSRF
    Foster, I
    Czajkowski, K
    Ferguson, DF
    Frey, J
    Graham, S
    Maguire, T
    Snelling, D
    Tuecke, S
    [J]. PROCEEDINGS OF THE IEEE, 2005, 93 (03) : 604 - 612
  • [10] FOSTER I, 2004, IEEE INT S HIGH PERF