The Earth System Grid Federation: An open infrastructure for access to distributed geospatial data

Cited by: 143
Authors
Cinquini, Luca [1, 2]
Crichton, Daniel [1, 2]
Mattmann, Chris [1, 2]
Harney, John [3]
Shipman, Galen [4]
Wang, Feiyi [3]
Ananthakrishnan, Rachana [5, 7]
Miller, Neill [6, 7]
Denvil, Sebastian [8]
Morgan, Mark [9]
Pobre, Zed [10]
Bell, Gavin M. [11]
Doutriaux, Charles [11]
Drach, Robert [11]
Williams, Dean [11]
Kershaw, Philip [12, 14]
Pascoe, Stephen [13, 14]
Gonzalez, Estanislao [15]
Fiore, Sandro [16]
Schweitzer, Roland [17]
Affiliations
[1] CALTECH, Jet Prop Lab, Pasadena, CA 91109 USA
[2] CALTECH, Pasadena, CA 91106 USA
[3] Oak Ridge Natl Lab, Oak Ridge, TN USA
[4] Oak Ridge Natl Lab, Comp & Computat Sci Directorate, Oak Ridge, TN USA
[5] Univ Chicago, Computat Inst, Chicago, IL 60637 USA
[6] Univ Chicago, Chicago, IL 60637 USA
[7] Argonne Natl Lab, Argonne, IL 60439 USA
[8] Inst Pierre Simon Laplace, Climate Modeling Grp, Paris, France
[9] Inst Pierre Simon Laplace, Earth Syst Modeling Platform, Paris, France
[10] NASA, Goddard Space Flight Ctr, Greenbelt, MD 20771 USA
[11] Lawrence Livermore Natl Lab, Livermore, CA USA
[12] STFC Rutherford Appleton Lab, RAL Space, Ctr Environm Data Archival, Didcot, Oxon, England
[13] STFC Rutherford Appleton Lab, Didcot, Oxon, England
[14] NCAS BADC, Didcot, Oxon, England
[15] German Climate Comp Ctr DKRZ, Hamburg, Germany
[16] Euromediterranean Ctr Climate Change CMCC, Lecce, Italy
[17] NOAA, Pacific Marine Environm Lab, Seattle, WA 98115 USA
Source
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF GRID COMPUTING AND ESCIENCE | 2014 / Vol. 36
Keywords
Climate science; Federation; Search; Discovery; Peer-to-peer; CMIP5
DOI
10.1016/j.future.2013.07.002
CLC number
TP301 [Theory, methods]
Discipline classification code
081202
Abstract
The Earth System Grid Federation (ESGF) is a multi-agency, international collaboration that aims at developing the software infrastructure needed to facilitate and empower the study of climate change on a global scale. The ESGF's architecture employs a system of geographically distributed peer nodes, which are independently administered yet united by the adoption of common federation protocols and application programming interfaces (APIs). The cornerstones of its interoperability are the peer-to-peer messaging that is continuously exchanged among all nodes in the federation; a shared architecture and API for search and discovery; and a security infrastructure based on industry standards (OpenID, SSL, GSI and SAML). The ESGF software stack integrates custom components (for data publishing, searching, user interface, security and messaging), developed collaboratively by the team, with popular application engines (Tomcat, Solr) available from the open source community. The full ESGF infrastructure has now been adopted by multiple Earth science projects and allows access to petabytes of geophysical data, including the entire Fifth Coupled Model Intercomparison Project (CMIP5) output used by the Intergovernmental Panel on Climate Change (IPCC) Fifth Assessment Report (AR5) and a suite of satellite observations (obs4MIPs) and reanalysis data sets (ANA4MIPs). This paper presents ESGF as a successful example of integration of disparate open source technologies into a cohesive, wide functional system, and describes our experience in building and operating a distributed and federated infrastructure to serve the needs of the global climate science community. (C) 2013 Elsevier B.V. All rights reserved.
Pages: 400-417
Page count: 18