The Materials Data Facility: Data Services to Advance Materials Science Research

被引:235
作者
Blaiszik, B. [1 ]
Chard, K. [1 ]
Pruyne, J. [1 ]
Ananthakrishnan, R. [1 ]
Tuecke, S. [1 ]
Foster, I. [1 ,2 ,3 ]
机构
[1] Univ Chicago, Computat Inst, 5735 South Ellis Ave, Chicago, IL 60637 USA
[2] Univ Chicago, Dept Comp Sci, Chicago, IL 60637 USA
[3] Argonne Natl Lab, Math & Comp Sci Div, Lemont, IL 60439 USA
关键词
Materials; data publication; data management; data preservation; software as a service;
D O I
10.1007/s11837-016-2001-3
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
With increasingly strict data management requirements from funding agencies and institutions, expanding focus on the challenges of research replicability, and growing data sizes and heterogeneity, new data needs are emerging in the materials community. The materials data facility (MDF) operates two cloud-hosted services, data publication and data discovery, with features to promote open data sharing, self-service data publication and curation, and encourage data reuse, layered with powerful data discovery tools. The data publication service simplifies the process of copying data to a secure storage location, assigning data a citable persistent identifier, and recording custom (e.g., material, technique, or instrument specific) and automatically-extracted metadata in a registry while the data discovery service will provide advanced search capabilities (e.g., faceting, free text range querying, and full text search) against the registered data and metadata. The MDF services empower individual researchers, research projects, and institutions to (I) publish research datasets, regardless of size, from local storage, institutional data stores, or cloud storage, without involvement of third-party publishers; (II) build, share, and enforce extensible domain-specific custom metadata schemas; (III) interact with published data and metadata via representational state transfer (REST) application program interfaces (APIs) to facilitate automation, analysis, and feedback; and (IV) access a data discovery model that allows researchers to search, interrogate, and eventually build on existing published data. We describe MDF's design, current status, and future plans.
引用
收藏
页码:2045 / 2052
页数:8
相关论文
共 17 条
  • [1] Software as a Service for Data Scientists
    Allen, Bryce
    Bresnahan, John
    Childers, Lisa
    Foster, Ian
    Kandaswamy, Gopi
    Kettimuthu, Raj
    Kordas, Jack
    Link, Mike
    Martin, Stuart
    Pickett, Karl
    Tuecke, Steven
    [J]. COMMUNICATIONS OF THE ACM, 2012, 55 (02) : 81 - 88
  • [2] [Anonymous], 2003, DSPACE OPEN SOURCE D
  • [3] Strategy for Extensible, Evolving Terminology for the Materials Genome Initiative Efforts
    Bhat, Talapady N.
    Bartolo, Laura M.
    Kattner, Ursula R.
    Campbell, Carelyn E.
    Elliott, John T.
    [J]. JOM, 2015, 67 (08) : 1866 - 1875
  • [4] Chard K., 2015, E SCI E SCI 2015 11, P401
  • [5] Efficient and Secure Transfer, Synchronization, and Sharing of Big Data
    Chard, Kyle
    Tuecke, Steven
    Foster, Ian
    [J]. IEEE CLOUD COMPUTING, 2014, 1 (03) : 46 - 55
  • [6] AFLOWLIB.ORG: A distributed materials properties repository from high-throughput ab initio calculations
    Curtarolo, Stefano
    Setyawan, Wahyu
    Wang, Shidong
    Xue, Junkai
    Yang, Kesong
    Taylor, Richard H.
    Nelson, Lance J.
    Hart, Gus L. W.
    Sanvito, Stefano
    Buongiorno-Nardelli, Marco
    Mingo, Natalio
    Levy, Ohad
    [J]. COMPUTATIONAL MATERIALS SCIENCE, 2012, 58 : 227 - 235
  • [7] The Materials Genome Initiative, the interplay of experiment, theory and computation
    de Pablo, Juan J.
    Jones, Barbara
    Lind, Cora
    Ozolins, Vidvuds
    Ramirez, Arthur P.
    [J]. CURRENT OPINION IN SOLID STATE & MATERIALS SCIENCE, 2014, 18 (02) : 99 - 117
  • [9] Holdren J.P., 2011, MAT GEN IN GLOB COMP
  • [10] Commentary: The Materials Project: A materials genome approach to accelerating materials innovation
    Jain, Anubhav
    Shyue Ping Ong
    Hautier, Geoffroy
    Chen, Wei
    Richards, William Davidson
    Dacek, Stephen
    Cholia, Shreyas
    Gunter, Dan
    Skinner, David
    Ceder, Gerbrand
    Persson, Kristin A.
    [J]. APL MATERIALS, 2013, 1 (01):