Enabling modern data discovery for atmospheric measurements

被引:2
作者
Guntupally, Kavya [1 ]
Dumas, Kyle [1 ]
Prakash, Giri [1 ]
Devarakonda, Ranjeet [1 ]
Darnell, Wade [1 ]
Davis, Maggie [1 ]
Cederwall, Richard [1 ]
机构
[1] Oak Ridge Natl Lab, Div Environm Sci, POB 2008, Oak Ridge, TN 37831 USA
关键词
ARM data center; Metadata; Data archive; FAIR data; Metadata management; Data search;
D O I
10.1007/s12145-021-00635-0
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The Atmospheric Radiation Measurement (ARM) user facility is a US Department of Energy Office of Science user facility that is managed and operated through a collaborative effort led by nine US Department of Energy national laboratories. The ARM Data Center, located at Oak Ridge National Laboratory, is responsible for the timely collection, processing, and delivery of data products to the scientific community. The ARM Data Center holds more than 11,000 data products, including metadata collected from field campaigns, instruments, value-added products, and principal investigator-contributed data. These data sets are checked for successful transfer (for most data, this transfer is carried out automatically via the network; however, some of the largest data sets and some of the most remote sites require manual shipping of hard disks) and both the data and metadata are processed to a standard format, which is an ARM-standardized structure, via the Network Common Data Form. The Network Common Data Form is a self-describing binary format with many compatible software tools. Once processed, the data are cataloged, stored in the ARM Data Archive, and made discoverable through association with an array of metadata-characterizing information, such as location and measurement classification. These metadata enable powerful search capabilities through the ARM Data Center Data Discovery interface. This paper discusses the workflow of how the new discovery system has been redesigned from user requirements and how the data are distributed to the scientific community.
引用
收藏
页码:1487 / 1502
页数:16
相关论文
共 17 条
[1]  
ARM, CAPABILITIES ATMOSPH
[2]  
Globus, DATA TRANSFER GLOBUS
[3]   Automated Indexing of Structured Scientific Metadata Using Apache Solr [J].
Guntupally, Kavya ;
Dumas, Kyle ;
Darnell, Wade ;
Crow, Michael ;
Devarakonda, Ranjeet ;
Giri, Prakash .
2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, :5685-5687
[4]  
Guntupally K, 2018, IEEE INT CONF BIG DA, P5328, DOI 10.1109/BigData.2018.8621924
[5]  
Kumar J., 2019, 2019 IEEE INT C BIG, V6, DOI 10.1109/BigData47090.2019.9006051
[6]  
Microservices Architecture, MICROSERVICES PATTER
[7]  
Prakash G, 2016, 2016 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), P4026, DOI 10.1109/BigData.2016.7841098
[8]  
Simform, REACT VS VUE
[9]  
Solr, APACHE SOLR 860
[10]  
Solr, 2017, UPLOADING STRUCTURED