XLUM: an open data format for exchange and long-term preservation of luminescence data

被引:1
作者
Kreutzer, Sebastian [1 ,2 ,7 ]
Grehl, Steve [3 ]
Hoehne, Michael [4 ]
Simmank, Oliver [4 ]
Dornich, Kay [4 ]
Adamiec, Grzegorz [5 ]
Burow, Christoph [6 ]
Roberts, Helen M. [1 ]
Duller, Geoff A. T. [1 ]
机构
[1] Aberystwyth Univ, Geog & Earth Sci, Aberystwyth, Wales
[2] Univ Bordeaux Montaigne, Archeosci Bordeaux, CNRS, UMR 6034, Pessac, France
[3] HUK Coburg, Coburg, Germany
[4] Freiberg Instruments GmbH, Freiberg, Germany
[5] Silesian Tech Univ, Inst Phys, Div Geochronol & Environm Isotopes, Gliwice, Poland
[6] Piazza Blu2 GmbH, Cologne, Germany
[7] Ruprecht Karl Univ Heidelberg, Inst Geog, Heidelberg, Germany
来源
GEOCHRONOLOGY | 2023年 / 5卷 / 01期
关键词
SEDIMENTS; QUARTZ;
D O I
10.5194/gchron-5-271-2023
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
The concept of open data has become the modern science meme, and major funding bodies and publishers support open data. On a daily basis, however, the open data mandate frequently encounters technical obstacles, such as a lack of a suitable data format for data sharing and long-term data preservation. Such issues are often community-specific and best addressed through community-tailored solutions. In Quaternary sciences, luminescence dating is widely used for constraining the timing of event-based processes (e.g. sediment transport). Every luminescence dating study produces a vast body of primary data that usually remains inaccessible and incompatible with future studies or adjacent scientific disciplines. To facilitate data exchange and long-term data preservation (in short, open data) in luminescence dating studies, we propose a new XML-based structured data format called XLUM. The format applies a hierarchical data storage concept consisting of a root node (node 0), a sample (node 1), a sequence (node 2), a record (node 3), and a curve (node 4). The curve level holds information on the technical component (e.g. photomultiplier, thermocouple). A finite number of curves represent a record (e.g. an optically stimulated luminescence curve). Records are part of a sequence measured for a particular sample. This design concept allows the user to retain information on a technical component level from the measurement process. The additional storage of related metadata fosters future data mining projects on large datasets. The XML-based format is less memory-efficient than binary formats; however, its focus is data exchange, preservation, and hence XLUM long-term format stability by design. XLUM is inherently stable to future updates and backwards-compatible. We support XLUM through a new R package xlum, facilitating the conversion of different formats into the new XLUM format. XLUM is licensed under the MIT licence and hence available for free to be used in open- and closed-source commercial and non-commercial software and research projects.
引用
收藏
页码:271 / 284
页数:14
相关论文
共 65 条
  • [1] Agence Nationale de la Recherche (ANR), 2019, Modele de Plan de gestion des donnees (PGD)
  • [2] Aitken M. J., 1985, Thermoluminescence dating
  • [3] [Anonymous], 2016, DTU Nutech- Center for Nuclear Technologies: The Sequence Editor
  • [4] [Anonymous], 2014, FORCE11: The FAIR Data Principles- FORCE11
  • [5] [Anonymous], 2008, Extensible Markup Language (XML) 1.0 (Fifth Edition). W3C Recommendation 26 November
  • [6] [Anonymous], 2017, The European Code of Conduct for Research Integrity
  • [7] Balco G., 2020, GEOCHRONOLOGY, V2, P169, DOI [10.5194/gchron-2-169-2020., DOI 10.5194/GCHRON-2-169-2020]
  • [8] Bateman M.D., 2019, HDB LUMINESCENCE DAT
  • [9] Strategies for flexibility in luminescence dating: procedure-oriented measurement and hardware modularity
    Bortolot, VJ
    Bluszcz, A
    [J]. RADIATION MEASUREMENTS, 2003, 37 (4-5) : 551 - 555
  • [10] A new modular high capacity OSL reader system
    Bortolot, VJ
    [J]. RADIATION MEASUREMENTS, 2000, 32 (5-6) : 751 - 757