Improvement of data and metadata quality in catalysis research: A use case-driven methodology

被引:0
作者
Huskova, Nadiia [1 ]
Dikova, Yuliia [1 ]
Petrenko, Taras [1 ]
Boenisch, Thomas [1 ]
机构
[1] Univ Stuttgart, High Performance Comp Ctr Stuttgart HLRS, Nobelstr 19, D-70569 Stuttgart, Germany
关键词
NFDI4Cat; Research data infrastructure; Catalysis; Use cases; Standardization; Metadata; Semantic representation;
D O I
10.1016/j.cattod.2024.115111
中图分类号
O69 [应用化学];
学科分类号
081704 ;
摘要
The goal of the NFDI4Cat project is to establish a National Research Data Infrastructure for catalysis research in Germany that ensures the data and metadata collected and shared by researchers are of high quality and adhere to established standards. To achieve this goal, a comprehensive use case (UC) collection methodology has been developed. The methodology is based on the collection and analysis of use cases for research workflows and data therein provided by the researchers working in the field of catalysis. The proposed methodology includes detailed guidelines for the information collected within a particular UC, ensuring that it is relevant, accurate and complete. The collected UC data are then evaluated based on established criteria for data and metadata quality. Any identified issues are addressed through the collaboration with the respective researchers, to ensure that the use cases meet the required standards. The collected use cases are then standardized, which in particular includes mapping the data and metadata to relevant ontologies and vocabularies, as well as ensuring consistency across different use cases. The standardization process is coupled with a semantic representation of metadata within the Resource Description Framework (RDF) followed by appropriate extension of the ontology being developed. The semantic framework allows for easy integration and cross-referencing of data. It ensures that the data are machine-readable, linked, and can be easily integrated with other datasets, making it more discoverable and useful for the catalysis research community. Within the project, special attention is paid on the collection of UCs from different fields, including biocatalysis, homogeneous catalysis, and heterogeneous catalysis. This will provide a comprehensive representation of the metadata related to catalysis. The proposed methodology serves as a valuable resource for the catalysis research community, promoting adherence to established standards and ensuring that the data and metadata shared by researchers are of high quality.
引用
收藏
页数:9
相关论文
共 21 条
[1]  
Baiker A, 2001, CHIMIA, V55, P796
[2]   Ontologies4Cat: investigating the landscape of ontologies for catalysis research data management [J].
Behr, Alexander S. ;
Borgelt, Hendrik ;
Kockmann, Norbert .
JOURNAL OF CHEMINFORMATICS, 2024, 16 (01)
[3]  
Bonisch T., 2023, NFDI4Cat: Architecture document (Version 1), DOI [10.5281/zenodo.10391091, DOI 10.5281/ZENODO.10391091]
[4]  
Borgelt, 2023, ANN DIG CAT CAT REL, DOI [10.5231/senodo10265604, DOI 10.5231/SENODO10265604]
[5]  
Chawla T, 2016, 2016 INTERNATIONAL CONFERENCE ON EMERGING TRENDS IN COMMUNICATION TECHNOLOGIES (ETCT)
[6]  
dechemade, The Digitalization of Catalysis-Related Sciences
[7]  
Doerr M., 2023, Proc. Conf. Res. Data Infrastruct., V1, DOI 10.52825(cordi.v11359
[8]  
Dörr M, 2018, METHODS MOL BIOL, V1685, P269, DOI 10.1007/978-1-4939-7366-8_16
[9]  
Labra-Gayo JE, 2019, STUD COMPUT INTELL, V815, P121, DOI 10.1007/978-3-030-06149-4_6
[10]  
enmorepo github, About us