A Semi-Automated Workflow for FAIR Maturity Indicators in the Life Sciences

被引:23
作者
Ammar, Ammar [1 ]
Bonaretti, Serena [1 ,2 ]
Winckers, Laurent [1 ]
Quik, Joris [3 ]
Bakker, Martine [3 ]
Maier, Dieter [4 ]
Lynch, Iseult [5 ]
van Rijn, Jeaphianne [1 ]
Willighagen, Egon [1 ]
机构
[1] Maastricht Univ, Dept Bioinformat BiGCaT, NUTRIM, NL-6200 MD Maastricht, Netherlands
[2] Transparent MSK Res, NL-6221 BN Maastricht, Netherlands
[3] Natl Inst Publ Hlth & Environm RIVM, NL-3720 BA Bilthoven, Netherlands
[4] Biomax Informat AG, D-82152 Martinsried, Germany
[5] Univ Birmingham, Sch Geog Earth & Environm Sci, Birmingham B15 2TT, W Midlands, England
基金
欧盟地平线“2020”;
关键词
FAIR guidelines; FAIR maturity indicators; life sciences; Jupyter Notebook; MINIMUM INFORMATION; NANOMATERIAL DATA; GENE-EXPRESSION; COMPLETENESS; STANDARDS; CURATION; QUALITY;
D O I
10.3390/nano10102068
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Data sharing and reuse are crucial to enhance scientific progress and maximize return of investments in science. Although attitudes are increasingly favorable, data reuse remains difficult due to lack of infrastructures, standards, and policies. The FAIR (findable, accessible, interoperable, reusable) principles aim to provide recommendations to increase data reuse. Because of the broad interpretation of the FAIR principles, maturity indicators are necessary to determine the FAIRness of a dataset. In this work, we propose a reproducible computational workflow to assess data FAIRness in the life sciences. Our implementation follows principles and guidelines recommended by the maturity indicator authoring group and integrates concepts from the literature. In addition, we propose a FAIR balloon plot to summarize and compare dataset FAIRness. We evaluated the feasibility of our method on three real use cases where researchers looked for six datasets to answer their scientific questions. We retrieved information from repositories (ArrayExpress, Gene Expression Omnibus, eNanoMapper, caNanoLab, NanoCommons and ChEMBL), a registry of repositories, and a searchable resource (Google Dataset Search) via application program interfaces (API) wherever possible. With our analysis, we found that the six datasets met the majority of the criteria defined by the maturity indicators, and we showed areas where improvements can easily be reached. We suggest that use of standard schema for metadata and the presence of specific attributes in registries of repositories could increase FAIRness of datasets.
引用
收藏
页码:1 / 14
页数:14
相关论文
共 41 条
[1]  
Afantitis A., 2019, DRIVING NANOINFORMAT, DOI [10.5281/ZENODO.3765525, DOI 10.5281/ZENODO.3765525]
[2]  
[Anonymous], 2009, ggplot2: elegant graphics for data analysis, DOI [10.1007/978-0-387-98141-3, DOI 10.1007/978-0-387-98141-3]
[3]  
[Anonymous], **NON-TRADITIONAL**
[4]  
[Anonymous], 2017, EFSA J
[5]  
[Anonymous], 2019, THIS REPOSITORY CONT
[6]  
[Anonymous], **NON-TRADITIONAL**
[7]   Revisiting Qualitative Data Reuse: A Decade On [J].
Bishop, Libby ;
Kuula-Lummi, Arja .
SAGE OPEN, 2017, 7 (01)
[8]   Minimum information about a microarray experiment (MIAME) - toward standards for microarray data [J].
Brazma, A ;
Hingamp, P ;
Quackenbush, J ;
Sherlock, G ;
Spellman, P ;
Stoeckert, C ;
Aach, J ;
Ansorge, W ;
Ball, CA ;
Causton, HC ;
Gaasterland, T ;
Glenisson, P ;
Holstege, FCP ;
Kim, IF ;
Markowitz, V ;
Matese, JC ;
Parkinson, H ;
Robinson, A ;
Sarkans, U ;
Schulze-Kremer, S ;
Stewart, J ;
Taylor, R ;
Vilo, J ;
Vingron, M .
NATURE GENETICS, 2001, 29 (04) :365-371
[9]   ArrayExpress - a public repository for microarray gene expression data at the EBI [J].
Brazma, A ;
Parkinson, H ;
Sarkans, U ;
Shojatalab, M ;
Vilo, J ;
Abeygunawardena, N ;
Holloway, E ;
Kapushesky, M ;
Kemmeren, P ;
Lara, GG ;
Oezcimen, A ;
Rocca-Serra, P ;
Sansone, SA .
NUCLEIC ACIDS RESEARCH, 2003, 31 (01) :68-71
[10]   FAIRshake: Toolkit to Evaluate the FAIRness of Research Digital Resources [J].
Clarke, Daniel J. B. ;
Wang, Lily ;
Jones, Alex ;
Wojciechowicz, Megan L. ;
Torre, Denis ;
Jagodnik, Kathleen M. ;
Jenkins, Sherry L. ;
McQuilton, Peter ;
Flamholz, Zachary ;
Silverstein, Moshe C. ;
Schilder, Brian M. ;
Robasky, Kimberly ;
Castillo, Claris ;
Idaszak, Ray ;
Ahalt, Stanley C. ;
Williams, Jason ;
Schurer, Stephan ;
Cooper, Daniel J. ;
Azevedo, Ricardo de Miranda ;
Klenk, Juergen A. ;
Haendel, Melissa A. ;
Nedzel, Jared ;
Avillach, Paul ;
Shimoyama, Mary E. ;
Harris, Rayna M. ;
Gamble, Meredith ;
Poten, Rudy ;
Charbonneau, Amanda L. ;
Larkin, Jennie ;
Brown, C. Titus ;
Bonazzi, Vivien R. ;
Dumontier, Michel J. ;
Sansone, Susanna-Assunta ;
Ma'ayan, Avi .
CELL SYSTEMS, 2019, 9 (05) :417-421