Globally Accessible Distributed Data Sharing (GADDS): a decentralized FAIR platform to facilitate data sharing in the life sciences

被引:2
作者
Vazquez, Pavel [1 ]
Hirayama-Shoji, Kayoko [1 ]
Novik, Steffen [2 ]
Krauss, Stefan [1 ,3 ]
Rayner, Simon [1 ,4 ]
机构
[1] Univ Oslo, Inst Basic Med Sci, Fac Med, Hybrid Technol Hub,Ctr Excellence, N-0317 Oslo, Norway
[2] Univ Oslo, Fac Math & Nat Sci, Dept Informat, N-0315 Oslo, Norway
[3] Oslo Univ Hosp, Dept Immunol & Transfus Med, N-0424 Oslo, Norway
[4] Oslo Univ Hosp, Dept Med Genet, N-0407 Oslo, Norway
基金
芬兰科学院;
关键词
D O I
10.1093/bioinformatics/btac362
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation Technical advances have revolutionized the life sciences and researchers commonly face challenges associated with handling large amounts of heterogeneous digital data. The Findable, Accessible, Interoperable and Reusable (FAIR) principles provide a framework to support effective data management. However, implementing this framework is beyond the means of most researchers in terms of resources and expertise, requiring awareness of metadata, policies, community agreements and other factors such as vocabularies and ontologies. Results: We have developed the Globally Accessible Distributed Data Sharing (GADDS) platform to facilitate FAIR-like data-sharing in cross-disciplinary research collaborations. The platform consists of (i) a blockchain-based metadata quality control system, (ii) a private cloud-like storage system and (iii) a version control system. GADDS is built with containerized technologies, providing minimal hardware standards and easing scalability, and offers decentralized trust via transparency of metadata, facilitating data exchange and collaboration. As a use case, we provide an example implementation in engineered living material technology within the Hybrid Technology Hub at the University of Oslo.
引用
收藏
页码:3812 / 3817
页数:6
相关论文
共 40 条
[1]   Comparing Performance of Commercial Cloud Storage Systems: The Case of Dropbox and One Drive [J].
Alotaibi, Shamsah ;
Alomair, Hadeel ;
Elhussein, Mariam .
2019 INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCES (ICCIS), 2019, :299-303
[2]   Unmet needs for analyzing biological big data: A survey of 704 NSF principal investigators [J].
Barone, Lindsay ;
Williams, Jason ;
Micklos, David .
PLOS COMPUTATIONAL BIOLOGY, 2017, 13 (10)
[3]   MetaSRA: normalized human sample-specific metadata for the Sequence Read Archive [J].
Bernstein, Matthew N. ;
Doan, Anhai ;
Dewey, Colin N. .
BIOINFORMATICS, 2017, 33 (18) :2914-2923
[4]  
Cachin C., 2016, Architecture of the hyperledger Blockchain fabric[EB/OL], DOI DOI 10.4230/LIPICS.OPODIS.2016.24
[5]  
Chacon S., 2014, PRO GIT, DOI 10.1007/978-1-4842-0076-6
[6]  
Chevet S., 2018, BLOCKCHAIN TECHNOLOG
[7]   Next generation sequencing of SARS-CoV-2 genomes: challenges, applications and opportunities [J].
Chiara, Matteo ;
D'Erchia, Anna Maria ;
Gissi, Carmela ;
Manzari, Caterina ;
Parisi, Antonio ;
Resta, Nicoletta ;
Zambelli, Federico ;
Picardi, Ernesto ;
Pavesi, Giulio ;
Horner, David S. ;
Pesole, Graziano .
BRIEFINGS IN BIOINFORMATICS, 2021, 22 (02) :616-630
[8]   Scientific workflows for computational reproducibility in the life sciences: Status, challenges and opportunities [J].
Cohen-Boulakia, Sarah ;
Belhajjame, Khalid ;
Collin, Olivier ;
Chopard, Jerome ;
Froidevaux, Christine ;
Gaignard, Alban ;
Hinsen, Konrad ;
Larmande, Pierre ;
Le Brass, Yvan ;
Lemoine, Frederic ;
Mareuil, Fabien ;
Menager, Herve ;
Pradal, Christophe ;
Blanchet, Christophe .
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2017, 75 :284-298
[9]   Ten simple rules for making a vocabulary FAIR [J].
Cox, Simon J. D. ;
Gonzalez-Beltran, Alejandra N. ;
Magagna, Barbara ;
Marinescu, Maria-Cristina .
PLOS COMPUTATIONAL BIOLOGY, 2021, 17 (06)
[10]   Attitudes and norms affecting scientists' data reuse [J].
Curty, Renata Goncalves ;
Crowston, Kevin ;
Specht, Alison ;
Grant, Bruce W. ;
Dalton, Elizabeth D. .
PLOS ONE, 2017, 12 (12)