The eGenVar data management system: cataloguing and sharing sensitive data and metadata for the life sciences

Cited by: 8
Authors
Razick, Sabry [1]
Mocnik, Rok [1]
Thomas, Laurent F. [1]
Ryeng, Einar [1]
Drablos, Finn [1]
Saetrom, Pal [1, 2]
Affiliations
[1] Norwegian Univ Sci & Technol, Dept Canc Res & Mol Med, NO-7491 Trondheim, Norway
[2] Norwegian Univ Sci & Technol, Dept Comp & Informat Sci, NO-7491 Trondheim, Norway
Source
DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION | 2014
Keywords
SUSCEPTIBILITY LOCI; ASSOCIATION; STANDARDS; RISK;
DOI
10.1093/database/bau027
Chinese Library Classification number
Q [Biological Sciences];
Discipline classification code
07; 0710; 09
Abstract
Systematic data management and controlled data sharing aim at increasing reproducibility, reducing redundancy in work, and providing a way to efficiently locate complementing or contradicting information. One method of achieving this is collecting data in a central repository or in a location that is part of a federated system and providing interfaces to the data. However, certain data, such as data from biobanks or clinical studies, may, for legal and privacy reasons, often not be stored in public repositories. Instead, we describe a metadata cataloguing system and a software suite for reporting the presence of data from the life sciences domain. The system stores three types of metadata: file information, file provenance and data lineage, and content descriptions. Our software suite includes both graphical and command line interfaces that allow users to report and tag files with these different metadata types. Importantly, the files remain in their original locations with their existing access-control mechanisms in place, while our system provides descriptions of their contents and relationships. Our system and software suite thereby provide a common framework for cataloguing and sharing both public and private data.
Database URL: http://bigr.medisin.ntnu.no/data/eGenVar/
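The three metadata types named in the abstract can be illustrated with a minimal Python sketch. This is not the eGenVar implementation or its API: the names `CatalogueEntry`, `catalogue_file` and `find_by_tag`, the choice of fields, and the use of a SHA-256 checksum are all assumptions made for illustration. The sketch only records facts about a file and leaves the file itself where it is.

```python
import hashlib
import os
from dataclasses import dataclass, field


@dataclass
class CatalogueEntry:
    """Hypothetical catalogue record; field names are illustrative only."""
    # 1. File information: where the file lives and basic facts about it.
    path: str
    size_bytes: int
    sha256: str
    # 2. Provenance and lineage: paths of the files this one was derived from.
    derived_from: list = field(default_factory=list)
    # 3. Content description: key/value tags describing what the file holds.
    tags: dict = field(default_factory=dict)


def catalogue_file(path, derived_from=(), tags=None):
    """Build an entry for a file without moving, copying or modifying it."""
    with open(path, "rb") as handle:
        digest = hashlib.sha256(handle.read()).hexdigest()
    return CatalogueEntry(
        path=os.path.abspath(path),
        size_bytes=os.path.getsize(path),
        sha256=digest,
        derived_from=[os.path.abspath(p) for p in derived_from],
        tags=dict(tags or {}),
    )


def find_by_tag(entries, key, value):
    """Return the entries whose content description has tags[key] == value."""
    return [entry for entry in entries if entry.tags.get(key) == value]
```

Under these assumptions, a processed file would be reported with `catalogue_file("calls.vcf", derived_from=["reads.bam"], tags={"type": "variants"})`, where the file names are placeholders. Only the description enters the catalogue, so whatever access controls protect the file at its original location are left unchanged.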
Pages: 16