The eGenVar data management system-cataloguing and sharing sensitive data and metadata for the life sciences

Cited by: 8
Authors
Razick, Sabry [1]
Mocnik, Rok [1]
Thomas, Laurent F. [1]
Ryeng, Einar [1]
Drablos, Finn [1]
Saetrom, Pal [1, 2]
Affiliations
[1] Norwegian Univ Sci & Technol, Dept Canc Res & Mol Med, NO-7491 Trondheim, Norway
[2] Norwegian Univ Sci & Technol, Dept Comp & Informat Sci, NO-7491 Trondheim, Norway
Source
DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION | 2014
Keywords
SUSCEPTIBILITY LOCI; ASSOCIATION; STANDARDS; RISK
DOI
10.1093/database/bau027
Chinese Library Classification
Q [Biological Sciences]
Discipline classification codes
07; 0710; 09
Abstract
Systematic data management and controlled data sharing aim at increasing reproducibility, reducing redundancy in work, and providing a way to efficiently locate complementing or contradicting information. One method of achieving this is collecting data in a central repository or in a location that is part of a federated system and providing interfaces to the data. However, certain data, such as data from biobanks or clinical studies, may, for legal and privacy reasons, often not be stored in public repositories. Instead, we describe a metadata cataloguing system and a software suite for reporting the presence of data from the life sciences domain. The system stores three types of metadata: file information, file provenance and data lineage, and content descriptions. Our software suite includes both graphical and command line interfaces that allow users to report and tag files with these different metadata types. Importantly, the files remain in their original locations with their existing access-control mechanisms in place, while our system provides descriptions of their contents and relationships. Our system and software suite thereby provide a common framework for cataloguing and sharing both public and private data. Database URL: http://bigr.medisin.ntnu.no/data/eGenVar/
Pages: 16