Metadata integrity in bioinformatics: Bridging the gap between data and knowledge

被引:2
|
作者
Caliskan, Aylin [1 ]
Dangwal, Seema [2 ]
Dandekar, Thomas [1 ]
机构
[1] Univ Wurzburg, Dept Bioinformat, Bioctr, D-97074 Wurzburg, Germany
[2] Stanford Univ, Stanford Cardiovasc Inst, Dept Med, Sch Med, Stanford, CA 94305 USA
来源
COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL | 2023年 / 21卷
关键词
Meta-data; Error; Annotation; Error-transfer; Wrong labelling; Patient data; Control group; Tools overview; CONTROLLED VOCABULARIES; GENE-EXPRESSION; CELL; CHALLENGES; ONTOLOGIES;
D O I
10.1016/j.csbj.2023.10.006
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
In the fast-evolving landscape of biomedical research, the emergence of big data has presented researchers with extraordinary opportunities to explore biological complexities. In biomedical research, big data imply also a big responsibility. This is not only due to genomics data being sensitive information but also due to genomics data being shared and re-analysed among the scientific community. This saves valuable resources and can even help to find new insights in silico. To fully use these opportunities, detailed and correct metadata are imperative. This includes not only the availability of metadata but also their correctness. Metadata integrity serves as a fundamental determinant of research credibility, supporting the reliability and reproducibility of data-driven findings. Ensuring metadata availability, curation, and accuracy are therefore essential for bioinformatic research. Not only must metadata be readily available, but they must also be meticulously curated and ideally error-free. Motivated by an accidental discovery of a critical metadata error in patient data published in two high-impact journals, we aim to raise awareness for the need of correct, complete, and curated metadata. We describe how the metadata error was found, addressed, and present examples for metadata-related challenges in omics research, along with supporting measures, including tools for checking metadata and software to facilitate various steps from data analysis to published research.
引用
收藏
页码:4895 / 4913
页数:19
相关论文
共 50 条
  • [1] Towards a knowledge driven framework for bridging the gap between software and data engineering
    Solanki, Monika
    Bozic, Bojan
    Dirschl, Christian
    Brennan, Rob
    JOURNAL OF SYSTEMS AND SOFTWARE, 2019, 149 : 476 - 484
  • [2] Bridging the geospatial gap: Data about space and indigenous knowledge of place
    Briggs, Carolyn
    Burfurd, Ingrid
    Duckham, Matt
    Guntarik, Olivia
    Kerr, Di
    McMillan, Mark
    Saldias, Daisy San Martin
    GEOGRAPHY COMPASS, 2020, 14 (11):
  • [3] Knowledge graphs for seismic data and metadata
    Davis, William
    Hunt, Cassandra R.
    APPLIED COMPUTING AND GEOSCIENCES, 2024, 21
  • [4] LabPipe: an extensible bioinformatics toolkit to manage experimental data and metadata
    Bo Zhao
    Luke Bryant
    Rebecca Cordell
    Michael Wilde
    Dahlia Salman
    Dorota Ruszkiewicz
    Wadah Ibrahim
    Amisha Singapuri
    Tim Coats
    Erol Gaillard
    Caroline Beardsmore
    Toru Suzuki
    Leong Ng
    Neil Greening
    Paul Thomas
    Paul Monks
    Christopher Brightling
    Salman Siddiqui
    Robert C. Free
    BMC Bioinformatics, 21
  • [5] LabPipe: an extensible bioinformatics toolkit to manage experimental data and metadata
    Zhao, Bo
    Bryant, Luke
    Cordell, Rebecca
    Wilde, Michael
    Salman, Dahlia
    Ruszkiewicz, Dorota
    Ibrahim, Wadah
    Singapuri, Amisha
    Coats, Tim
    Gaillard, Erol
    Beardsmore, Caroline
    Suzuki, Toru
    Ng, Leong
    Greening, Neil
    Thomas, Paul
    Monks, Paul
    Brightling, Christopher
    Siddiqui, Salman
    Free, Robert C.
    BMC BIOINFORMATICS, 2020, 21 (01)
  • [6] Bridging the Gap between Connectome and Transcriptome
    Fornito, Alex
    Arnatkeviciute, Aurina
    Fulcher, Ben D.
    TRENDS IN COGNITIVE SCIENCES, 2019, 23 (01) : 34 - 50
  • [7] Towards Bridging the Data Exchange Gap Between Atomistic Simulation and Larger Scale Models
    Reith, David
    Christensen, Mikael
    Wolf, Walter
    Wimmer, Erich
    Schmitz, Georg J.
    PROCEEDINGS OF THE 4TH WORLD CONGRESS ON INTEGRATED COMPUTATIONAL MATERIALS ENGINEERING (ICME 2017), 2017, : 45 - 55
  • [8] Bridging the gap between data acquisition and inference ontologies - towards ontology based link discovery
    Goldstein, ML
    Morris, SA
    Yen, GG
    SENSORS, AND COMMAND, CONTROL, COMMUNICATIONS, AND INTELLIGENCE (C3I) TECHNOLOGIES FOR HOMELAND DEFENSE AND LAW ENFORCEMENT II, 2003, 5071 : 116 - 127
  • [9] Relations as patterns: bridging the gap between OBO and OWL
    Hoehndorf, Robert
    Oellrich, Anika
    Dumontier, Michel
    Kelso, Janet
    Rebholz-Schuhmann, Dietrich
    Herre, Heinrich
    BMC BIOINFORMATICS, 2010, 11
  • [10] Bridging the gap between servitization and social innovation.
    Spadafora, Maria
    Rapaccini, Mario
    JOURNAL OF CLEANER PRODUCTION, 2024, 452