Publishing DisGeNET as nanopublications

Cited by: 13
Authors
Queralt-Rosinach, Nuria [1]
Kuhn, Tobias [2]
Chichester, Christine [3]
Dumontier, Michel [4]
Sanz, Ferran [1]
Furlong, Laura I. [1]
Affiliations
[1] Univ Pompeu Fabra, DCEXS, IMIM, Res Programme Biomed Informat GRIB,IBI Grp, Barcelona, Spain
[2] ETH, Dept Humanities Social & Polit Sci, Zurich, Switzerland
[3] Swiss Inst Bioinformat, CALIPHO Grp, CMU Rue Michel Servet 1, CH-1211 Geneva 4, Switzerland
[4] Stanford Univ, Stanford Ctr Biomed Informat Res, Stanford, CA USA
Keywords
Gene-disease associations; linked data; nanopublication; provenance; trusty URIs; BIOMEDICAL-RESEARCH; TEXT;
DOI
10.3233/SW-150189
Chinese Library Classification
TP18 [Theory of artificial intelligence]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The increasing and unprecedented publication rate in the biomedical field is a major bottleneck for knowledge discovery in the Life Sciences. The manual curation of facts from published scientific papers is slow and inefficient, and therefore new approaches are needed that can enable the automatic, scalable and reliable extraction of assertions. While the publication of scientific assertions and datasets on the Semantic Web is gaining traction, it also creates new challenges such as the proper representation of provenance and versioning. Here, we address these issues and describe our efforts to represent the DisGeNET database of human gene-disease associations as permanent, immutable, and provenance-rich digital objects called nanopublications. Our nanopublications are the first instance of a Linked Data model that ensures stable interlinking of the assertion and its metadata by Trusty URIs. As DisGeNET integrates manually curated as well as text-mined data of different origins, the semantic description of the evidence for each assertion is important to provide trust and allow evidence-based hypothesis generation. We also describe our steps to ensure high quality and demonstrate the utility of linking our data to other datasets on the emerging Semantic Web.
Pages: 519 - 528
Page count: 10
Related papers
50 in total
  • [1] Decentralized provenance-aware publishing with nanopublications
    Kuhn, Tobias
    Chichester, Christine
    Krauthammer, Michael
    Queralt-Rosinach, Nuria
    Verborgh, Ruben
    Giannakopoulos, George
    Ngomo, Axel-Cyrille Ngonga
    Viglianti, Raffaele
    Dumontier, Michel
    PEERJ COMPUTER SCIENCE, 2016
  • [2] Converting neXtProt into Linked Data and nanopublications
    Chichester, Christine
    Karch, Oliver
    Gaudet, Pascale
    Lane, Lydie
    Mons, Barend
    Bairoch, Amos
    SEMANTIC WEB, 2015, 6 (02) : 147 - 153
  • [3] Querying neXtProt nanopublications and their value for insights on sequence variants and tissue expression
    Chichester, Christine
    Gaudet, Pascale
    Karch, Oliver
    Groth, Paul
    Lane, Lydie
    Bairoch, Amos
    Mons, Barend
    Loizou, Antonis
    JOURNAL OF WEB SEMANTICS, 2014, 29 : 3 - 11
  • [4] DisGeNET: a comprehensive platform integrating information on human disease-associated genes and variants
    Pinero, Janet
    Bravo, Alex
    Queralt-Rosinach, Nuria
    Gutierrez-Sacristan, Alba
    Deu-Pons, Jordi
    Centeno, Emilio
    Garcia-Garcia, Javier
    Sanz, Ferran
    Furlong, Laura I.
    NUCLEIC ACIDS RESEARCH, 2017, 45 (D1) : D833 - D839
  • [5] Nanopublications for exposing experimental data in the life-sciences: a Huntington’s Disease case study
    Eleni Mina
    Mark Thompson
    Rajaram Kaliyaperumal
    Jun Zhao
    Eelke van der Horst
    Zuotian Tatum
    Kristina M Hettne
    Erik A Schultes
    Barend Mons
    Marco Roos
    Journal of Biomedical Semantics, 6
  • [6] Nanopublications for exposing experimental data in the life-sciences: a Huntington's Disease case study
    Mina, Eleni
    Thompson, Mark
    Kaliyaperumal, Rajaram
    Zhao, Jun
    van der Horst, Eelke
    Tatum, Zuotian
    Hettne, Kristina M.
    Schultes, Erik A.
    Mons, Barend
    Roos, Marco
    JOURNAL OF BIOMEDICAL SEMANTICS, 2015, 6
  • [7] Human Computation VGI Provenance: Semantic Web-Based Representation and Publishing
    Celino, Irene
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2013, 51 (11) : 5137 - 5144
  • [8] PUBLISHING STATISTICAL DATA ON THE WEB
    Salas, Percy E. Rivera
    Da Mota, Fernando Maia
    Breitman, Karin K.
    Casanova, Marco A.
    Martin, Michael
    Auer, Soren
    INTERNATIONAL JOURNAL OF SEMANTIC COMPUTING, 2012, 6 (04) : 373 - 388
  • [9] Publishing the Trove Newspaper Corpus
    Cassidy, Steve
    LREC 2016 - TENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2016 : 4520 - 4525
  • [10] Publishing deep web geographic data
    Piccinini, Helena
    Casanova, Marco A.
    Leme, Luiz Andre P. P.
    Furtado, Antonio L.
    GEOINFORMATICA, 2014, 18 (04) : 769 - 792