Graph-based Information Exploration over Structured and Unstructured Data

Cited by: 0
Authors
Koumoutsos, Giannis [1]
Fasli, Maria [1]
Lewin, Ian [2]
Milward, David [2]
Affiliations
[1] Univ Essex, Sch Comp Sci & Elect Engn, Colchester, Essex, England
[2] Linguamatics, Cambridge, England
Source
2017 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA) | 2017
Funding
Engineering and Physical Sciences Research Council (UK);
Keywords
structured; unstructured; graph-based; exploring relations; biomedical; SEMANTIC WEB; INTEGRATION; DISCOVERY;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104; 0812; 0835; 1405;
Abstract
With the rise of the Semantic Web, several public semantic repositories such as knowledge bases, ontologies and taxonomies have been developed in a variety of domains. In specific domains such as biomedicine, they already form a large and valuable infrastructure. Meanwhile, the development of efficient algorithms for Natural Language Processing has given access to the massive knowledge hidden in many unstructured resources. Combining and harvesting these two worlds would result in a very productive knowledge fusion applicable in several domains. In this paper, an extensible framework is presented that focuses on accessing and graphically presenting the knowledge coming from all available structured and unstructured resources. The approach is built on a graph-based abstraction formalism for representing any type of query. This formalism makes the framework accessible to non-expert users who have no knowledge of constructing queries in any query language and barely understand what structured and unstructured resources are. The architecture that allows the framework to be adapted to all available resources is described, along with a proof-of-concept implementation in the biomedical domain.
Pages: 1991-2000
Page count: 10