Linked open data-based framework for automatic biomedical ontology generation

被引:19
作者
Alobaidi, Mazen [1 ,2 ]
Malik, Khalid Mahmood [1 ]
Sabra, Susan [1 ]
机构
[1] Oakland Univ, Comp Sci & Engn Dept, 2200 N Squirrel Rd, Rochester, MI 48309 USA
[2] Micro Focus Int Plc, Troy, MI 48084 USA
来源
BMC BIOINFORMATICS | 2018年 / 19卷
关键词
Semantic web; Ontology generation; Linked open data; Semantic enrichment; INFORMATION; EXTRACTION; TEXT;
D O I
10.1186/s12859-018-2339-3
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Fulfilling the vision of Semantic Web requires an accurate data model for organizing knowledge and sharing common understanding of the domain. Fitting this description, ontologies are the cornerstones of Semantic Web and can be used to solve many problems of clinical information and biomedical engineering, such as word sense disambiguation, semantic similarity, question answering, ontology alignment, etc. Manual construction of ontology is labor intensive and requires domain experts and ontology engineers. To downsize the labor-intensive nature of ontology generation and minimize the need for domain experts, we present a novel automated ontology generation framework, Linked Open Data approach for Automatic Biomedical Ontology Generation (LOD-ABOG), which is empowered by Linked Open Data (LOD). LOD-ABOG performs concept extraction using knowledge base mainly UMLS and LOD, along with Natural Language Processing (NLP) operations; and applies relation extraction using LOD, Breadth first Search (BSF) graph method, and Freepal repository patterns. Results: Our evaluation shows improved results in most of the tasks of ontology generation compared to those obtained by existing frameworks. We evaluated the performance of individual tasks (modules) of proposed framework using CDR and SemMedDB datasets. For concept extraction, evaluation shows an average F-measure of 58.12% for CDR corpus and 81.68% for SemMedDB; F-measure of 65.26% and 77.44% for biomedical taxonomic relation extraction using datasets of CDR and SemMedDB, respectively; and F-measure of 52.78% and 58.12% for biomedical non-taxonomic relation extraction using CDR corpus and SemMedDB, respectively. Additionally, the comparison with manually constructed baseline Alzheimer ontology shows F-measure of 72.48% in terms of concepts detection, 76.27% in relation extraction, and 83.28% in property extraction. Also, we compared our proposed framework with ontology learning framework called "OntoGain" which shows that LOD-ABOG performs 14.76% better in terms of relation extraction. Conclusion: This paper has presented LOD-ABOG framework which shows that current LOD sources and technologies are a promising solution to automate the process of biomedical ontology generation and extract relations to a greater extent. In addition, unlike existing frameworks which require domain experts in ontology development process, the proposed approach requires involvement of them only for improvement purpose at the end of ontology life cycle.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Linked open data-based framework for automatic biomedical ontology generation
    Mazen Alobaidi
    Khalid Mahmood Malik
    Susan Sabra
    BMC Bioinformatics, 19
  • [2] Automated ontology generation framework powered by linked biomedical ontologies for disease-drug domain
    Alobaidi, Mazen
    Malik, Khalid Mahmood
    Hussain, Maqbool
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2018, 165 : 117 - 128
  • [3] Linked open data-based explanations for transparent recommender systems
    Musto, Cataldo
    Narducci, Fedelucio
    Lops, Pasquale
    de Gemmis, Marco
    Semeraro, Giovanni
    INTERNATIONAL JOURNAL OF HUMAN-COMPUTER STUDIES, 2019, 121 : 93 - 107
  • [4] OLOUD - An Ontology for Linked Open University Data
    Fleineri, Rita
    Szasz, Barnabas
    Micsik, Andras
    ACTA POLYTECHNICA HUNGARICA, 2017, 14 (04) : 63 - 82
  • [5] The Current State of Linked Data-based Recommender Systems
    Mandi, Ahmed Mounaf
    Hadi, Asaad Sabah
    PROCEEDING OF 2021 2ND INFORMATION TECHNOLOGY TO ENHANCE E-LEARNING AND OTHER APPLICATION (IT-ELA 2021), 2021, : 154 - 160
  • [6] Constructing Biomedical Knowledge Graph Based on SemMedDB and Linked Open Data
    Cong, Qing
    Feng, Zhiyong
    Li, Fang
    Zhang, Li
    Rao, Guozheng
    Tao, Cui
    PROCEEDINGS 2018 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2018, : 1628 - 1631
  • [7] Implementing Automatic Ontology Generation for the New Zealand Open Government Data: An Evaluative Approach
    Kaur, Paramjeet
    Nand, Parma
    ADVANCES IN COMPUTING AND DATA SCIENCES, PT I, 2021, 1440 : 26 - 36
  • [8] Linked Open Data for Legislative Domain - Ontology and Experimental Data
    Necasky, Martin
    Knap, Tomas
    Klimek, Jakub
    Holubova, Irena
    Vidova-Hladka, Barbora
    BUSINESS INFORMATION SYSTEMS WORKSHOPS, BIS 2013, 2013, 160 : 172 - 183
  • [9] Public funding accountability: a linked open data-based methodology for analysing the scientific productivity and influence of funded projects
    Perianes-Rodriguez, Antonio
    Olmeda-Gomez, Carlos
    Delbianco, Natalia R.
    Gracio, Maria Claudia Cabrini
    SCIENTOMETRICS, 2024, 129 (10) : 5841 - 5868
  • [10] Design and Development of a Linked Open Data-Based Health Information Representation and Visualization System: Potentials and Preliminary Evaluation
    Tilahun, Binyam
    Kauppinen, Tomi
    Kessler, Carsten
    Fritz, Fleur
    JMIR MEDICAL INFORMATICS, 2014, 2 (02) : 196 - 208