Linked open data-based framework for automatic biomedical ontology generation

被引:19
作者
Alobaidi, Mazen [1 ,2 ]
Malik, Khalid Mahmood [1 ]
Sabra, Susan [1 ]
机构
[1] Oakland Univ, Comp Sci & Engn Dept, 2200 N Squirrel Rd, Rochester, MI 48309 USA
[2] Micro Focus Int Plc, Troy, MI 48084 USA
来源
BMC BIOINFORMATICS | 2018年 / 19卷
关键词
Semantic web; Ontology generation; Linked open data; Semantic enrichment; INFORMATION; EXTRACTION; TEXT;
D O I
10.1186/s12859-018-2339-3
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Fulfilling the vision of Semantic Web requires an accurate data model for organizing knowledge and sharing common understanding of the domain. Fitting this description, ontologies are the cornerstones of Semantic Web and can be used to solve many problems of clinical information and biomedical engineering, such as word sense disambiguation, semantic similarity, question answering, ontology alignment, etc. Manual construction of ontology is labor intensive and requires domain experts and ontology engineers. To downsize the labor-intensive nature of ontology generation and minimize the need for domain experts, we present a novel automated ontology generation framework, Linked Open Data approach for Automatic Biomedical Ontology Generation (LOD-ABOG), which is empowered by Linked Open Data (LOD). LOD-ABOG performs concept extraction using knowledge base mainly UMLS and LOD, along with Natural Language Processing (NLP) operations; and applies relation extraction using LOD, Breadth first Search (BSF) graph method, and Freepal repository patterns. Results: Our evaluation shows improved results in most of the tasks of ontology generation compared to those obtained by existing frameworks. We evaluated the performance of individual tasks (modules) of proposed framework using CDR and SemMedDB datasets. For concept extraction, evaluation shows an average F-measure of 58.12% for CDR corpus and 81.68% for SemMedDB; F-measure of 65.26% and 77.44% for biomedical taxonomic relation extraction using datasets of CDR and SemMedDB, respectively; and F-measure of 52.78% and 58.12% for biomedical non-taxonomic relation extraction using CDR corpus and SemMedDB, respectively. Additionally, the comparison with manually constructed baseline Alzheimer ontology shows F-measure of 72.48% in terms of concepts detection, 76.27% in relation extraction, and 83.28% in property extraction. Also, we compared our proposed framework with ontology learning framework called "OntoGain" which shows that LOD-ABOG performs 14.76% better in terms of relation extraction. Conclusion: This paper has presented LOD-ABOG framework which shows that current LOD sources and technologies are a promising solution to automate the process of biomedical ontology generation and extract relations to a greater extent. In addition, unlike existing frameworks which require domain experts in ontology development process, the proposed approach requires involvement of them only for improvement purpose at the end of ontology life cycle.
引用
收藏
页数:13
相关论文
共 50 条
  • [11] Curation of physical objects in botany: Architecture and development of a linked open data-based application
    Mauricio Yagui, Marcela Mayumi
    Monsores Passos Maia, Luis Fernando
    Oliveira, Jonice
    Vivacqua, Adriana S.
    2017 IEEE 15TH INTL CONF ON DEPENDABLE, AUTONOMIC AND SECURE COMPUTING, 15TH INTL CONF ON PERVASIVE INTELLIGENCE AND COMPUTING, 3RD INTL CONF ON BIG DATA INTELLIGENCE AND COMPUTING AND CYBER SCIENCE AND TECHNOLOGY CONGRESS(DASC/PICOM/DATACOM/CYBERSCI, 2017, : 889 - 892
  • [12] TongSACOM: A TongYiCiCiLin and Sequence Alignment-Based Ontology Mapping Model for Chinese Linked Open Data
    Wang, Ting
    Xu, Tiansheng
    Tang, Zheng
    Todo, Yuki
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2017, E100D (06) : 1251 - 1261
  • [13] The LODeXporter: Flexible Generation of Linked Open Data Triples from NLP Frameworks for Automatic Knowledge Base Construction
    Witte, Rene
    Sateli, Bahar
    PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018), 2018, : 2423 - 2428
  • [14] Legal Compliance in a Linked Open Data Framework
    Francesconi, Enrico
    Governatori, Guido
    LEGAL KNOWLEDGE AND INFORMATION SYSTEMS (JURIX 2019), 2019, 322 : 175 - 180
  • [15] Ontology Based Framework for Automatic Software's Documentation
    Bhatia, M. P. S.
    Kumar, Akshi
    Beniwal, Rohit
    2015 2ND INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT (INDIACOM), 2015, : 421 - 424
  • [16] Linked Open Data in the Biomedical Information Area: A Keywords Analysis
    Bonacina, Stefano
    MEDINFO 2019: HEALTH AND WELLBEING E-NETWORKS FOR ALL, 2019, 264 : 1429 - 1430
  • [17] A linked open data framework to enhance the discoverability and impact of culture heritage
    Candela, Gustavo
    Escobar, Pilar
    Carrasco, Rafael C.
    Marco-Such, Manuel
    JOURNAL OF INFORMATION SCIENCE, 2019, 45 (06) : 756 - 766
  • [18] An Integrated Framework for RESTful Web Services Using Linked Open Data
    Modi, Kiritkumar J.
    Garg, Sanjay
    Chaudhary, Sanjay
    INTERNATIONAL JOURNAL OF GRID AND HIGH PERFORMANCE COMPUTING, 2019, 11 (02) : 24 - 49
  • [19] Design and Development of a Linked Open Data-Based Web Portal for Sharing IoT Health and Fitness Datasets
    Reda, Roberto
    Carbonaro, Antonella
    GOODTECHS '18: PROCEEDINGS OF THE 4TH EAI INTERNATIONAL CONFERENCE ON SMART OBJECTS AND TECHNOLOGIES FOR SOCIAL GOOD (GOODTECHS), 2018, : 43 - 48
  • [20] BioOntoVerb: A top level ontology based framework to populate biomedical ontologies from texts
    Maria Ruiz-Martinez, Juana
    Valencia-Garcia, Rafael
    Martinez-Bejar, Rodrigo
    Hoffmann, Achim
    KNOWLEDGE-BASED SYSTEMS, 2012, 36 : 68 - 80