Ontology Alignment Based on Word Embedding and Random Forest Classification

被引:9
作者
Nkisi-Orji, Ikechukwu [1 ]
Wiratunga, Nirmalie [1 ]
Massie, Stewart [1 ]
Hui, Kit-Ying [1 ]
Heaven, Rachel [2 ]
机构
[1] Robert Gordon Univ, Aberdeen, Scotland
[2] British Geol Survey, Nottingham, England
来源
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2018, PT I | 2019年 / 11051卷
关键词
Ontology alignment; Word embedding; Machine classification; Semantic web; AGGREGATION;
D O I
10.1007/978-3-030-10925-7_34
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Ontology alignment is crucial for integrating heterogeneous data sources and forms an important component of the semantic web. Accordingly, several ontology alignment techniques have been proposed and used for discovering correspondences between the concepts (or entities) of different ontologies. Most alignment techniques depend on string-based similarities which are unable to handle the vocabulary mismatch problem. Also, determining which similarity measures to use and how to effectively combine them in alignment systems are challenges that have persisted in this area. In this work, we introduce a random forest classifier approach for ontology alignment which relies on word embedding for determining a variety of semantic similarity features between concepts. Specifically, we combine string-based and semantic similarity measures to form feature vectors that are used by the classifier model to determine when concepts align. By harnessing background knowledge and relying on minimal information from the ontologies, our approach can handle knowledge-light ontological resources. It also eliminates the need for learning the aggregation weights of a composition of similarity measures. Experiments using Ontology Alignment Evaluation Initiative (OAEI) dataset and real-world ontologies highlight the utility of our approach and show that it can outperform state-of-the-art alignment systems. Code related to this paper is available at: https://bitbucket.org/paravariar/rafcom.
引用
收藏
页码:557 / 572
页数:16
相关论文
共 50 条
  • [21] Correlation analysis and text classification of chemical accident cases based on word embedding
    Jing, Sifeng
    Liu, Xiwei
    Gong, Xiaoyan
    Tang, Ying
    Xiong, Gang
    Liu, Sheng
    Xiang, Shuguang
    Bi, Rongshan
    PROCESS SAFETY AND ENVIRONMENTAL PROTECTION, 2022, 158 (698-710) : 698 - 710
  • [22] Tagged Video Retrieval System using Domain Ontology and Word Embedding
    Hahm, Gyeong-june
    Kwak, Chang-uk
    Kin, Sun-joong
    2017 INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY CONVERGENCE (ICTC), 2017, : 1100 - 1102
  • [23] Text classification with improved word embedding and adaptive segmentation
    Sun, Guoying
    Cheng, Yanan
    Zhang, Zhaoxin
    Tong, Xiaojun
    Chai, Tingting
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 238
  • [24] Emotion Classification on Youtube Comments using Word Embedding
    Savigny, Julio
    Purwarianti, Ayu
    2017 4TH INTERNATIONAL CONFERENCE ON ADVANCED INFORMATICS, CONCEPTS, THEORY, AND APPLICATIONS (ICAICTA) PROCEEDINGS, 2017,
  • [25] Text Classification Using Word Embedding in Rule-Based Methodologies: A Systematic Mapping
    Aubaid, Asmaa M.
    Mishra, Alok
    TEM JOURNAL-TECHNOLOGY EDUCATION MANAGEMENT INFORMATICS, 2018, 7 (04): : 902 - 914
  • [26] Exploring the effectiveness of word embedding based deep learning model for improving email classification
    Asudani, Deepak Suresh
    Nagwani, Naresh Kumar
    Singh, Pradeep
    DATA TECHNOLOGIES AND APPLICATIONS, 2022, 56 (04) : 483 - 505
  • [27] Updating Ontology Alignment on the Instance Level Based on Ontology Evolution
    Kozierkiewicz, Adrianna
    Pietranik, Marcin
    Nguyen, Loan T. T.
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, DEXA 2020, PT II, 2020, 12392 : 301 - 311
  • [28] Updating Ontology Alignment on the Concept Level Based on Ontology Evolution
    Kozierkiewicz, Adrianna
    Pietranik, Marcin
    ADVANCES IN DATABASES AND INFORMATION SYSTEMS, ADBIS 2019, 2019, 11695 : 201 - 214
  • [29] Wasf-Vec: Topology-based Word Embedding for Modern Standard Arabic and Iraqi Dialect Ontology
    Abdulhameed, Tiba Zaki
    Zitouni, Imed
    Abdel-Qader, Ikhlas
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2020, 19 (02)
  • [30] Word Embedding Composition for Data Imbalances in Sentiment and Emotion Classification
    Xu, Ruifeng
    Chen, Tao
    Xia, Yunqing
    Lu, Qin
    Liu, Bin
    Wang, Xuan
    COGNITIVE COMPUTATION, 2015, 7 (02) : 226 - 240