Ontology Alignment Based on Word Embedding and Random Forest Classification

被引:9
作者
Nkisi-Orji, Ikechukwu [1 ]
Wiratunga, Nirmalie [1 ]
Massie, Stewart [1 ]
Hui, Kit-Ying [1 ]
Heaven, Rachel [2 ]
机构
[1] Robert Gordon Univ, Aberdeen, Scotland
[2] British Geol Survey, Nottingham, England
来源
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2018, PT I | 2019年 / 11051卷
关键词
Ontology alignment; Word embedding; Machine classification; Semantic web; AGGREGATION;
D O I
10.1007/978-3-030-10925-7_34
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Ontology alignment is crucial for integrating heterogeneous data sources and forms an important component of the semantic web. Accordingly, several ontology alignment techniques have been proposed and used for discovering correspondences between the concepts (or entities) of different ontologies. Most alignment techniques depend on string-based similarities which are unable to handle the vocabulary mismatch problem. Also, determining which similarity measures to use and how to effectively combine them in alignment systems are challenges that have persisted in this area. In this work, we introduce a random forest classifier approach for ontology alignment which relies on word embedding for determining a variety of semantic similarity features between concepts. Specifically, we combine string-based and semantic similarity measures to form feature vectors that are used by the classifier model to determine when concepts align. By harnessing background knowledge and relying on minimal information from the ontologies, our approach can handle knowledge-light ontological resources. It also eliminates the need for learning the aggregation weights of a composition of similarity measures. Experiments using Ontology Alignment Evaluation Initiative (OAEI) dataset and real-world ontologies highlight the utility of our approach and show that it can outperform state-of-the-art alignment systems. Code related to this paper is available at: https://bitbucket.org/paravariar/rafcom.
引用
收藏
页码:557 / 572
页数:16
相关论文
共 50 条
  • [41] Learning Topic-Oriented Word Embedding for Query Classification
    Yang, Hebin
    Hu, Qinmin
    He, Liang
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PART I, 2015, 9077 : 188 - 198
  • [42] Ontology Alignment Based Service Interface Adaptation
    Jin, Lu
    Wu, Jian
    Yin, Jianwei
    Li, Ying
    Deng, Shuiguang
    2009 IEEE INTERNATIONAL CONFERENCE ON SERVICES COMPUTING, 2009, : 494 - 497
  • [43] Deontic Logic Based Ontology Alignment Technique for E-Learning
    Deborah, Lazarus
    Baskaran, Ramachandran
    Kannan, Arputharaj
    INTERNATIONAL JOURNAL OF INTELLIGENT INFORMATION TECHNOLOGIES, 2012, 8 (03) : 56 - 72
  • [44] A Method for Ontology Alignment Based on Semantics of Attributes
    Pietranik, Marcin
    Ngoc Thanh Nguyen
    CYBERNETICS AND SYSTEMS, 2012, 43 (04) : 319 - 339
  • [45] A SURVEY ON AGENT-BASED ONTOLOGY ALIGNMENT
    Davidovsky, Maxim
    Ermolayev, Vadim
    Tolok, Vyacheslav
    ICAART: PROCEEDINGS OF THE 4TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE, VOL. 2, 2012, : 355 - 361
  • [46] Fuzzy based approach to ontology relations alignment
    Hnatkowska, Bogumila
    Kozierkiewicz, Adrianna
    Pietranik, Marcin
    IEEE CIS INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS 2021 (FUZZ-IEEE), 2021,
  • [47] Combining Document Classification and Ontology Alignment for Semantically Enriching Web Services
    Marco Crasso
    Alejandro Zunino
    Marcelo Campo
    New Generation Computing, 2010, 28 : 371 - 403
  • [48] Combining Document Classification and Ontology Alignment for Semantically Enriching Web Services
    Crasso, Marco
    Zunino, Alejandro
    Campo, Marcelo
    NEW GENERATION COMPUTING, 2010, 28 (04) : 371 - 403
  • [49] Text Semantic Steganalysis Based on Word Embedding
    Zuo, Xin
    Hu, Huanhuan
    Zhang, Weiming
    Yu, Nenghai
    CLOUD COMPUTING AND SECURITY, PT IV, 2018, 11066 : 485 - 495
  • [50] Analysing the Semantic Change Based on Word Embedding
    Liao, Xuanyi
    Cheng, Guang
    NATURAL LANGUAGE UNDERSTANDING AND INTELLIGENT APPLICATIONS (NLPCC 2016), 2016, 10102 : 213 - 223