VersaMatch: Ontology Matching with Weak Supervision

Cited by: 2
Authors
Furst, Jonathan [1]
Argerich, Mauricio Fadel [2]
Cheng, Bin [3]
Affiliations
[1] Zurich Univ Appl Sci, NEC Labs Europe, Zurich, Switzerland
[2] Univ Politecn Madrid, NEC Labs Europe, Madrid, Spain
[3] Springer Nat, NEC Labs Europe, Berlin, Germany
Source
Proceedings of the VLDB Endowment | 2023 / Volume 16 / Issue 06
DOI
10.14778/3583140.3583148
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Ontology matching is crucial to data integration for cross-silo data sharing and has mainly been addressed with heuristic and machine learning (ML) methods. While heuristic methods are often inflexible and hard to extend to new domains, ML methods rely on substantial amounts of labeled training data that are hard to obtain. To overcome these limitations, we propose VersaMatch, a flexible, weakly supervised ontology matching system. VersaMatch employs various weak supervision sources, such as heuristic rules, pattern matching, and external knowledge bases, to produce labels from a large amount of unlabeled data for training a discriminative ML model. For prediction, VersaMatch uses a novel ensemble model that combines the weak supervision sources with the discriminative model to support generalization while retaining high precision. Our ensemble method boosts end model performance by 4 points compared to a traditional weak-supervision baseline. In addition, compared to state-of-the-art ontology matchers, VersaMatch achieves an overall 4-point improvement in F1 score across 26 ontology combinations from different domains. For recently released, in-the-wild datasets, VersaMatch beats the next best matchers by 9 points in F1. Furthermore, its core weak-supervision logic can easily be improved by adding more knowledge sources and collecting more unlabeled data for training.
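The weak-supervision idea summarized in the abstract can be illustrated with a minimal sketch. This is not the VersaMatch implementation: the labeling functions, the `SYNONYMS` table (a stand-in for an external knowledge base), the majority-vote aggregation, the Jaccard scorer (a stand-in for a trained discriminative model), and the fallback rule in `ensemble_predict` are all assumptions chosen for illustration.

```python
import re
from typing import Callable, Tuple

MATCH, NO_MATCH, ABSTAIN = 1, 0, -1
Pair = Tuple[str, str]  # two ontology element labels to compare


def normalize(label: str) -> str:
    """Lowercase a label and treat '_' and '-' as spaces."""
    return label.lower().replace("_", " ").replace("-", " ").strip()


# Hypothetical synonym table standing in for an external knowledge base.
SYNONYMS = {frozenset({"author", "writer"}), frozenset({"car", "automobile"})}
DATE_PATTERN = re.compile(r"\b(date|time)\b")


def lf_exact_name(pair: Pair) -> int:
    """Heuristic rule: identical normalized labels are a match."""
    return MATCH if normalize(pair[0]) == normalize(pair[1]) else ABSTAIN


def lf_synonym(pair: Pair) -> int:
    """Knowledge-base lookup: known synonyms are a match."""
    key = frozenset({normalize(pair[0]), normalize(pair[1])})
    return MATCH if key in SYNONYMS else ABSTAIN


def lf_date_mismatch(pair: Pair) -> int:
    """Pattern matching: a date-like label paired with a non-date label is a non-match."""
    a_is_date = bool(DATE_PATTERN.search(normalize(pair[0])))
    b_is_date = bool(DATE_PATTERN.search(normalize(pair[1])))
    return NO_MATCH if a_is_date != b_is_date else ABSTAIN


LABELING_FUNCTIONS = [lf_exact_name, lf_synonym, lf_date_mismatch]


def weak_label(pair: Pair) -> int:
    """Aggregate labeling-function votes by simple majority; ties and silence abstain."""
    votes = [v for v in (lf(pair) for lf in LABELING_FUNCTIONS) if v != ABSTAIN]
    n_match = votes.count(MATCH)
    n_no_match = votes.count(NO_MATCH)
    if n_match == n_no_match:
        return ABSTAIN
    return MATCH if n_match > n_no_match else NO_MATCH


def jaccard_score(pair: Pair) -> float:
    """Stand-in for a trained model: token overlap between the two labels."""
    a, b = set(normalize(pair[0]).split()), set(normalize(pair[1]).split())
    return len(a & b) / len(a | b) if a | b else 0.0


def ensemble_predict(pair: Pair, model: Callable[[Pair], float],
                     threshold: float = 0.5) -> int:
    """Trust the weak sources when they vote; otherwise fall back to the model score."""
    label = weak_label(pair)
    if label != ABSTAIN:
        return label
    return MATCH if model(pair) >= threshold else NO_MATCH
```

In this sketch, pairs that receive a non-abstaining weak label would serve as training data for the discriminative model, and `ensemble_predict` consults the model only when every weak source abstains or the votes tie.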
Pages: 1305-1318
Page count: 14