VersaMatch: Ontology Matching with Weak Supervision

被引:2
|
作者
Furst, Jonathan [1 ]
Argerich, Mauricio Fadel [2 ]
Cheng, Bin [3 ]
机构
[1] Zurich Univ Appl Sci, NEC Labs Europe, Zurich, Switzerland
[2] Univ Politecn Madrid, NEC Labs Europe, Madrid, Spain
[3] Springer Nat, NEC Labs Europe, Berlin, Germany
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2023年 / 16卷 / 06期
关键词
D O I
10.14778/3583140.3583148
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Ontology matching is crucial to data integration for across-silo data sharing and has been mainly addressed with heuristic and machine learning (ML) methods. While heuristic methods are often inflexible and hard to extend to new domains, ML methods rely on substantial and hard to obtain amounts of labeled training data. To overcome these limitations, we propose VersaMatch, a flexible, weakly-supervised ontology matching system. VersaMatch employs various weak supervision sources, such as heuristic rules, pattern matching, and external knowledge bases, to produce labels from a large amount of unlabeled data for training a discriminative ML model. For prediction, VersaMatch develops a novel ensemble model combining the weak supervision sources with the discriminative model to support generalization while retaining a high precision. Our ensemble method boosts end model performance by 4 points compared to a traditional weak-supervision baseline. In addition, compared to state-of-the-art ontology matchers, VersaMatch achieves an overall 4-point performance improvement in F1 score across 26 ontology combinations from different domains. For recently released, in-the-wild datasets, VersaMatch beats the next best matchers by 9 points in F1. Furthermore, its core weak-supervision logic can easily be improved by adding more knowledge sources and collecting more unlabeled data for training.
引用
收藏
页码:1305 / 1318
页数:14
相关论文
共 50 条
  • [41] Survey on complex ontology matching
    Thieblin, Elodie
    Haemmerle, Ollivier
    Hernandez, Nathalie
    Trojahn, Cassia
    SEMANTIC WEB, 2020, 11 (04) : 689 - 727
  • [42] Active and Incremental Learning with Weak Supervision
    Brust, Clemens-Alexander
    Kaeding, Christoph
    Denzler, Joachim
    KUNSTLICHE INTELLIGENZ, 2020, 34 (02): : 165 - 180
  • [43] End-to-End Weak Supervision
    Cachay, Salva Ruehling
    Boecking, Benedikt
    Dubrawski, Artur
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [44] Contextualized Weak Supervision for Text Classification
    Mekala, Dheeraj
    Shang, Jingbo
    58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020, : 323 - 333
  • [45] Self-Training with Weak Supervision
    Karamanolakis, Giannis
    Mukherjee, Subhabrata
    Zheng, Guoqing
    Awadallah, Ahmed Hassan
    2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), 2021, : 845 - 863
  • [46] Weak Supervision for Learning Discourse Structure
    Badene, Sonia
    Thompson, Kate
    Lorre, Jean-Pierre
    Asher, Nicholas
    2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 2296 - 2305
  • [47] Policy Learning Using Weak Supervision
    Wang, Jingkang
    Guo, Hongyi
    Zhu, Zhaowei
    Liu, Yang
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [48] Classifying process deviations with weak supervision
    Laghmouch, Manal
    Jans, Mieke
    Depaire, Benoit
    2020 2ND INTERNATIONAL CONFERENCE ON PROCESS MINING (ICPM 2020), 2020, : 89 - 96
  • [49] Bootstrapping Conversational Agents with Weak Supervision
    Mallinar, Neil
    Shah, Abhishek
    Ugrani, Rajendra
    Gupta, Ayush
    Gurusankar, Manikandan
    Ho, Tin Kam
    Liao, Q. Vera
    Zhang, Yunfeng
    Bellamy, Rachel K. E.
    Yates, Robert
    Desmarais, Chris
    McGregor, Blake
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 9528 - 9533
  • [50] Unified Semantic Parsing with Weak Supervision
    Agrawal, Priyanka
    Jain, Parag
    Dalmia, Ayushi
    Bansal, Abhishek
    Mittal, Ashish
    Sankaranarayanan, Karthik
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 4801 - 4810