Mixed Hierarchical Networks for Deep Entity Matching

被引:0
|
作者
Chen-Chen Sun
De-Rong Shen
机构
[1] Engineering Research Center of Learning-Based Intelligent System (Ministry of Education),School of Computer Science and Engineering
[2] Tianjin University of Technology,School of Computer Science and Engineering
[3] Tianjin University of Technology,undefined
[4] Northeastern University,undefined
来源
Journal of Computer Science and Technology | 2021年 / 36卷
关键词
entity matching; attention mechanism; mixed hierarchical neural network (MHN); domain adaption; data integration;
D O I
暂无
中图分类号
学科分类号
摘要
Entity matching is a fundamental problem of data integration. It groups records according to underlying real-world entities. There is a growing trend of entity matching via deep learning techniques. We design mixed hierarchical deep neural networks (MHN) for entity matching, exploiting semantics from different abstract levels in the record internal hierarchy. A family of attention mechanisms is utilized in different periods of entity matching. Self-attention focuses on internal dependency, inter-attention targets at alignments, and multi-perspective weight attention is devoted to importance discrimination. Especially, hybrid soft token alignment is proposed to address corrupted data. Attribute order is for the first time considered in deep entity matching. Then, to reduce utilization of labeled training data, we propose an adversarial domain adaption approach (DA-MHN) to transfer matching knowledge between different entity matching tasks by maximizing classifier discrepancy. Finally, we conduct comprehensive experimental evaluations on 10 datasets (seven for MHN and three for DA-MHN), which illustrate our two proposed approaches’ superiorities. MHN apparently outperforms previous studies in accuracy, and also each component of MHN is tested. DA-MHN greatly surpasses existing studies in transferability.
引用
收藏
页码:822 / 838
页数:16
相关论文
共 50 条
  • [1] Mixed Hierarchical Networks for Deep Entity Matching
    Sun, Chen-Chen
    Shen, De-Rong
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2021, 36 (04) : 822 - 838
  • [2] DEM: Deep Entity Matching Across Heterogeneous Information Networks
    Kong, Chao
    Chen, Bao-Xiang
    Zhang, Li-Ping
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2020, 35 (04) : 739 - 750
  • [3] DEM: Deep Entity Matching Across Heterogeneous Information Networks
    Chao Kong
    Bao-Xiang Chen
    Li-Ping Zhang
    Journal of Computer Science and Technology, 2020, 35 : 739 - 750
  • [4] Deep Hierarchical Attention Networks for Text Matching in Information Retrieval
    Song, Meina
    Liu, Qing
    Haihong, E.
    PROCEEDINGS OF 2018 INTERNATIONAL CONFERENCE ON INFORMATION SYSTEMS AND COMPUTER AIDED EDUCATION (ICISCAE 2018), 2018, : 476 - 481
  • [5] Hierarchical Matching Network for Heterogeneous Entity Resolution
    Fu, Cheng
    Han, Xianpei
    He, Jiaming
    Sun, Le
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 3665 - 3671
  • [6] Knowledge entity learning and representation for ontology matching based on deep neural networks
    Qiu, Lirong
    Yu, Jia
    Pu, Qiumei
    Xiang, Chuncheng
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2017, 20 (02): : 969 - 977
  • [7] Knowledge entity learning and representation for ontology matching based on deep neural networks
    Lirong Qiu
    Jia Yu
    Qiumei Pu
    Chuncheng Xiang
    Cluster Computing, 2017, 20 : 969 - 977
  • [8] Deep Entity Matching: Challenges and Opportunities
    Li, Yuliang
    Li, Jinfeng
    Suhara, Yoshihiko
    Wang, Jin
    Hirota, Wataru
    Tan, Wang-Chiew
    ACM JOURNAL OF DATA AND INFORMATION QUALITY, 2021, 13 (01):
  • [9] Entity Matching in Online Social Networks
    Peled, Olga
    Fire, Michael
    Rokach, Lior
    Elovici, Yuval
    2013 ASE/IEEE INTERNATIONAL CONFERENCE ON SOCIAL COMPUTING (SOCIALCOM), 2013, : 339 - 344
  • [10] Neural Networks for Entity Matching: A Survey
    Barlaug, Nils
    Gulla, Jon Atle
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2021, 15 (03)