Variety-aware GAN and online learning augmented self-training model for knowledge graph entity alignment

被引:8
作者
Qian, Ye [1 ]
Pan, Li [1 ,2 ,3 ]
机构
[1] Shanghai Jiao Tong Univ, Inst Cyber Sci & Technol, Sch Elect Informat & Elect Engn, Shanghai 200240, Peoples R China
[2] Shanghai Key Lab Integrated Adm Technol Informat S, Shanghai 200240, Peoples R China
[3] Shanghai Jiao Tong Univ, Zhang jiang Inst Adv Study, Shanghai 201203, Peoples R China
基金
中国国家自然科学基金;
关键词
Entity alignment; Self-training; Generative adversarial network; Online learning;
D O I
10.1016/j.ipm.2023.103472
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recently, self-training strategies are adopted in some entity alignment methods, which address the scarcity of training data by selecting newly-aligned pairs from the predicted alignment of unlabeled data. However, the boundary between positive and negative pairs in predicted alignment is difficult to determine, which may lead to inappropriate alignment in newly-aligned pairs. Besides, some pre-aligned pairs have been fitted by the method during training iteration, and combining all pre-aligned pairs with newly-aligned pairs to retrain the method may result in overfitting problems. To address these problems, a Self-training Entity Alignment Framework based on Variety-aware GAN and Online Learning Algorithm named SEAGAN is proposed in this paper. To select reliable newly-aligned pairs from the predicted alignment, a variety-aware GAN with a metric of match variety that eliminates negative pairs differing significantly from positive pairs is designed. It leverages the distribution of entity pairs to determine the boundary between different types of pairs. Moreover, SEAGAN designs an online learning algorithm that combines newly-aligned pairs with their one-hop and two-hop neighbor entities in pre -aligned pairs to update model parameters, which alleviates overfitting problems. We conduct extensive experiments on four real-world datasets to compare SEAGAN with fifteen state-of-the-art methods. Experiment results show that SEAGAN has better performance than state-of-the-art methods on metrics of Prec@1, Prec@5, and MRR.
引用
收藏
页数:16
相关论文
共 48 条
[1]   A Critical Assessment of State-of-the-Art in Entity Alignment [J].
Berrendorf, Max ;
Wacker, Ludwig ;
Faerman, Evgeniy .
ADVANCES IN INFORMATION RETRIEVAL, ECIR 2021, PT II, 2021, 12657 :18-32
[2]   Multi-heterogeneous neighborhood-aware for Knowledge Graphs alignment [J].
Cai, Weishan ;
Wang, Yizhao ;
Mao, Shun ;
Zhan, Jieyu ;
Jiang, Yuncheng .
INFORMATION PROCESSING & MANAGEMENT, 2022, 59 (01)
[3]  
Cai Weishan, 2022, IEEE Transactions on Knowledge and Data Engineering
[4]   MAUIL: Multilevel attribute embedding for semisupervised user identity linkage [J].
Chen, Baiyang ;
Chen, Xiaoliang .
INFORMATION SCIENCES, 2022, 593 :527-545
[5]   Multi-modal Siamese Network for Entity Alignment [J].
Chen, Liyi ;
Li, Zhi ;
Xu, Tong ;
Wu, Han ;
Wang, Zhefeng ;
Yuan, Nicholas Jing ;
Chen, Enhong .
PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, :118-126
[6]  
Chen MH, 2017, PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P1511
[7]   Interactive online learning for graph matching using active strategies [J].
Conte, Donatello ;
Serratosa, Francesc .
KNOWLEDGE-BASED SYSTEMS, 2020, 205
[8]  
Cui Zeyu, 2022, IEEE Transactions on Neural Networks and Learning Systems
[9]   Self-learning and embedding based entity alignment [J].
Guan, Saiping ;
Jin, Xiaolong ;
Wang, Yuanzhuo ;
Jia, Yantao ;
Shen, Huawei ;
Li, Zixuan ;
Cheng, Xueqi .
KNOWLEDGE AND INFORMATION SYSTEMS, 2019, 59 (02) :361-386
[10]   Domain-Adversarial Network Alignment [J].
Hong, Huiting ;
Li, Xin ;
Pan, Yuangang ;
Tsang, Ivor W. .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (07) :3211-3224