Variety-aware GAN and online learning augmented self-training model for knowledge graph entity alignment

被引：8

作者：

Qian, Ye ^{[1
]}

Pan, Li ^{[1
,2
,3
]}

机构：

[1] Shanghai Jiao Tong Univ, Inst Cyber Sci & Technol, Sch Elect Informat & Elect Engn, Shanghai 200240, Peoples R China

[2] Shanghai Key Lab Integrated Adm Technol Informat S, Shanghai 200240, Peoples R China

[3] Shanghai Jiao Tong Univ, Zhang jiang Inst Adv Study, Shanghai 201203, Peoples R China

来源：

INFORMATION PROCESSING & MANAGEMENT | 2023年 / 60卷 / 05期

基金：

中国国家自然科学基金;

关键词：

Entity alignment; Self-training; Generative adversarial network; Online learning;

D O I：

10.1016/j.ipm.2023.103472

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Recently, self-training strategies are adopted in some entity alignment methods, which address the scarcity of training data by selecting newly-aligned pairs from the predicted alignment of unlabeled data. However, the boundary between positive and negative pairs in predicted alignment is difficult to determine, which may lead to inappropriate alignment in newly-aligned pairs. Besides, some pre-aligned pairs have been fitted by the method during training iteration, and combining all pre-aligned pairs with newly-aligned pairs to retrain the method may result in overfitting problems. To address these problems, a Self-training Entity Alignment Framework based on Variety-aware GAN and Online Learning Algorithm named SEAGAN is proposed in this paper. To select reliable newly-aligned pairs from the predicted alignment, a variety-aware GAN with a metric of match variety that eliminates negative pairs differing significantly from positive pairs is designed. It leverages the distribution of entity pairs to determine the boundary between different types of pairs. Moreover, SEAGAN designs an online learning algorithm that combines newly-aligned pairs with their one-hop and two-hop neighbor entities in pre -aligned pairs to update model parameters, which alleviates overfitting problems. We conduct extensive experiments on four real-world datasets to compare SEAGAN with fifteen state-of-the-art methods. Experiment results show that SEAGAN has better performance than state-of-the-art methods on metrics of Prec@1, Prec@5, and MRR.

引用

页数：16

共 48 条

[1] A Critical Assessment of State-of-the-Art in Entity Alignment [J].