M 2 ixKG: Mixing for harder negative samples in knowledge graph

被引:3
作者
Che, Feihu [1 ]
Tao, Jianhua [1 ,2 ]
机构
[1] Tsinghua Univ, Dept Automat, Beijing, Peoples R China
[2] Tsinghua Univ, Beijing Natl Res Ctr Informat Sci & Technol, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Negative sampling; Knowledge graph; Mixing operation; Hard negatives;
D O I
10.1016/j.neunet.2024.106358
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Knowledge graph embedding (KGE) involves mapping entities and relations to low -dimensional dense embeddings, enabling a wide range of real -world applications. The mapping is achieved via distinguishing the positive and negative triplets in knowledge graphs. Therefore, how to design high -quality negative triplets is critical in the effectiveness of KEG models. Existing KGE models face challenges in generating high -quality negative triplets. Some models employ simple static distributions, i.e. uniform or Bernoulli distribution, and it is difficult for these methods to be trained distinguishably because of the sampled uninformative negative triplets. Furthermore, current methods are confined to constructing negative triplets from existing entities within the knowledge graph, limiting their ability to explore harder negatives. We introduce a novel mixing strategy in knowledge graphs called M 2 ixKG. M 2 ixKG adopts mixing operation in generating harder negative samples from two aspects: one is mixing among the heads and tails in triplets with the same relation to strengthen the robustness and generalization of the entity embeddings; the other is mixing the negatives with high scores to generate harder negatives. Our experiments, utilizing three datasets and four classical score functions, highlight the exceptional performance of M 2 ixKG in comparison to previous negative sampling algorithms.
引用
收藏
页数:10
相关论文
共 53 条
[1]  
Ahrabian K, 2020, Arxiv, DOI arXiv:2009.11355
[2]  
[Anonymous], 2019, Advances in Neural Information Processing Systems
[3]  
Balazevic I, 2019, 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019), P5185
[4]  
Bollacker K., 2008, P 2008 ACM SIGMOD IN, P1247, DOI DOI 10.1145/1376616.1376746
[5]  
Bordes A., 2014, MACHINE LEARNING KNO, V8724, P165
[6]  
Bordes A, 2014, Arxiv, DOI arXiv:1406.3676
[7]  
Bordes Antoine, 2013, Advances in neural information processing systems, P2787
[8]  
Cai L., 2018, P ACL
[9]  
Chami Ines, 2020, Low dimensional hyperbolic knowledge graph embeddings
[10]  
Chen JA, 2020, Arxiv, DOI arXiv:2004.12239