Dynamic Strategies for High Performance Training of Knowledge Graph Embeddings

被引：0

作者：

Panda, Anwesh ^{[1
]}

Vadhiyar, Sathish ^{[1
]}

机构：

[1] Indian Inst Sci, Dept Computat & Data, Bangalore, India

来源：

51ST INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, ICPP 2022 | 2022年

关键词：

Knowledge graph embeddings; communication minimization; gradient quantization; selection of gradient vectors;

D O I：

10.1145/3545008.3545075

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

Knowledge graph embeddings (KGEs) are the low dimensional representations of entities and relations between the entities. They can be used for various downstream tasks such as triple classification, link prediction, knowledge base completion, etc. Training these embeddings for a large dataset takes a huge amount of time. This work proposes strategies to make the training of KGEs faster in a distributed memory parallel environment. The first strategy is to choose between either an all-gather or an all-reduce operation based on the sparsity of the gradient matrix. The second strategy focuses on selecting those gradient vectors which significantly contribute to the reduction in the loss. The third strategy employs gradient quantization to reduce the number of bits to be communicated. The fourth strategy proposes to split the knowledge graph triples based on relations so that inter-node communication for the gradient matrix corresponding to the relation embedding matrix is eliminated. The fifth and last strategy is to select the negative triple which the model finds difficult to classify. All the strategies are combined and this allows us to train the ComplEx Knowledge Graph Embedding (KGE) model on the FB250K dataset in 6 hours with 16 nodes when compared to 11.5 hours taken to train on the same number of nodes without applying any of the above optimizations. This reduction in training time is also accompanied by a significant improvement in Mean Reciprocal Rank (MRR) and Triple Classification Accuracy (TCA).

引用

页数：10

共 43 条

[21] Named Entity Recognition using Knowledge Graph Embeddings and DistilBERT
Mehta, Shreyansh
Radke, Mansi A.
Sunkle, Sagar
2021 5TH INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND INFORMATION RETRIEVAL, NLPIR 2021, 2021, : 146 - 150
[22] Effects of Locality and Rule Language on Explanations for Knowledge Graph Embeddings
Galarraga, Luis
ADVANCES IN INTELLIGENT DATA ANALYSIS XXI, IDA 2023, 2023, 13876 : 143 - 155
[23] Bringing Back Semantics to Knowledge Graph Embeddings: An Interpretability Approach
Domingues, Antoine
Jain, Nitisha
Penuela, Albert Merono
Simperl, Elena
NEURAL-SYMBOLIC LEARNING AND REASONING, PT II, NESY 2024, 2024, 14980 : 192 - 203
[24] On Training Knowledge Graph Embedding Models
Mohamed, Sameh K.
Munoz, Emir
Novacek, Vit
INFORMATION, 2021, 12 (04)
[25] The KEEN Universe An Ecosystem for Knowledge Graph Embeddings with a Focus on Reproducibility and Transferability
Ali, Mehdi
Jabeen, Hajira
Hoyt, Charles Tapley
Lehmann, Jens
SEMANTIC WEB - ISWC 2019, PT II, 2019, 11779 : 3 - 18
[26] Tab2Onto: Unsupervised Semantification with Knowledge Graph Embeddings
Zahera, Hamada M.
Heindorf, Stefan
Balke, Stefan
Haupt, Jonas
Voigt, Martin
Walter, Carolin
Witter, Fabian
Ngomo, Axel-Cyrille Ngonga
SEMANTIC WEB: ESWC 2022 SATELLITE EVENTS, 2022, 13384 : 47 - 51
[27] Duality-Induced Regularizer for Semantic Matching Knowledge Graph Embeddings
Wang, Jie
Zhang, Zhanqiu
Shi, Zhihao
Cai, Jianyu
Ji, Shuiwang
Wu, Feng
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (02) : 1652 - 1667
[28] MulDE: Multi-teacher Knowledge Distillation for Low-dimensional Knowledge Graph Embeddings
Wang, Kai
Liu, Yu
Ma, Qian
Sheng, Quan Z.
PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE 2021 (WWW 2021), 2021, : 1716 - 1726
[29] Hardware-agnostic computation for large-scale knowledge graph embeddings
Demir, Caglar
Ngomo, Axel-Cyrille Ngonga
SOFTWARE IMPACTS, 2022, 13
[30] A knowledge graph embeddings based approach for author name disambiguation using literals
Cristian Santini
Genet Asefa Gesese
Silvio Peroni
Aldo Gangemi
Harald Sack
Mehwish Alam
Scientometrics, 2022, 127 : 4887 - 4912

← 1 2 3 4 5 →