Fast and Accurate Learning of Knowledge Graph Embeddings at Scale

被引:1
作者
Gupta, Udit [1 ]
Vadhiyar, Sathish [2 ]
机构
[1] Indian Inst Sci, Dept Computat & Data Sci, Bangalore, Karnataka, India
[2] Indian Inst Sci, Dept Computat & Data Sci, Supercomp Educ & Res Ctr, Bangalore, Karnataka, India
来源
2019 IEEE 26TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING, DATA, AND ANALYTICS (HIPC) | 2019年
关键词
Knowledge graph embeddings; distributed learning; Horovod;
D O I
10.1109/HiPC.2019.00030
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Knowledge Graph Embedding (KGE) is used to represent the entities and relations of a KG in a low dimensional vector space. KGE can then be used in a downstream task such as entity classification, link prediction and knowledge base completion. Training on large KG datasets takes a considerable amount of time. This work proposes three strategies which lead to faster training in distributed setting. The first strategy is a reduced communication approach which decreases the All-Gather size by sparsifying the Sparse Gradient Matrix (SGM). The second strategy is a variable margin approach that takes advantage of reduced communication for lower margins but retains the accuracy as obtained by the best fixed margin. The third strategy is called DistAdam which is a distributed version of the popular Adam optimization algorithm. Combining the three strategies results in reduction of training time for the FB250K dataset from twenty-seven hours on one processing node to under one hour on thirty-two nodes with each node consisting of twenty-four cores.
引用
收藏
页码:173 / 182
页数:10
相关论文
共 15 条
[1]  
[Anonymous], 1999, USING MPI PORTABLE P
[2]   Demystifying Parallel and Distributed Deep Learning: An In-depth Concurrency Analysis [J].
Ben-Nun, Tal ;
Hoefler, Torsten .
ACM COMPUTING SURVEYS, 2019, 52 (04)
[3]  
Bollacker K., 2008, P 2008 ACM SIGMOD IN, P1247
[4]  
Bordes A, 2013, ADV NEURAL INFORM PR, V26
[5]   Optimization Methods for Large-Scale Machine Learning [J].
Bottou, Leon ;
Curtis, Frank E. ;
Nocedal, Jorge .
SIAM REVIEW, 2018, 60 (02) :223-311
[6]  
Goyal P., 2017, Technical Report
[7]  
Kingma DP, 2014, ADV NEUR IN, V27
[8]  
Li Mu, 2014, OPERATING SYSTEMS DE, P583
[9]  
Mohr B., 1995, TAUTUNING ANAL UTILI
[10]  
Niu X.-F., 2017, ARXIV170305614