Hardware-agnostic computation for large-scale knowledge graph embeddings

被引：2

作者：

Demir, Caglar ^{[1
]}

Ngomo, Axel-Cyrille Ngonga ^{[1
]}

机构：

[1] Paderborn Univ, Data Sci Grp, Paderborn, Germany

来源：

SOFTWARE IMPACTS | 2022年 / 13卷

关键词：

Knowledge graph embeddings; Hardware-agnostic computation; Continual training;

D O I：

10.1016/j.simpa.2022.100377

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

Knowledge graph embedding research has mainly focused on learning continuous representations of knowledge graphs towards the link prediction problem. Recently developed frameworks can be effectively applied in research related applications. Yet, these frameworks do not fulfill many requirements of real-world applications. As the size of the knowledge graph grows, moving computation from a commodity computer to a cluster of computers in these frameworks becomes more challenging. Finding suitable hyperparameter settings w.r.t. time and computational budgets are left to practitioners. In addition, the continual learning aspect in knowledge graph embedding frameworks is often ignored, although continual learning plays an important role in many real-world (deep) learning-driven applications. Arguably, these limitations explain the lack of publicly available knowledge graph embedding models for large knowledge graphs. We developed a framework based on the frameworks DASK, Pytorch Lightning and Hugging Face to compute embeddings for large-scale knowledge graphs in a hardware-agnostic manner, which is able to address real-world challenges pertaining to the scale of real application. We provide an open-source version of our framework along with a hub of pre-trained models having more than 11.4 B parameters.(1)

引用

页数：3

共 17 条

[1]

Ali M, 2021, J MACH LEARN RES, V22

[2]

Bottou L., 2007, Adv. Neural Inf. Process. Syst., V20

[3]

Broscheit S, 2020, PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING: SYSTEM DEMONSTRATIONS, P165

[4]

Demir Caglar, 2022, HT '22: Proceedings of the 33rd ACM Conference on Hypertext and Social Media, P1, DOI 10.1145/3511095.3531276

[5]

Demir C., 2021, 18 EXTENDED SEMANTIC

[6]

Demir C, 2021, PR MACH LEARN RES, V157, P656

[7] A shallow neural model for relation prediction [J].

Demir, Caglar ;

Moussallem, Diego ;

Ngomo, Axel-Cyrille Ngonga .

2021 IEEE 15TH INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC 2021), 2021, :179-182

[8]

Diethe T, 2019, Arxiv, DOI [arXiv:1903.05202, DOI 10.48550/ARXIV.1903.05202]

[9]

Falcon W., 2019, GitHub

[10]

Goyal Siddharth, Fairscale: A general purpose modular pytorch library for high performance and large scale training

← 1 2 →