SemOpenAlex: The Scientific Landscape in 26 Billion RDF Triples

被引:7
作者
Faerber, Michael [1 ]
Lamprecht, David [1 ]
Krause, Johan [1 ]
Aung, Linn [2 ]
Haase, Peter [2 ]
机构
[1] Karlsruhe Inst Technol KIT, Inst AIFB, Karlsruhe, Germany
[2] Metaphacts GmbH, Walldorf, Germany
来源
SEMANTIC WEB, ISWC 2023, PT II | 2023年 / 14266卷
关键词
Scholarly Data; Open Science; Digital Libraries; LINKED DATA; KNOWLEDGE; GRAPH;
D O I
10.1007/978-3-031-47243-5_6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present SemOpenAlex, an extensive RDF knowledge graph that contains over 26 billion triples about scientific publications and their associated entities, such as authors, institutions, journals, and concepts. SemOpenAlex is licensed under CC0, providing free and open access to the data. We offer the data through multiple channels, including RDF dump files, a SPARQL endpoint, and as a data source in the Linked Open Data cloud, complete with resolvable URIs and links to other data sources. Moreover, we provide embeddings for knowledge graph entities using high-performance computing. SemOpenAlex enables a broad range of use-case scenarios, such as exploratory semantic search via our website, large-scale scientific impact quantification, and other forms of scholarly big data analytics within and across scientific disciplines. Additionally, it enables academic recommender systems, such as recommending collaborators, publications, and venues, including explainability capabilities. Finally, SemOpenAlex can serve for RDF query optimization benchmarks, creating scholarly knowledge-guided language models, and as a hub for semantic scientific publishing. Data and Services: https://semopenalex.org https://w3id.org/SemOpenAlex Code: https://github.com/metaphacts/semopenalex/ Data License: Creative Commons Zero (CC0) Code License: MIT License
引用
收藏
页码:94 / 112
页数:19
相关论文
共 57 条
[1]   SwetoDblp ontology of computer science publications [J].
Aleman-Meza, Boanerges ;
Hakimpour, Farshad ;
Arpinar, I. Budak ;
Sheth, Amit P. .
JOURNAL OF WEB SEMANTICS, 2007, 5 (03) :151-155
[2]   AIDA: A knowledge graph about research dynamics in academia and industry [J].
Angioni, Simone ;
Salatino, Angelo ;
Osborne, Francesco ;
Recupero, Diego Reforgiato ;
Motta, Enrico .
QUANTITATIVE SCIENCE STUDIES, 2022, 2 (04) :1356-1398
[3]   Improving Access to Scientific Literature with Knowledge Graphs [J].
Auer, Soren ;
Oelen, Allard ;
Haris, Muhammad ;
Stocker, Markus ;
D'Souza, Jennifer ;
Farfar, Kheir Eddine ;
Vogt, Lars ;
Prinz, Manuel ;
Wiens, Vitalis ;
Jaradeh, Mohamad Yaser .
BIBLIOTHEK FORSCHUNG UND PRAXIS, 2020, 44 (03) :516-529
[4]   A Multi-Domain Benchmark for Personalized Search Evaluation [J].
Bassani, Elias ;
Kasela, Pranav ;
Raganato, Alessandro ;
Pasi, Gabriella .
PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, :3822-3827
[5]  
Berners-Lee Tim, 2006, Linked data
[6]  
Bordes A., 2013, P ADV NEUR INF PROC, P2787, DOI DOI 10.5555/2999792.2999923
[7]  
Chen C, 2020, FRONT RES METRICS AN, V5, DOI 10.3389/frma.2020.607286
[8]  
Christensen A., 2022, Wissenschaftliche Literatur entdecken: Was bibliothekarische Discovery-Systeme von der Konkurrenz lernen und was sie ihr zeigen konnen
[9]  
Cossu M., 2018, P 21 INT C EXT DAT T, P469, DOI 10.5441
[10]   Wikibase as an Infrastructure for Knowledge Graphs: The EU Knowledge Graph [J].
Diefenbach, Dennis ;
De Wilde, Max ;
Alipio, Samantha .
SEMANTIC WEB - ISWC 2021, 2021, 12922 :631-647