DBpedia - A large-scale, multilingual knowledge base extracted from Wikipedia

被引:1702
|
作者
Lehmann, Jens [1 ]
Isele, Robert [7 ]
Jakob, Max [5 ]
Jentzsch, Anja [4 ]
Kontokostas, Dimitris [1 ]
Mendes, Pablo N. [6 ]
Hellmann, Sebastian [1 ]
Morsey, Mohamed [1 ]
van Kleef, Patrick [3 ]
Auer, Soeren [1 ,8 ,9 ]
Bizer, Christian [2 ]
机构
[1] Univ Leipzig, Inst Comp Sci, AKSW Grp, D-04009 Leipzig, Germany
[2] Univ Mannheim, Res Grp Data & Web Sci, D-68159 Mannheim, Germany
[3] OpenLink Software, Burlington, MA 01803 USA
[4] Hasso Plattner Inst IT Syst Engn, D-14482 Potsdam, Germany
[5] Neofonie GmbH, D-10115 Berlin, Germany
[6] Wright State Univ, Kno E Sis Ohio Ctr Excellence Knowledge Enabled C, Dayton, OH 45435 USA
[7] Brox IT Solut GmbH, D-30625 Hannover, Germany
[8] Univ Bonn, Enterprise Informat Syst, D-53117 Bonn, Germany
[9] Fraunhofer IAIS, D-53117 Bonn, Germany
关键词
Knowledge extraction; Wikipedia; multilingual knowledge bases; Linked Data; RDF; LINKED DATA; SEMANTIC WEB;
D O I
10.3233/SW-140134
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The DBpedia community project extracts structured, multilingual knowledge from Wikipedia and makes it freely available on the Web using Semantic Web and Linked Data technologies. The project extracts knowledge from 111 different language editions of Wikipedia. The largest DBpedia knowledge base which is extracted from the English edition of Wikipedia consists of over 400 million facts that describe 3.7 million things. The DBpedia knowledge bases that are extracted from the other 110Wikipedia editions together consist of 1.46 billion facts and describe 10 million additional things. The DBpedia project maps Wikipedia infoboxes from 27 different language editions to a single shared ontology consisting of 320 classes and 1,650 properties. The mappings are created via a world-wide crowd-sourcing effort and enable knowledge from the different Wikipedia editions to be combined. The project publishes releases of all DBpedia knowledge bases for download and provides SPARQL query access to 14 out of the 111 language editions via a global network of local DBpedia chapters. In addition to the regular releases, the project maintains a live knowledge base which is updated whenever a page in Wikipedia changes. DBpedia sets 27 million RDF links pointing into over 30 external data sources and thus enables data from these sources to be used together with DBpedia data. Several hundred data sets on the Web publish RDF links pointing to DBpedia themselves and make DBpedia one of the central interlinking hubs in the Linked Open Data (LOD) cloud. In this system report, we give an overview of the DBpedia community project, including its architecture, technical implementation, maintenance, internationalisation, usage statistics and applications.
引用
收藏
页码:167 / 195
页数:29
相关论文
共 27 条
  • [21] Optimization of Large-Scale Knowledge Forward Reasoning Based on OWL 2 DL Ontology
    Cui, Lingyun
    Ren, Tenglong
    Zhang, Xiaowang
    Feng, Zhiyong
    COLLABORATIVE COMPUTING: NETWORKING, APPLICATIONS AND WORKSHARING, COLLABORATECOM 2022, PT II, 2022, 461 : 380 - 399
  • [22] NORIA UI: Efficient Incident Management on Large-Scale ICT Systems Represented as Knowledge Graphs
    Tailhardat, Lionel
    Chabot, Yoan
    Py, Antoine
    Guillemette, Perrine
    19TH INTERNATIONAL CONFERENCE ON AVAILABILITY, RELIABILITY, AND SECURITY, ARES 2024, 2024,
  • [23] CS-KG: A Large-Scale Knowledge Graph of Research Entities and Claims in Computer Science
    Sattler, Ulrike
    Hogan, Aidan
    Keet, Maria
    Presutti, Valentina
    Almeida, Joao Paulo A.
    Takeda, Hideaki
    Monnin, Pierre
    Pirro, Giuseppe
    Amato, Claudia d
    SEMANTIC WEB - ISWC 2022, 2022, 13489 : 678 - 696
  • [24] Hike: A Hybrid Human-Machine Method for Entity Alignment in Large-Scale Knowledge Bases
    Zhuang, Yan
    Li, Guoliang
    Zhong, Zhuojian
    Feng, Jianhua
    CIKM'17: PROCEEDINGS OF THE 2017 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2017, : 1917 - 1926
  • [25] LOD Conversion System for generating Large Knowledge Base from Web Contents
    Takahashi, Kazuki
    Maki, Toshitaka
    Wakahara, Toshihiko
    Kobayashi, Toru
    Kodate, Akihisa
    Sonehara, Noboru
    2017 IEEE 6TH GLOBAL CONFERENCE ON CONSUMER ELECTRONICS (GCCE), 2017,
  • [26] Valuable Knowledge Mining: Deep Analysis of Heart Disease and Psychological Causes Based on Large-Scale Medical Data
    Wang, Ling
    Shan, Minglei
    Zhou, Tie Hua
    Ryu, Keun Ho
    APPLIED SCIENCES-BASEL, 2023, 13 (20):
  • [27] FLOPPIES: A Framework for Large-Scale Ontology Population of Product Information from Tabular Data in E-commerce Stores
    Nederstigt, Lennart J.
    Aanen, Steven S.
    Vandic, Damir
    Frasincar, Flavius
    DECISION SUPPORT SYSTEMS, 2014, 59 : 296 - 311