Knowledge graph construction from multiple online encyclopedias

被引:0
作者
Tianxing Wu
Haofen Wang
Cheng Li
Guilin Qi
Xing Niu
Meng Wang
Lin Li
Chaomin Shi
机构
[1] Southeast University,
[2] Nanyang Technological University,undefined
[3] Intelligent Big Data Visualization Lab,undefined
[4] Tongji University,undefined
[5] University of Maryland,undefined
来源
World Wide Web | 2020年 / 23卷
关键词
Knowledge graph; Knowledge extraction; Knowledge linking; Semantic Web;
D O I
暂无
中图分类号
学科分类号
摘要
In recent years, lots of knowledge graphs built from Wikipedia, the largest multilingual online encyclopedia, have been published on the Web to support various applications. However, since non-English data in Wikipedia are sparse, some projects work on knowledge graph construction from multiple non-English online encyclopedias, but many technical details are missing, so it is hard to reuse their frameworks or techniques. In this paper, we propose a new framework to solve knowledge graph construction from multiple online encyclopedias. The core modules are knowledge extraction and knowledge linking. Knowledge extraction consists of regular extraction, i.e., extracting targeted article contents in the whole online encyclopedias periodically, and live extraction, which only extracts the article contents of new and updated entities. Knowledge linking utilizes heuristic lightweight entity matching strategies and a semi-supervised learning method to find duplicated entities and properties from different online encyclopedias. Experimental results show that our approaches for knowledge extraction and linking outperform state-of-the-art baselines in different evaluation metrics, and our framework can generate a large-scale knowledge graph after inputting multiple online encyclopedias.
引用
收藏
页码:2671 / 2698
页数:27
相关论文
共 49 条
  • [1] Agrawal R(1994)Fast algorithms for mining association rules Proc. of VLDB 1215 487-499
  • [2] Srikant R(2009)Linked data-the story so far Int. J. Semantic Web Inf. Syst. 5 1-22
  • [3] Bizer C(2009)DBpedia-a crystallization point for the Web of data J. Web Semantics 7 154-165
  • [4] Heath T(1971)Measuring nominal scale agreement among many raters Psychol. Bull. 76 378-1780
  • [5] Berners-Lee T(1997)Long short-term memory Neural Comput. 9 1735-12
  • [6] Bizer C(2015)A bootstrapping approach to entity linkage on the semantic Web J. Web Semantics 34 1-98
  • [7] Lehmann J(2019)XLORE2: Large-scale cross-lingual knowledge graph construction and application Data Intell. 1 77-195
  • [8] Kobilarov G(2015)Dbpedia–A large-scale, multilingual knowledge base extracted from Wikipedia Semantic Web 6 167-250
  • [9] Auer S(2012)BabelNet: The automatic construction, evaluation and application of a wide-coverage multilingual semantic network Artif. Intell. 193 217-436
  • [10] Becker C(2017)A survey of current link discovery frameworks Semantic Web 8 419-1560