Semantic Quran A multilingual resource for natural-language processing

被引:23
作者
Sherif, Mohamed Ahmed [1 ]
Ngomo, Axel-Cyrille Ngonga [1 ]
机构
[1] Univ Leipzig, Inst Informat, AKSW, D-04009 Leipzig, Germany
关键词
Multilingual dataset; natural language processing; morpho-syntatic data; Linked Data; ontology;
D O I
10.3233/SW-140137
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we describe the Semantic Quran dataset, a multilingual RDF representation of translations of the Quran. The dataset was created by integrating data from two different semi-structured sources and aligned to an ontology designed to represent multilingual data from sources with a hierarchical structure. The resulting RDF data encompasses 43 different languages which belong to the most under-represented languages in the Linked Data Cloud, including Arabic, Amharic and Amazigh. We designed the dataset to be easily usable in natural-language processing applications with the goal of facilitating the development of knowledge extraction tools for these languages. In particular, the Semantic Quran is compatible with the Natural-Language Interchange Format and contains explicit morpho-syntactic information on the utilized terms. We present the ontology devised for structuring the data. We also provide the transformation rules implemented in our extraction framework. Finally, we detail the link creation process as well as possible usage scenarios for the Semantic Quran dataset.
引用
收藏
页码:339 / 345
页数:7
相关论文
共 7 条
  • [1] Auer S, 2011, LECT NOTES COMPUT SC, V6848, P1, DOI 10.1007/978-3-642-23032-5_1
  • [2] Farrar S., 2003, GLOT Int, V7, P97
  • [3] Hellmann Sebastian, 2012, Knowledge Engineering and Knowledge Management. 18th International Conference, EKAW 2012. Proceedings, P175, DOI 10.1007/978-3-642-33876-2_17
  • [4] Hellmann S, 2010, LECT NOTES COMPUT SC, V6089, P462, DOI 10.1007/978-3-642-13489-0_44
  • [5] On Link Discovery using a Hybrid Approach
    Ngomo, Axel-Cyrille Ngonga
    [J]. JOURNAL ON DATA SEMANTICS, 2012, 1 (04) : 203 - 217
  • [6] Somers H., 1997, TRANSLATING COMPUTER, P13
  • [7] Unger C., 2012, P 21STWORLDWIDE WEB, P639, DOI DOI 10.1145/2187836.2187923