Statistical Knowledge Patterns: Identifying Synonymous Relations in Large Linked Datasets

被引:0
作者
Zhang, Ziqi [1 ]
Gentile, Anna Lisa [1 ]
Blomqvist, Eva [2 ]
Augenstein, Isabelle [1 ]
Ciravegna, Fabio [1 ]
机构
[1] Univ Sheffield, Dept Comp Sci, Sheffield S10 2TN, S Yorkshire, England
[2] Linkoping Univ, Dept Comp & Informat Sci, S-58183 Linkoping, Sweden
来源
SEMANTIC WEB - ISWC 2013, PART I | 2013年 / 8218卷
基金
英国工程与自然科学研究理事会;
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The Web of Data is a rich common resource with billions of triples available in thousands of datasets and individual Web documents created by both expert and non-expert ontologists. A common problem is the imprecision in the use of vocabularies: annotators can misunderstand the semantics of a class or property or may not be able to find the right objects to annotate with. This decreases the quality of data and may eventually hamper its usability over large scale. This paper describes Statistical Knowledge Patterns (SKP) as a means to address this issue. SKPs encapsulate key information about ontology classes, including synonymous properties in (and across) datasets, and are automatically generated based on statistical data analysis. SKPs can be effectively used to automatically normalise data, and hence increase recall in querying. Both pattern extraction and pattern usage are completely automated. The main benefits of SKPs are that: (1) their structure allows for both accurate query expansion and restriction; (2) they are context dependent, hence they describe the usage and meaning of properties in the context of a particular class; and (3) they can be generated offline, hence the equivalence among relations can be used efficiently at run time.
引用
收藏
页码:703 / 719
页数:17
相关论文
共 15 条
[1]  
Augenstein I., 2013, P 2 INT WORKSH KNOWL
[2]  
Basse A., 2010, P WEBSCI 2010 EXT FR
[3]  
Blomqvist E, 2009, LECT NOTES COMPUT SC, V5823, P65, DOI 10.1007/978-3-642-04930-9_5
[4]  
Budanitsky A, 2006, COMPUT LINGUIST, V32, P13, DOI 10.1162/coli.2006.32.1.13
[5]  
Cabrio E., 2012, P ESWC 2012 WORKSH I
[6]  
Gangemi A., 2009, HDB ONTOLOGIES INT H, V2
[7]   Towards a pattern science for the Semantic Web [J].
Gangemi, Aldo ;
Presutti, Valentina .
SEMANTIC WEB, 2010, 1 (1-2) :61-68
[8]  
Musetti A., 2011, FINALIST SEMANTIC WE
[9]  
Le NT, 2010, SEMAPRO 2010: THE FOURTH INTERNATIONAL CONFERENCE ON ADVANCES IN SEMANTIC PROCESSING, P61
[10]  
Nuzzolese AG, 2011, LECT NOTES COMPUT SC, V7031, P520, DOI 10.1007/978-3-642-25073-6_33