Increasing Quality of Austrian Open Data by Linking Them to Linked Data Sources: Lessons Learned

被引:2
作者
Knap, Tomas [1 ,2 ]
机构
[1] Charles Univ Prague, Fac Math & Phys, Malostranske Nam 25, CR-11800 Prague 1, Czech Republic
[2] Semant Web Co, Mariahilfer Str 70-8, A-1070 Vienna, Austria
来源
SEMANTIC WEB, ESWC 2016 | 2016年 / 9989卷
关键词
Open Data; Linked Data; Data quality; Data linking; Data integration; Entity disambiguation;
D O I
10.1007/978-3-319-47602-5_42
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
One of the goals of the ADEQUATe project is to improve the quality of the (tabular) open data being published at two Austrian open data portals by leveraging these tabular data to Linked Data, i.e., (1) classifying columns using Linked Data vocabularies, (2) linking cell values against Linked Data entities, and (3) discovering relations in the data by searching for evidences of such relations among Linked Data sources. Integrating data at Austrian data portals with existing Linked (Open) Data sources allows to, e.g., increase data completeness and reveal discrepancies in the data. In this paper, we describe lessons learned from using TableMiner+, an algorithm for (semi) automatic leveraging of tabular data to Linked Data. In particular, we evaluate TableMiner+'s ability to (1) classify columns of the tabular data and (2) link (disambiguate) cell values against Linked Data entities in Freebase. The lessons learned described in this paper are relevant not only for the goals of the ADEQUATe project, but also for other data publishers and wranglers who need to increase quality of open data by (semi) automatically interl-inking them to Linked (Open) Data entities.
引用
收藏
页码:243 / 254
页数:12
相关论文
共 10 条
[1]   Linked Data - The Story So Far [J].
Bizer, Christian ;
Heath, Tom ;
Berners-Lee, Tim .
INTERNATIONAL JOURNAL ON SEMANTIC WEB AND INFORMATION SYSTEMS, 2009, 5 (03) :1-22
[2]  
Ermilov I., 2013, P ISEM 2013 04 06 SE
[3]   Annotating and Searching Web Tables Using Entities, Types and Relationships [J].
Limaye, Girija ;
Sarawagi, Sunita ;
Chakrabarti, Soumen .
PROCEEDINGS OF THE VLDB ENDOWMENT, 2010, 3 (01) :1338-1347
[4]  
Mulwad V., 2011, P 1 INT WORKSH SEARC, P17
[5]  
Mulwad V, 2013, LECT NOTES COMPUT SC, V8218, P363, DOI 10.1007/978-3-642-41335-3_23
[6]  
Mulwad Varish, 2010, P 1 INT WORKSH CONS
[7]  
Suchanek F. M., 2007, P 16 INT C WORLD WID, P697
[8]  
Syed Zareen, 2008, P 2 INT C WEBL SOC M
[9]   Quality assessment for Linked Data: A Survey [J].
Zaveri, Amrapali ;
Rula, Anisa ;
Maurino, Andrea ;
Pietrobon, Ricardo ;
Lehmann, Jens ;
Auer, Soeren .
SEMANTIC WEB, 2016, 7 (01) :63-93
[10]  
Zhang Z., 2016, SEMANT WEB J