Consistent Process Mining Over Big Data Triple Stores

被引:12
作者
Azzini, Antonia [1 ]
Ceravolo, Paolo [1 ]
机构
[1] Univ Milan, SESAR Lab, Dipartimento Informat, I-20122 Milan, Italy
来源
2013 IEEE INTERNATIONAL CONGRESS ON BIG DATA | 2013年
关键词
Data Integration; Process Mining; Big Data;
D O I
10.1109/BigData.Congress.2013.17
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
'Big Data' techniques are often adopted in cross-organization scenarios for integrating multiple data sources to extract statistics or other latent information. Even if these techniques do not require the support of a schema for processing data, a common conceptual model is typically defined to address name resolution. This implies that each local source is tasked of applying a semantic lifting procedure for expressing the local data in term of the common model. Semantic heterogeneity is then potentially introduced in data. In this paper we illustrate a methodology designed to the implementation of consistent process mining algorithms in a 'Big Data' context. In particular, we exploit two different procedures. The first one is aimed at computing the mismatch among the data sources to be integrated. The second uses mismatch values to extend data to be processed with a traditional map reduce algorithm.
引用
收藏
页码:54 / 61
页数:8
相关论文
共 30 条
[1]  
[Anonymous], EFFICIENT SPARQL QUE
[2]  
[Anonymous], 2004, RESOURCE DESCRIPTION
[3]  
[Anonymous], 2008, W3C RECOMMENDATION
[4]  
[Anonymous], J WEB SEMANTICS
[5]  
[Anonymous], B SECTION LOGIC
[6]  
[Anonymous], THESIS EINDHOVEN
[7]  
[Anonymous], 2010, P 2010 ACM S APPL CO
[8]  
[Anonymous], 2012 IEEE 16 INT ENT
[9]  
[Anonymous], KNOWLEDGE DISCOVERY
[10]  
[Anonymous], LECT NOTES BUSINESS