Data Integration using Machine Learning

Times cited: 0
Authors
Birgersson, Marcus [1, 2]
Hansson, Gustav [1, 2]
Franke, Ulrik [3]
Affiliations
[1] Chalmers Univ Technol, Dept Comp Sci & Engn, SE-41296 Gothenburg, Sweden
[2] iCore Solut, SE-41250 Gothenburg, Sweden
[3] SICS Swedish Inst Comp Sci, SE-16429 Kista, Sweden
Source
2016 IEEE 20TH INTERNATIONAL ENTERPRISE DISTRIBUTED OBJECT COMPUTING WORKSHOP (EDOCW) | 2016
Keywords
Enterprise interoperability; Data integration; Machine Learning; ENTERPRISE INTEGRATION; SYSTEMS; FUTURE
DOI
Not available
Chinese Library Classification number
TP301 [Theory, methods]
Discipline classification code
081202
Abstract
Today, enterprise integration and cross-enterprise collaboration are becoming ever more important. The Internet of Things, digitization and globalization are pushing continuous growth in the integration market. However, setting up integration systems today is still largely a manual endeavor. Most probably, future integration will need to leverage more automation in order to keep up with demand. This paper presents a first version of a system that uses tools from artificial intelligence and machine learning to ease the integration of information systems, aiming to automate parts of it. Three models are presented and evaluated for precision and recall using data from real, past integration projects. The results show that it is possible to obtain F-0.5 scores on the order of 80% for models trained on a particular kind of data, and on the order of 60%-70% for less specific models trained on several kinds of data. Such models would be valuable enablers for integration brokers to keep up with demand and obtain a competitive advantage. Future work includes fusing the results from the different models and enabling continuous learning from an operational production system.
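Note on the metric: the F-0.5 score mentioned in the abstract is presumably the standard F-beta measure with beta = 0.5, which weights precision more heavily than recall. The following is a minimal sketch of that standard formula, not the authors' evaluation code; the function name `f_beta` and the precision and recall values in the usage lines are hypothetical and are not taken from the paper.

```python
def f_beta(precision: float, recall: float, beta: float = 0.5) -> float:
    """Standard F-beta score: a weighted harmonic mean of precision and recall.

    beta < 1 favours precision, beta > 1 favours recall, beta = 1 gives F1.
    Illustrative helper only; not taken from the paper.
    """
    if not (0.0 <= precision <= 1.0 and 0.0 <= recall <= 1.0):
        raise ValueError("precision and recall must lie in [0, 1]")
    if beta <= 0:
        raise ValueError("beta must be positive")
    denominator = beta ** 2 * precision + recall
    if denominator == 0.0:
        # Both precision and recall are zero; define the score as zero.
        return 0.0
    return (1 + beta ** 2) * precision * recall / denominator


# Hypothetical values, chosen only to show how beta shifts the score.
print(round(f_beta(0.9, 0.6, beta=0.5), 3))  # precision-weighted score
print(round(f_beta(0.9, 0.6, beta=1.0), 3))  # ordinary F1 for comparison
```

With a precision higher than the recall, as in these made-up values, the F-0.5 score comes out above the F1 score, which is the sense in which the measure rewards precise suggestions over complete ones.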
Pages: 313-322
Page count: 10