A Utility to Transform CSV Data into EMF

Cited by: 7
Authors
Al-Azzoni, Issam [1]
Petrovic, Nenad [2]
Alqahtani, Abdullah [3]
Affiliations
[1] Al Ain Univ, Coll Engn, Al Ain, U Arab Emirates
[2] Univ Nis, Fac Elect Engn, Nish, Serbia
[3] Prince Sattam Bin Abdulaziz Univ, Coll Comp Engn & Sci, Alkharj, Saudi Arabia
Source
2021 EIGHTH INTERNATIONAL CONFERENCE ON SOFTWARE DEFINED SYSTEMS (SDS) | 2021
Keywords
Model-Driven Development; Meta-modeling; Eclipse Modeling Framework; Data Management; Data Adapters;
DOI
10.1109/SDS54264.2021.9732143
Chinese Library Classification
TP31 [Computer Software];
Subject classification codes
081202; 0835;
Abstract
In the era of data evolution, enterprises increasingly depend on data utilization tools to import or export data from various data sources. Traditionally, enterprises archive such data in row-based formats, commonly CSV files. The flat representation of these files makes it burdensome to choose the right approach for designing and developing applications that structurally meet business needs. CASE (Computer-Aided Software Engineering) tools have been praised by domain experts for building applications by describing their domains at a high level of abstraction and automatically generating the appropriate implementations. However, these tools lack appropriate facilities to support efficient and generic bulk data import. In this paper, we present a generic CSV data parser based on EMF (Eclipse Modeling Framework) that automatically maps row data into platform-specific models. We define a mapping model, which specifies the mapping between the CSV files and the target metamodels, and an auxiliary Python script to retrieve the corresponding elements. The experimental evaluation of our parser demonstrates its efficiency in importing large CSV files into EMF. In this way, we aim to increase the adoption of model-based approaches for data-driven use cases by executing bulk and row data import into EMF in an agnostic manner.
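The idea of a mapping model that ties CSV columns to metamodel attributes can be illustrated with a minimal sketch. This is not the authors' parser: the `Customer` class, the `MAPPING` dictionary, the column names, and the `load_instances` helper are all hypothetical, and a plain Python dataclass stands in for a metamodel class that would normally be defined in Ecore.

```python
import csv
import io
from dataclasses import dataclass


# Hypothetical stand-in for a metamodel class; a real EMF metamodel would be
# defined in Ecore rather than as a Python dataclass.
@dataclass
class Customer:
    name: str = ""
    age: int = 0


# Hypothetical mapping model: CSV column -> (target attribute, converter).
MAPPING = {
    "customer_name": ("name", str),
    "customer_age": ("age", int),
}


def load_instances(csv_text, target_cls, mapping):
    """Create one target_cls instance per CSV row, guided by the mapping."""
    instances = []
    for row in csv.DictReader(io.StringIO(csv_text)):
        obj = target_cls()
        for column, (attribute, convert) in mapping.items():
            # Convert the raw string cell to the attribute's expected type.
            setattr(obj, attribute, convert(row[column]))
        instances.append(obj)
    return instances


# Small in-memory CSV sample (assumed data, for illustration only).
SAMPLE = "customer_name,customer_age\nAda,36\nLinus,29\n"
customers = load_instances(SAMPLE, Customer, MAPPING)
```

Keeping the mapping as data rather than code is what would let the same loader serve different CSV layouts and target classes, which is the sense in which such a parser could be called generic.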
Pages: 37-42
Page count: 6