Towards a Semantic Extract-Transform-Load (ETL) framework for Big Data Integration

Cited by: 56
Author
Bansal, Srividya K. [1]
Affiliation
[1] Arizona State Univ, Dept Engn & Comp Syst, Mesa, AZ 85212 USA
Source
2014 IEEE INTERNATIONAL CONGRESS ON BIG DATA (BIGDATA CONGRESS) | 2014
Keywords
Big data; Data integration; Ontology; Semantic technologies; DESIGN;
DOI
10.1109/BigData.Congress.2014.82
Chinese Library Classification number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
Big Data has become the ubiquitous term for massive collections of datasets that are difficult to process using traditional database and software techniques. Most of this data is inaccessible to users, because technology and tools are needed to find, transform, analyze, and visualize it before it becomes consumable for decision-making. One aspect of Big Data research deals with the Variety of data, which spans formats such as structured, numeric, unstructured text, email, video, audio, and stock ticker data. Managing, merging, and governing this variety of data is the focus of this paper. The paper proposes a semantic Extract-Transform-Load (ETL) framework that uses semantic technologies to integrate and publish data from multiple sources as open linked data. The framework includes: creation of a semantic data model to provide a basis for integration and understanding of knowledge from multiple sources; creation of a distributed Web of data using the Resource Description Framework (RDF) as the graph data model; and extraction of useful knowledge and information from the combined data using SPARQL as the semantic query language.
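The three stages named in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation: a plain Python set of (subject, predicate, object) tuples stands in for an RDF graph, and a small pattern matcher stands in for a SPARQL basic graph pattern. The `http://example.org/` vocabulary, the two sources, their field names, and every function name below are hypothetical and chosen only for illustration.

```python
from typing import Dict, Iterable, Iterator, List, Optional, Set, Tuple

Triple = Tuple[str, str, str]

# Hypothetical shared vocabulary (the "semantic data model").
EX = "http://example.org/"
NAME = EX + "name"
CITY = EX + "city"


def extract_source_a() -> List[Dict[str, str]]:
    # Extract: made-up records from a first source with its own schema.
    return [{"id": "1", "full_name": "Ada", "town": "Mesa"}]


def extract_source_b() -> List[Dict[str, str]]:
    # Extract: made-up records from a second source with a different schema.
    return [{"key": "2", "name": "Grace", "location": "Mesa"}]


def transform(records: Iterable[Dict[str, str]], id_field: str,
              mapping: Dict[str, str]) -> Iterator[Triple]:
    # Transform: map source-specific fields onto the shared vocabulary.
    for rec in records:
        subject = EX + "person/" + rec[id_field]
        for field, predicate in mapping.items():
            yield (subject, predicate, rec[field])


def load(*streams: Iterable[Triple]) -> Set[Triple]:
    # Load: merge all triple streams into one graph (a set of triples).
    graph: Set[Triple] = set()
    for stream in streams:
        graph.update(stream)
    return graph


def match(graph: Set[Triple], s: Optional[str] = None,
          p: Optional[str] = None, o: Optional[str] = None) -> List[Triple]:
    # Return triples matching a pattern; None acts as a wildcard.
    return sorted(t for t in graph
                  if (s is None or t[0] == s)
                  and (p is None or t[1] == p)
                  and (o is None or t[2] == o))


def names_in_city(graph: Set[Triple], city: str) -> List[str]:
    # Roughly: SELECT ?name WHERE { ?x ex:city "<city>" . ?x ex:name ?name }
    people = [s for s, _, _ in match(graph, p=CITY, o=city)]
    return sorted(o for person in people
                  for _, _, o in match(graph, s=person, p=NAME))


graph = load(
    transform(extract_source_a(), "id", {"full_name": NAME, "town": CITY}),
    transform(extract_source_b(), "key", {"name": NAME, "location": CITY}),
)
print(names_in_city(graph, "Mesa"))  # prints ['Ada', 'Grace']
```

Once both sources are mapped onto the same predicates, a single query answers across them without knowing either original schema. A real deployment would presumably use an RDF library and a SPARQL endpoint rather than tuples in memory.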
Pages: 521-528
Page count: 8