Data curation in the Internet of Things: A decision model approach

被引:4
作者
Jose de Haro-Olmo, Francisco [1 ]
Valencia-Parra, Alvaro [2 ]
Jesus Varela-Vaca, Angel [2 ]
Antonio Alvarez-Bermejo, Jose [1 ]
机构
[1] Univ Almeria, Dept Comp Sci, Almeria 04120, Spain
[2] Univ Seville, Dept Comp Languages & Syst, Seville, Spain
关键词
big data pipeline; data curation; data quality; Internet of Things; sensors; IOT; ARCHITECTURE;
D O I
10.1002/cmm4.1191
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
Current Internet of Things (IoT) scenarios have to deal with many challenges especially when a large amount of heterogeneous data sources are integrated, that is, data curation. In this respect, the use of poor-quality data (i.e., data with problems) can produce terrible consequence from incorrect decision-making to damaging the performance in the operations. Therefore, using data with an acceptable level of usability has become essential to achieve success. In this article, we propose an IoT-big data pipeline architecture that enables data acquisition and data curation in any IoT context. We have customized the pipeline by including the DMN4DQ approach to enable us the measuring and evaluating data quality in the data produced by IoT sensors. Further, we have chosen a real dataset from sensors in an agricultural IoT context and we have defined a decision model to enable us the automatic measuring and assessing of the data quality with regard to the usability of the data in the context.
引用
收藏
页数:11
相关论文
共 33 条
[1]   A Model-Driven Methodology for Big Data Analytics-as-a-Service [J].
Ardagna, Claudio A. ;
Bellandi, Valerio ;
Ceravolo, Paolo ;
Damiani, Ernesto ;
Bezzi, Michele ;
Hebert, Cedric .
2017 IEEE 6TH INTERNATIONAL CONGRESS ON BIG DATA (BIGDATA CONGRESS 2017), 2017, :105-112
[2]  
Batini C, 2016, DATA CENTRIC SYST AP, P1, DOI 10.1007/978-3-319-24106-7
[3]   CrowdCorrect: A Curation Pipeline for Social Data Cleansing and Curation [J].
Beheshti, Amin ;
Vaghani, Kushal ;
Benatallah, Boualem ;
Tabebordbar, Alireza .
INFORMATION SYSTEMS IN THE BIG DATA ERA, 2018, 317 :24-38
[4]   DataSynapse: A Social Data Curation Foundry [J].
Beheshti, Amin ;
Benatallah, Boualem ;
Tabebordbar, Alireza ;
Motahari-Nezhad, Hamid Reza ;
Barukh, Moshe Chai ;
Nouri, Reza .
DISTRIBUTED AND PARALLEL DATABASES, 2019, 37 (03) :351-384
[5]   On Automating Basic Data Curation Tasks [J].
Beheshti, Seyed-Mehdi-Reza ;
Tabebordbar, Alireza ;
Benatallah, Boualem ;
Nouri, Reza .
WWW'17 COMPANION: PROCEEDINGS OF THE 26TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB, 2017, :165-169
[6]   An Architecture to Support the Collection of Big Data in the Internet of Things [J].
Cecchinel, Cyril ;
Jimenez, Matthieu ;
Mosser, Sebastien ;
Riveill, Michel .
2014 IEEE WORLD CONGRESS ON SERVICES (SERVICES), 2014, :442-449
[7]   Big Data Semantics [J].
Ceravolo, Paolo ;
Azzini, Antonia ;
Angelini, Marco ;
Catarci, Tiziana ;
Cudre-Mauroux, Philippe ;
Damiani, Ernesto ;
Mazak, Alexandra ;
Van Keulen, Maurice ;
Jarrar, Mustafa ;
Santucci, Giuseppe ;
Sattler, Kai-Uwe ;
Scannapieco, Monica ;
Wimmer, Manuel ;
Wrembel, Robert ;
Zaraket, Fadi .
JOURNAL ON DATA SEMANTICS, 2018, 7 (02) :65-85
[8]   Chrological Big data Curation: A Study on the Enhanced Information Retrieval System [J].
Choi, Seungjin ;
Seo, Jiwan ;
Kim, Mucheol ;
Kang, Sunghyun ;
Han, Sangyong .
IEEE ACCESS, 2017, 5 :11269-11277
[9]  
Chu X., 2019, DATA CLEANING
[10]  
Curry Edward., 2016, New horizons for a data-driven economy, P29, DOI DOI 10.1007/978-3-319-21569-3