Fuzzy Join for Flexible Combining Big Data Lakes in Cyber-Physical Systems

被引:14
作者
Malysiak-Mrozek, Bozena [1 ]
Lipinska, Anna [1 ]
Mrozek, Dariusz [1 ]
机构
[1] Silesian Tech Univ, Inst Informat, PL-44100 Gliwice, Poland
来源
IEEE ACCESS | 2018年 / 6卷
关键词
Cyber-physical systems; big data; fuzzy logic; querying; cloud computing; biomedical data analysis; declarative languages; DATA ANALYTICS; MAPREDUCE; ARCHITECTURE; IMPLEMENTATION; FRAMEWORK;
D O I
10.1109/ACCESS.2018.2879829
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Cyber-physical. systems produce large amounts of data that are stored in domain-related data lakes in a variety of formats. By using the big data technologies that enable efficient data processing, the value of the data increases, as these technologies can turn the data into actionable information that influences important decision-making processes. However, a broader view of the operational environment, an investigated phenomena, and challenges related to them can frequently be obtained after combining data from many data sets located in various big data lakes. This requires contact points in both data lakes that must be flexibly joined because in many cases, data sets do not correspond to one another directly. In this paper, we show fuzzy join operation for flexible combining big data lakes. The fuzzy join transforms numerical values of common attributes of joined data sets into fuzzy sets and uses such a representation in the join operation. We propose two variants of the join operation that transforms crisp numerical values of joining attributes into: 1) fuzzy numbers and 2) linguistic terms. The fuzzy join operation is implemented and tested in the declarative U-SQL language that is used for scalable and parallel querying in big data lakes. The ideas presented here are exemplified by a distributed analysis of cardiac disease data on Microsoft Azure cloud. The results of the conducted experiments confirm that the fuzzy join can enrich data sets that are used in making critical decisions and, as a highly scalable cloud-based solution, can be successfully used in processing large volumes of data delivered by cyber-physical systems.
引用
收藏
页码:69545 / 69558
页数:14
相关论文
共 50 条
  • [1] Big Data Meet Cyber-Physical Systems: A Panoramic Survey
    Atat, Rachad
    Liu, Lingjia
    Wu, Jinsong
    Li, Guangyu
    Ye, Chunxuan
    Yi, Yang
    IEEE ACCESS, 2018, 6 : 73603 - 73636
  • [2] Big Data Conceptual Modelling in Cyber-Physical Systems
    Bagozi, Ada
    Bianchini, Devis
    De Antonellis, Valeria
    Marini, Alessandro
    Ragazzi, Davide
    ENTERPRISE MODELLING AND INFORMATION SYSTEMS ARCHITECTURES-AN INTERNATIONAL JOURNAL, 2018, 13 : 316 - 332
  • [3] Big Data and virtualization for manufacturing cyber-physical systems: A survey of the current status and future outlook
    Babiceanu, Radu F.
    Seker, Remzi
    COMPUTERS IN INDUSTRY, 2016, 81 : 128 - 137
  • [4] A Framework to handle Big Data for Cyber-Physical Systems
    Rehman, Shafiq Ur
    Hark, Andre
    Gruhn, Volker
    2017 8TH IEEE ANNUAL INFORMATION TECHNOLOGY, ELECTRONICS AND MOBILE COMMUNICATION CONFERENCE (IEMCON), 2017, : 72 - 78
  • [5] Big Data as a Service for Monitoring Cyber-Physical Production Systems
    Marini, Alessandro
    Bianchini, Devis
    PROCEEDINGS - 30TH EUROPEAN CONFERENCE ON MODELLING AND SIMULATION ECMS 2016, 2016, : 579 - 586
  • [6] BIG DATA APPLIED TO CYBER-PHYSICAL LOGISTIC SYSTEMS: CONCEPTUAL MODEL AND PERSPECTIVES
    Frazzon, Enzo Morosini
    Dutra, Moises Lima
    Vianna, William Barbosa
    BRAZILIAN JOURNAL OF OPERATIONS & PRODUCTION MANAGEMENT, 2015, 12 (02) : 330 - 337
  • [7] On the impact of fuzzy-logic based BDI agent model for cyber-physical systems
    Karaduman, Burak
    Tezel, Baris Tekin
    Challenger, Moharram
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 238
  • [8] Privacy-preserving big data analytics for cyber-physical systems
    Keshk, Marwa
    Moustafa, Nour
    Sitnikova, Elena
    Turnbull, Benjamin
    WIRELESS NETWORKS, 2022, 28 (03) : 1241 - 1249
  • [9] Dependable Deep Computation Model for Feature Learning on Big Data in Cyber-Physical Systems
    Zhang, Qingchen
    Yang, Laurence T.
    Chen, Zhikui
    Li, Peng
    ACM TRANSACTIONS ON CYBER-PHYSICAL SYSTEMS, 2019, 3 (01)
  • [10] Privacy-preserving big data analytics for cyber-physical systems
    Marwa Keshk
    Nour Moustafa
    Elena Sitnikova
    Benjamin Turnbull
    Wireless Networks, 2022, 28 : 1241 - 1249