Fuzzy Join for Flexible Combining Big Data Lakes in Cyber-Physical Systems

Cited by: 14
Authors
Malysiak-Mrozek, Bozena [1]
Lipinska, Anna [1]
Mrozek, Dariusz [1]
Affiliations
[1] Silesian Tech Univ, Inst Informat, PL-44100 Gliwice, Poland
Source
IEEE ACCESS | 2018 / Vol. 6
Keywords
Cyber-physical systems; big data; fuzzy logic; querying; cloud computing; biomedical data analysis; declarative languages; DATA ANALYTICS; MAPREDUCE; ARCHITECTURE; IMPLEMENTATION; FRAMEWORK;
DOI
10.1109/ACCESS.2018.2879829
Chinese Library Classification
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Cyber-physical systems produce large amounts of data that are stored in domain-related data lakes in a variety of formats. Big data technologies that enable efficient data processing increase the value of these data, because they can turn the data into actionable information that influences important decision-making processes. However, a broader view of the operational environment, the investigated phenomena, and the challenges related to them can frequently be obtained only after combining data from many data sets located in various big data lakes. This requires contact points in both data lakes that must be joined flexibly, because in many cases the data sets do not correspond to one another directly. In this paper, we present a fuzzy join operation for flexibly combining big data lakes. The fuzzy join transforms numerical values of common attributes of the joined data sets into fuzzy sets and uses this representation in the join operation. We propose two variants of the join operation that transform crisp numerical values of joining attributes into: 1) fuzzy numbers and 2) linguistic terms. The fuzzy join operation is implemented and tested in the declarative U-SQL language, which is used for scalable and parallel querying in big data lakes. The ideas presented here are exemplified by a distributed analysis of cardiac disease data on the Microsoft Azure cloud. The results of the conducted experiments confirm that the fuzzy join can enrich data sets that are used in making critical decisions and that, as a highly scalable cloud-based solution, it can be successfully used in processing large volumes of data delivered by cyber-physical systems.
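
The two join variants named in the abstract can be illustrated with a minimal Python sketch. This is not the authors' U-SQL implementation: the triangular and trapezoidal membership functions, the `spread` and `threshold` parameters, the function names, and the term boundaries in `TERMS` are all assumptions chosen only to show the idea.

```python
from typing import Dict, List, Tuple


def triangular(x: float, a: float, b: float, c: float) -> float:
    """Membership of x in a triangular fuzzy set (feet a and c, peak b; a < b < c)."""
    if x <= a or x >= c:
        return 0.0
    if x <= b:
        return (x - a) / (b - a)
    return (c - x) / (c - b)


def trapezoid(x: float, a: float, b: float, c: float, d: float) -> float:
    """Membership of x in a trapezoidal fuzzy set (support [a, d], core [b, c])."""
    if x < a or x > d:
        return 0.0
    if b <= x <= c:
        return 1.0
    if x < b:
        return (x - a) / (b - a)
    return (d - x) / (d - c)


def fuzzy_join_numbers(left: List[float], right: List[float],
                       spread: float, threshold: float
                       ) -> List[Tuple[float, float, float]]:
    """Variant 1 (assumed form): treat each left value v as the fuzzy number
    'around v' and keep pairs whose membership degree reaches the threshold."""
    pairs = []
    for v in left:
        for w in right:
            degree = triangular(w, v - spread, v, v + spread)
            if degree >= threshold:
                pairs.append((v, w, degree))
    return pairs


# Hypothetical linguistic terms for a numeric attribute (boundaries are made up).
TERMS: Dict[str, Tuple[float, float, float, float]] = {
    "low": (0.0, 0.0, 90.0, 110.0),
    "normal": (90.0, 110.0, 130.0, 150.0),
    "high": (130.0, 150.0, 300.0, 300.0),
}


def best_term(x: float, terms: Dict[str, Tuple[float, float, float, float]]
              ) -> Tuple[str, float]:
    """Return the term with the highest membership degree for x."""
    return max(((name, trapezoid(x, *p)) for name, p in terms.items()),
               key=lambda item: item[1])


def fuzzy_join_terms(left: List[float], right: List[float],
                     terms: Dict[str, Tuple[float, float, float, float]],
                     threshold: float) -> List[Tuple[float, float, str, float]]:
    """Variant 2 (assumed form): join values that map to the same linguistic
    term, keeping the weaker of the two membership degrees."""
    pairs = []
    for v in left:
        term_v, deg_v = best_term(v, terms)
        for w in right:
            term_w, deg_w = best_term(w, terms)
            degree = min(deg_v, deg_w)
            if term_v == term_w and degree >= threshold:
                pairs.append((v, w, term_v, degree))
    return pairs
```

For example, `fuzzy_join_numbers([120.0], [118.0, 135.0], spread=10.0, threshold=0.5)` keeps only the pair (120.0, 118.0), and `fuzzy_join_terms([125.0], [112.0, 145.0], TERMS, 0.5)` keeps only (125.0, 112.0), because both values fall under the hypothetical term "normal". The nested loops are for clarity only; a scalable version would push the comparison into the query engine, as the paper does with U-SQL.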
Pages: 69545 - 69558
Page count: 14
Related papers
50 in total
  • [21] Big Data analytics and Computational Intelligence for Cyber-Physical Systems: Recent trends and state of the art applications
    Iqbal, Rahat
    Doctor, Faiyaz
    More, Brian
    Mahmud, Shahid
    Yousuf, Usman
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2020, 105 : 766 - 778
  • [22] Smart Grids: A Cyber-Physical Systems Perspective
    Yu, Xinghuo
    Xue, Yusheng
    PROCEEDINGS OF THE IEEE, 2016, 104 (05) : 1058 - 1070
  • [23] Cyber-Physical Systems Forensics: Today and Tomorrow
    Mohamed, Nader
    Al-Jaroodi, Jameela
    Jawhar, Imad
    JOURNAL OF SENSOR AND ACTUATOR NETWORKS, 2020, 9 (03)
  • [24] Data Quality Challenges in Cyber-Physical Systems
    Sha, Kewei
    Zeadally, Sherali
    ACM JOURNAL OF DATA AND INFORMATION QUALITY, 2015, 6 (2-3):
  • [25] Industrial big data analytics and cyber-physical systems for future maintenance & service innovation
    Lee, Jay
    Ardakani, Hossein Davari
    Yang, Shanhu
    Bagheri, Behrad
    PROCEEDINGS OF THE 4TH INTERNATIONAL CONFERENCE ON THROUGH-LIFE ENGINEERING SERVICES, 2015, 38 : 3 - 7
  • [26] A Granular GA-SVM Predictor for Big Data in Agricultural Cyber-Physical Systems
    Ruan, Junhu
    Jiang, Hua
    Li, Xiaoyu
    Shi, Yan
    Chan, Felix T. S.
    Rao, Weizhen
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2019, 15 (12) : 6510 - 6521
  • [27] Cyber-physical Systems
    Wolf, Wayne
    COMPUTER, 2009, 42 (03) : 88 - 89
  • [28] Deep iterative fuzzy pooling in unmanned robotics and autonomous systems for Cyber-Physical systems
    Chandar, V. Ravindra Krishna
    Baskaran, P.
    Mohanraj, G.
    Karthikeyan, D.
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2024, 46 (02) : 4621 - 4639
  • [29] Big Data Platform for Integrated Cyber and Physical Security of Critical Infrastructures for the Financial Sector Critical Infrastructures as Cyber-Physical Systems
    Troiano, Ernesto
    Soldatos, John
    Polyviou, Ariana
    Polyviou, Andreas
    Mamelli, Alessandro
    Drakoulis, Dimitris
    11TH INTERNATIONAL CONFERENCE ON MANAGEMENT OF DIGITAL ECOSYSTEMS (MEDES), 2019, : 262 - 269
  • [30] Process execution in Cyber-Physical Systems using cloud and Cyber-Physical Internet services
    Bordel, Borja
    Alcarria, Ramon
    Sanchez de Rivera, Diego
    Robles, Tomas
    JOURNAL OF SUPERCOMPUTING, 2018, 74 (08) : 4127 - 4169