Fuzzy Join for Flexible Combining Big Data Lakes in Cyber-Physical Systems

被引:14
作者
Malysiak-Mrozek, Bozena [1 ]
Lipinska, Anna [1 ]
Mrozek, Dariusz [1 ]
机构
[1] Silesian Tech Univ, Inst Informat, PL-44100 Gliwice, Poland
来源
IEEE ACCESS | 2018年 / 6卷
关键词
Cyber-physical systems; big data; fuzzy logic; querying; cloud computing; biomedical data analysis; declarative languages; DATA ANALYTICS; MAPREDUCE; ARCHITECTURE; IMPLEMENTATION; FRAMEWORK;
D O I
10.1109/ACCESS.2018.2879829
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Cyber-physical. systems produce large amounts of data that are stored in domain-related data lakes in a variety of formats. By using the big data technologies that enable efficient data processing, the value of the data increases, as these technologies can turn the data into actionable information that influences important decision-making processes. However, a broader view of the operational environment, an investigated phenomena, and challenges related to them can frequently be obtained after combining data from many data sets located in various big data lakes. This requires contact points in both data lakes that must be flexibly joined because in many cases, data sets do not correspond to one another directly. In this paper, we show fuzzy join operation for flexible combining big data lakes. The fuzzy join transforms numerical values of common attributes of joined data sets into fuzzy sets and uses such a representation in the join operation. We propose two variants of the join operation that transforms crisp numerical values of joining attributes into: 1) fuzzy numbers and 2) linguistic terms. The fuzzy join operation is implemented and tested in the declarative U-SQL language that is used for scalable and parallel querying in big data lakes. The ideas presented here are exemplified by a distributed analysis of cardiac disease data on Microsoft Azure cloud. The results of the conducted experiments confirm that the fuzzy join can enrich data sets that are used in making critical decisions and, as a highly scalable cloud-based solution, can be successfully used in processing large volumes of data delivered by cyber-physical systems.
引用
收藏
页码:69545 / 69558
页数:14
相关论文
共 50 条
  • [41] Maintaining Data Freshness in Distributed Cyber-Physical Systems
    Li, Guohui
    Zhou, Chunyang
    Li, Jianjun
    Guo, Bing
    IEEE TRANSACTIONS ON COMPUTERS, 2019, 68 (07) : 1077 - 1090
  • [42] Optimal Data Injection Attacks in Cyber-Physical Systems
    Wu, Guangyu
    Sun, Jian
    Chen, Jie
    IEEE TRANSACTIONS ON CYBERNETICS, 2018, 48 (12) : 3302 - 3312
  • [43] Health-CPS: Healthcare Cyber-Physical System Assisted by Cloud and Big Data
    Zhang, Yin
    Qiu, Meikang
    Tsai, Chun-Wei
    Hassan, Mohammad Mehedi
    Alamri, Atif
    IEEE SYSTEMS JOURNAL, 2017, 11 (01): : 88 - 95
  • [44] Data Visualization Support for Complex Logistics Operations and Cyber-Physical Systems
    Gurdur, Didem
    Raizer, Klaus
    El-Khoury, Jad
    VISIGRAPP 2018: PROCEEDINGS OF THE 13TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS / INTERNATIONAL CONFERENCE ON INFORMATION VISUALIZATION THEORY AND APPLICATIONS (IVAPP), VOL 3, 2018, : 197 - 208
  • [45] Data space randomization for securing cyber-physical systems
    Potteiger, Bradley
    Cai, Feiyang
    Zhang, Zhenkai
    Koutsoukos, Xenofon
    INTERNATIONAL JOURNAL OF INFORMATION SECURITY, 2022, 21 (03) : 597 - 610
  • [46] Digital transformation of manufacturing. Industry of the Future with Cyber-Physical Production Systems
    Borangiu, Theodor
    Morariu, Octavian
    Raileanu, Silviu
    Trentesaux, Damien
    Leitao, Paulo
    Barata, Jose
    ROMANIAN JOURNAL OF INFORMATION SCIENCE AND TECHNOLOGY, 2020, 23 (01): : 3 - 37
  • [47] Customizable and Scalable Fuzzy Join for Big Data
    Chen, Zhimin
    Wang, Yue
    Narasayya, Vivek
    Chaudhuri, Surajit
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2019, 12 (12): : 2106 - 2117
  • [48] Securing Big Data and IoT Networks in Smart Cyber-Physical Environments
    Das, Sajal K.
    Yamana, Hayato
    2017 INTERNATIONAL CONFERENCE ON SMART DIGITAL ENVIRONMENT (ICSDE'17), 2017, : 189 - 194
  • [49] Big data analytics - enabled cyber-physical system: model and applications
    Luo, Shuai
    Liu, Hongwei
    Qi, Ershi
    INDUSTRIAL MANAGEMENT & DATA SYSTEMS, 2019, 119 (05) : 1072 - 1088
  • [50] Communication in Cyber-Physical Systems
    Mois, George
    Folea, Silviu
    Sanislav, Teodora
    Miclea, Liviu
    2015 19TH INTERNATIONAL CONFERENCE ON SYSTEM THEORY, CONTROL AND COMPUTING (ICSTCC), 2015, : 303 - 307