CLIC: An Extensible and Efficient Cross-Platform Data Analytics System

Cited by: 0
Authors
Chen, Qixiang [1]
Chen, Zhijun [1]
Zhang, Kai [1]
Wang, X. Sean [1]
Affiliations
[1] Fudan Univ, Sch Comp Sci Technol, Shanghai 200437, Peoples R China
Keywords
Data analysis; data processing; data systems; systems
DOI
10.1109/TPDS.2023.3298038
Chinese Library Classification
TP301 [Theory, methods]
Discipline classification code
081202
Abstract
With the ever-increasing data volume and application diversity, a modern data analytics job is generally built as a workflow consisting of multiple tasks. For either specific functionalities or higher performance, tasks in a workflow may need to be deployed on different data processing platforms. This article proposes CLIC, a highly extensible system for efficient cross-platform data analytics. To leverage the advantage of diverse platforms while alleviating development efforts, we propose an embedding-based operator encoding scheme and a Graph Convolutional Network model for efficient platform selection. Aiming at flexibly integrating new operators and platforms, CLIC is designed with a highly extensible system architecture that decouples the core functionalities from backend platforms. Experiments show that CLIC can significantly improve the performance of modern data analysis workflows with fast platform selection.
Pages: 34 - 45
Number of pages: 12
Related papers
50 in total
  • [31] Data analytics and firm performance: An empirical study in an online B2C platform
    Song, Peijian
    Zheng, Chengde
    Zhang, Cheng
    Yu, Xiaofeng
    INFORMATION & MANAGEMENT, 2018, 55 (05) : 633 - 642
  • [32] Toward Reference Architectures: A Cloud-Agnostic Data Analytics Platform Empowering Autonomous Systems
    Marosi, Attila Csaba
    Emodi, Mark
    Farkas, Attila
    Lovas, Robert
    Beregi, Richard
    Pedone, Gianfranco
    Nemeth, Balazs
    Gaspar, Peter
    IEEE ACCESS, 2022, 10 : 60658 - 60673
  • [33] Periodicity-Oriented Data Analytics on Time-Series Data for Intelligence System
    Kim, Heonho
    Yun, Unil
    Vo, Bay
    Lin, Jerry Chun-Wei
    Pedrycz, Witold
    IEEE SYSTEMS JOURNAL, 2021, 15 (04) : 4958 - 4969
  • [34] Big Cross-Modal Social Media Data Analytics With Deep Intelligence Introduction
    Wang, Yang
    Fang, Meng
    Zhou, Joey Tianyi
    Mu, Tingting
    Tao, Dacheng
    IEEE MULTIMEDIA, 2020, 27 (04) : 6 - 8
  • [35] CHOOSING A DATA ACQUISITION-SYSTEM FOR A PC PLATFORM
    PERROW, MG
    QUALITY PROGRESS, 1991, 24 (11) : 79 - &
  • [36] Cyber Physical System Based Smart Healthcare System with Deep Learning Architectures with Data Analytics
    Xu, Wuyue
    Xu, Haitang
    Zhang, Jiping
    WIRELESS PERSONAL COMMUNICATIONS, 2024
  • [37] Research on IoT Based Cyber Physical System for Industrial Big Data Analytics
    Lee, C. K. M.
    Yeung, C. L.
    Cheng, M. N.
    2015 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL ENGINEERING AND ENGINEERING MANAGEMENT (IEEM), 2015, : 1855 - 1859
  • [38] ExeKG: Executable Knowledge Graph System for User-friendly Data Analytics
    Zheng, Zhuoxun
    Zhou, Baifan
    Zhou, Dongzhuoran
    Soylu, Ahmet
    Kharlamov, Evgeny
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 5064 - 5068
  • [39] CPLDP: An Efficient Large Dataset Processing System Built on Cloud Platform
    Zhong, Zhiyong
    Li, Mark
    Chang, Jin
    Zhou, Le
    Huang, Joshua Zhexue
    Feng, Shengzhong
    ADVANCED DATA MINING AND APPLICATIONS (ADMA 2010), PT II, 2010, 6441 : 13 - 33
  • [40] Ever-est: The platform allowing scientists to cross-fertilize and cross-validate data
    Albani M.
    Leone R.
    Foglini F.
    De Leo F.
    Marelli F.
    Maggio I.
    Data Science Journal, 2020, 19 (01)