CLIC: An Extensible and Efficient Cross-Platform Data Analytics System

被引:0
作者
Chen, Qixiang [1 ]
Chen, Zhijun [1 ]
Zhang, Kai [1 ]
Wang, X. Sean [1 ]
机构
[1] Fudan Univ, Sch Comp Sci Technol, Shanghai 200437, Peoples R China
关键词
Data analysis; data processing; data systems; systems;
D O I
10.1109/TPDS.2023.3298038
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
With the ever-increasing data volume and application diversity, a modern data analytics job is generally built as a workflow consisting of multiple tasks. For either specific functionalities or higher performance, tasks in a workflow may need to be deployed on different data processing platforms. This article proposes CLIC, a highly extensible system for efficient cross-platform data analytics. To leverage the advantage of diverse platforms while alleviating development efforts, we propose an embedding-based operator encoding scheme and a Graph Convolutional Network model for efficient platform selection. Aiming at flexibly integrating new operators and platforms, CLIC is designed with a highly extensible system architecture that decouples the core functionalities from backend platforms. Experiments show that CLIC can significantly improve the performance of modern data analysis workflows with fast platform selection.
引用
收藏
页码:34 / 45
页数:12
相关论文
共 50 条
  • [21] Cross-platform metabolomics investigating the intracellular metabolic alterations of HaCaT cells exposed to phenanthrene
    Jiang, Guoting
    Kang, Hongyan
    Yu, Yunqiu
    JOURNAL OF CHROMATOGRAPHY B-ANALYTICAL TECHNOLOGIES IN THE BIOMEDICAL AND LIFE SCIENCES, 2017, 1060 : 15 - 21
  • [22] Cross-platform approach to create the interactive applications based on ROOT and Qt GUI libraries
    Brun, R
    Fine, V
    Lauret, J
    Rademakers, F
    NUCLEAR INSTRUMENTS & METHODS IN PHYSICS RESEARCH SECTION A-ACCELERATORS SPECTROMETERS DETECTORS AND ASSOCIATED EQUIPMENT, 2004, 534 (1-2) : 94 - 97
  • [23] Framework for a smart data analytics platform towards process monitoring and alarm management
    Hu, Wenkai
    Shah, Sirish L.
    Chen, Tongwen
    COMPUTERS & CHEMICAL ENGINEERING, 2018, 114 : 225 - 244
  • [24] Developing a data analytics platform to support decision making in emergency and security management
    Perez-Gonzalez, Carlos J.
    Colebrook, Marcos
    Roda-Garcia, Jose L.
    Rosa-Remedios, Carlos B.
    EXPERT SYSTEMS WITH APPLICATIONS, 2019, 120 : 167 - 184
  • [25] Cross-Platform Distributed Product Online Ratings Aggregation Approach for Decision Making with Basic Uncertain Linguistic Information
    Yang, Yi
    Xia, Dan-Xia
    Pedrycz, Witold
    Deveci, Muhammet
    Chen, Zhen-Song
    INTERNATIONAL JOURNAL OF FUZZY SYSTEMS, 2024, 26 (06) : 1936 - 1957
  • [26] Cross-platform metabolic profiling deciphering the potential targets of Shenfu injection against acute viral myocarditis in mice
    Tan, Guangguo
    Zhou, Qian
    Liu, Kui
    Dong, Xin
    Li, Ling
    Liao, Wenting
    Wu, Hong
    JOURNAL OF PHARMACEUTICAL AND BIOMEDICAL ANALYSIS, 2018, 160 : 1 - 11
  • [27] Trinity: In-Database Near-Data Machine Learning Acceleration Platform for Advanced Data Analytics
    Kim, Ji-Hoon
    Han, Seunghee
    Park, Kwanghyun
    Ji, Soo-Young
    Kim, Joo-Young
    IEEE ACCESS, 2024, 12 : 11945 - 11962
  • [28] An extensible framework for collaborative e-governance platform workflow modeling using data flow analysis
    Liu, Luning
    Ju, Jingrui
    Feng, Yuqiang
    INFORMATION TECHNOLOGY FOR DEVELOPMENT, 2017, 23 (03) : 415 - 437
  • [29] Predictive Maintenance with Sensor Data Analytics on a Raspberry Pi-Based Experimental Platform
    Chuang, Shang-Yi
    Sahoo, Nilima
    Lin, Hung-Wei
    Chang, Yeong-Hwa
    SENSORS, 2019, 19 (18)