CLIC: An Extensible and Efficient Cross-Platform Data Analytics System

被引:0
作者
Chen, Qixiang [1 ]
Chen, Zhijun [1 ]
Zhang, Kai [1 ]
Wang, X. Sean [1 ]
机构
[1] Fudan Univ, Sch Comp Sci Technol, Shanghai 200437, Peoples R China
关键词
Data analysis; data processing; data systems; systems;
D O I
10.1109/TPDS.2023.3298038
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
With the ever-increasing data volume and application diversity, a modern data analytics job is generally built as a workflow consisting of multiple tasks. For either specific functionalities or higher performance, tasks in a workflow may need to be deployed on different data processing platforms. This article proposes CLIC, a highly extensible system for efficient cross-platform data analytics. To leverage the advantage of diverse platforms while alleviating development efforts, we propose an embedding-based operator encoding scheme and a Graph Convolutional Network model for efficient platform selection. Aiming at flexibly integrating new operators and platforms, CLIC is designed with a highly extensible system architecture that decouples the core functionalities from backend platforms. Experiments show that CLIC can significantly improve the performance of modern data analysis workflows with fast platform selection.
引用
收藏
页码:34 / 45
页数:12
相关论文
共 50 条
  • [11] An Integrated Data Analytics Platform
    Armstrong, Edward M.
    Bourassa, Mark A.
    Cram, Thomas A.
    DeBellis, Maya
    Elya, Jocelyn
    Greguska, Frank R., III
    Huang, Thomas
    Jacob, Joseph C.
    Ji, Zaihua
    Jiang, Yongyao
    Li, Yun
    Quach, Nga
    McGibbney, Lewis
    Smith, Shawn
    Tsontos, Vardis M.
    Wilson, Brian
    Worley, Steven J.
    Yang, Chaowei
    Yam, Elizabeth
    FRONTIERS IN MARINE SCIENCE, 2019, 6
  • [12] SpinStudioJ: A cross-platform NMR data acquisition and processing workbench based on a plug-in architecture
    Liu, Zao
    Chen, Zhiwei
    MAGNETIC RESONANCE IN CHEMISTRY, 2019, 57 (07) : 380 - 389
  • [13] Big Data Platform for Educational Analytics
    Munshi, Amr A.
    Alhindi, Ahmad
    IEEE ACCESS, 2021, 9 : 52883 - 52890
  • [14] A Cross-platform Modular Software Solution for Automated Data Evaluation Applied in Elemental and Structural Mass Spectrometry
    Fleischer, H.
    Adam, M.
    Thurow, K.
    2015 INTERNATIONAL CONFERENCE ON AUTOMATION SCIENCE AND ENGINEERING (CASE), 2015, : 758 - 763
  • [15] Adaptive Cross-Platform Learning for Teachers in Adult and Continuing Education
    Krause, Thorsten
    Goesling, Henning
    Digel, Sabine
    Biel, Carmen
    Kolvenbach, Sabine
    Thomas, Oliver
    ARTIFICIAL INTELLIGENCE IN EDUCATION: POSTERS AND LATE BREAKING RESULTS, WORKSHOPS AND TUTORIALS, INDUSTRY AND INNOVATION TRACKS, PRACTITIONERS AND DOCTORAL CONSORTIUM, PT II, 2022, 13356 : 138 - 143
  • [16] Spyke Viewer: a flexible and extensible platform for electrophysiological data analysis
    Proepper, Robert
    Obermayer, Klaus
    FRONTIERS IN NEUROINFORMATICS, 2013, 7
  • [17] Unlocking Online Reputation On the Effectiveness of Cross-Platform Signaling in the Sharing Economy
    Teubner, Timm
    Adam, Marc T. P.
    Hawlitschek, Florian
    BUSINESS & INFORMATION SYSTEMS ENGINEERING, 2020, 62 (06) : 501 - 513
  • [18] Big Data Analytics Platform for Flight Safety Monitoring
    Li, Bo
    Ming, Xinguo
    Li, Guoming
    2017 IEEE 2ND INTERNATIONAL CONFERENCE ON BIG DATA ANALYSIS (ICBDA), 2017, : 355 - 358
  • [19] Disco: A Computing Platform for Large-Scale Data Analytics
    Mundkur, Prashanth
    Tuulos, Ville
    Flatow, Jared
    ERLANG 11: PROCEEDINGS OF THE 2011 ACM SIGPLAN ERLANG WORKSHOP, 2011, : 84 - 89
  • [20] The REThinkWASTE data integration and analytics platform for intelligent waste management
    Livaldi, Andrea
    Bergamaschi, Sonia
    Orsini, Mirko
    Magnotta, Luca
    Venturi, Riccardo
    Gabri, Stefano
    PROCEEDINGS OF THE IEEE/ACM 10TH INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING, APPLICATIONS AND TECHNOLOGIES, BDCAT 2023, 2023,