Graphical Flow-based Spark Programming

被引:0
|
作者
Tanmaya Mahapatra
Christian Prehofer
机构
[1] Technische Universität München,Lehrstuhl für Software und Systems Engineering, Fakultät für Informatik
来源
Journal of Big Data | / 7卷
关键词
Spark pipelines; IoT mashup tools; Graphical tools; Stream analytics; Flow-based programming;
D O I
暂无
中图分类号
学科分类号
摘要
Increased sensing data in the context of the Internet of Things (IoT) necessitates data analytics. It is challenging to write applications for Big Data systems due to complex, highly parallel software frameworks and systems. The inherent complexity in programming Big Data applications is also due to the presence of a wide range of target frameworks, with different data abstractions and APIs. The paper aims to reduce this complexity and its ensued learning curve by enabling domain experts, that are not necessarily skilled Big Data programmers, to develop data analytics applications via domain-specific graphical tools. The approach follows the flow-based programming paradigm used in IoT mashup tools. The paper contributes to these aspects by (i) providing a thorough analysis and classification of the widely used Spark framework and selecting suitable data abstractions and APIs for use in a graphical flow-based programming paradigm and (ii) devising a novel, generic approach for programming Spark from graphical flows that comprises early-stage validation and code generation of Spark applications. Use cases for Spark have been prototyped and evaluated to demonstrate code-abstraction, automatic data abstraction interconversion and automatic generation of target Spark programs, which are the keys to lower the complexity and its ensued learning curve involved in the development of Big Data applications.
引用
收藏
相关论文
共 50 条
  • [21] Evaluating a Flow-Based Programming Approach as an Alternative for Developing CEP Applications in IoT
    Ortiz, Guadalupe
    Castillo, Ivan
    Garcia-de-Prado, Alfonso
    Boubeta-Puig, Juan
    IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (13) : 11489 - 11499
  • [22] Flow-based microimmunoassay
    Hayes, MA
    Polson, NA
    Phayre, AN
    Garcia, AA
    ANALYTICAL CHEMISTRY, 2001, 73 (24) : 5896 - 5902
  • [23] Work in Progress: Mflow, a Flow-based Music Programming Platform for Young Children
    Minces, Victor Hugo
    Xing, Wanli
    Li, Chenglu
    2023 IEEE WORLD ENGINEERING EDUCATION CONFERENCE, EDUNINE, 2023,
  • [24] ERSAP: Toward Better NP Data-Stream Analytics With Flow-Based Programming
    Gyurjyan, V.
    Abbott, D.
    Brei, N.
    Goodrich, M.
    Heyes, G.
    Jastrzembski, E.
    Lawrence, D.
    Raydo, Benjamin
    Timmer, C.
    IEEE TRANSACTIONS ON NUCLEAR SCIENCE, 2023, 70 (06) : 966 - 970
  • [25] Control Flow Versus Data Flow in Distributed Systems Integration: Revival of Flow-Based Programming for the Industrial Internet of Things
    Hasselbring, Wilhelm
    Wojcieszak, Maik
    Dustdar, Schahram
    IEEE INTERNET COMPUTING, 2021, 25 (04) : 5 - 12
  • [26] Flow-Based Image Abstraction
    Kang, Henry
    Lee, Seungyong
    Chui, Charles K.
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2009, 15 (01) : 62 - 76
  • [27] Flow-based Analytical Techniques
    Suzuki, Yasutada
    ANALYTICAL SCIENCES, 2018, 34 (08) : 865 - 865
  • [28] Flow-based Physical Security
    Al-Fedaghi, Sabah
    INTERNATIONAL JOURNAL OF SECURITY AND ITS APPLICATIONS, 2016, 10 (08): : 239 - 250
  • [29] Flow-Based Simulation Methodology
    Hoseinzadeh, Morteza
    IEEE COMPUTER ARCHITECTURE LETTERS, 2018, 17 (01) : 51 - 54
  • [30] Flow-based Analytical Techniques
    Yasutada SuziKl
    Analytical Sciences, 2018, 34 : 865 - 865