Investigating Apache Hama: a bulk synchronous parallel computing framework

被引:8
|
作者
Siddique, Kamran [1 ]
Akhtar, Zahid [2 ]
Kim, Yangwoo [1 ]
Jeong, Young-Sik [1 ]
Yoon, Edward J. [3 ]
机构
[1] Dongguk Univ, Seoul, South Korea
[2] Univ Quebec, Montreal, PQ, Canada
[3] Samsung Elect, Seoul, South Korea
关键词
Apache Hama; Bsp; Bulk synchronous parallel; Distributed computing; Mapreduce; Hadoop;
D O I
10.1007/s11227-017-1987-9
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The quantity of digital data is growing exponentially, and the task to efficiently process such massive data is becoming increasingly challenging. Recently, academia and industry have recognized the limitations of the predominate Hadoop framework in several application domains, such as complex algorithmic computation, graph, and streaming data. Unfortunately, this widely known map-shuffle-reduce paradigm has become a bottleneck to address the challenges of big data trends. The demand for research and development of novel massive computing frameworks is increasing rapidly, and systematic illustration, analysis, and highlights of potential research areas are vital and very much in demand by the researchers in the field. Therefore, we explore one of the emerging and promising distributed computing frameworks, Apache Hama. This is a top level project under the Apache Software Foundation and a pure bulk synchronous parallel model for processing massive scientific computations, e.g. graph, matrix, and network algorithms. The objectives of this contribution are twofold. First, we outline the current state of the art, distinguish the challenges, and frame some research directions for researchers and application developers. Second, we present real-world use cases of Apache Hama to illustrate its potential specifically to the industrial community.
引用
收藏
页码:4190 / 4205
页数:16
相关论文
共 33 条
  • [1] Investigating Apache Hama: a bulk synchronous parallel computing framework
    Kamran Siddique
    Zahid Akhtar
    Yangwoo Kim
    Young-Sik Jeong
    Edward J. Yoon
    The Journal of Supercomputing, 2017, 73 : 4190 - 4205
  • [2] Apache Hama: An Emerging Bulk Synchronous Parallel Computing Framework for Big Data Applications
    Siddique, Kamran
    Akhtar, Zahid
    Yoon, Edward J.
    Jeong, Young-Sik
    Dasgupta, Dipankar
    Kim, Yangwoo
    IEEE ACCESS, 2016, 4 : 8879 - 8887
  • [3] Researching Apache Hama: A Pure BSP Computing Framework
    Siddique, Kamran
    Akhtar, Zahid
    Kim, Yangwoo
    ADVANCED MULTIMEDIA AND UBIQUITOUS ENGINEERING: FUTURETECH & MUE, 2016, 393 : 215 - 221
  • [4] REDUCING THE BULK IN THE BULK SYNCHRONOUS PARALLEL MODEL
    Barrett, R. F.
    Vaughan, C. T.
    Hammond, S. D.
    Roweth, D.
    PARALLEL PROCESSING LETTERS, 2013, 23 (04)
  • [5] PRACTICAL RESULTS USING APACHE HADOOP PLATFORM FOR DISTRIBUTED AND PARALLEL COMPUTING
    Toma, Cristian
    INTERNATIONAL CONFERENCE ON INFORMATICS IN ECONOMY, 2012, : 30 - 35
  • [6] Bulk synchronous parallel ML with exceptions
    Gesbert, Louis
    Gava, Fredric
    Loulergue, Frederic
    Dabrowski, Frederic
    DISTRIBUTED AND PARALLEL SYSTEMS: FROM CLUSTER TO GRID COMPUTING, 2007, : 33 - +
  • [7] An efficient iterative graph data processing framework based on bulk synchronous parallel model
    Liu, Chao
    Zeng, Deze
    Yao, Hong
    Yan, Xuesong
    Yu, Linchen
    Fu, Zhangjie
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2020, 32 (03)
  • [8] An object-oriented bulk synchronous parallel library for multicore programming
    Yzelman, A. N.
    Bisseling, Rob H.
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2012, 24 (05) : 533 - 553
  • [9] A scalable cloud computing deployment framework for efficient MapReduce operations using Apache YARN
    Raj, Ebin Deni
    Nivash, J. P.
    Nirmala, M.
    Babu, L. D. Dhinesh
    2014 INTERNATIONAL CONFERENCE ON INFORMATION COMMUNICATION AND EMBEDDED SYSTEMS (ICICES), 2014,
  • [10] Typhoon Quantitative Rainfall Prediction from Big Data Analytics by Using the Apache Hadoop Spark Parallel Computing Framework
    Wei, Chih-Chiang
    Chou, Tzu-Hao
    ATMOSPHERE, 2020, 11 (08)