Performance Evaluation and Analysis of Multiple Scenarios of Big Data Stream Computing on Storm Platform

被引:5
作者
Sun, Dawei [1 ]
Yan, Hongbin [1 ]
Gao, Shang [2 ]
Zhou, Zhangbing [1 ]
机构
[1] China Univ Geosci, Sch Informat Engn, Beijing 100083, Peoples R China
[2] Deakin Univ, Sch Informat Technol, Geelong, Vic 3216, Australia
来源
KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS | 2018年 / 12卷 / 07期
基金
中国国家自然科学基金;
关键词
multiple scenarios; high throughput; low latency; stream computing; big data; Storm; ALGORITHM; OPTIMIZATION;
D O I
10.3837/tiis.2018.07.002
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In big data era, fresh data grows rapidly every day. More than 30,000 gigabytes of data are created every second and the rate is accelerating. Many organizations rely heavily on real time streaming, while big data stream computing helps them spot opportunities and risks from real time big data. Storm, one of the most common online stream computing platforms, has been used for big data stream computing, with response time ranging from milliseconds to sub-seconds. The performance of Storm plays a crucial role in different application scenarios, however, few studies were conducted to evaluate the performance of Storm. In this paper, we investigate the performance of Storm under different application scenarios. Our experimental results show that throughput and latency of Storm are greatly affected by the number of instances of each vertex in task topology, and the number of available resources in data center. The fault-tolerant mechanism of Storm works well in most big data stream computing environments. As a result, it is suggested that a dynamic topology, an elastic scheduling framework, and a memory based fault-tolerant mechanism are necessary for providing high throughput and low latency services on Storm platform.
引用
收藏
页码:2977 / 2997
页数:21
相关论文
共 28 条
  • [1] [Anonymous], 2017, IEEE T EMERGING TOPI
  • [2] Big Data computing and clouds: Trends and future directions
    Assuncao, Marcos D.
    Calheiros, Rodrigo N.
    Bianchi, Silvia
    Netto, Marco A. S.
    Buyya, Rajkumar
    [J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2015, 79-80 : 3 - 15
  • [3] Behroozifar M, 2017, INT J COMPUT SCI MAT, V8, P157, DOI 10.1504/IJCSM.2017.083749
  • [4] A survey on network community detection based on evolutionary computation
    Cai, Qing
    Ma, Lijia
    Gong, Maoguo
    Tian, Dayong
    [J]. INTERNATIONAL JOURNAL OF BIO-INSPIRED COMPUTATION, 2016, 8 (02) : 84 - 98
  • [5] Bat algorithm with triangle-flipping strategy for numerical optimization
    Cai, Xingjuan
    Wang, Hui
    Cui, Zhihua
    Cai, Jianghui
    Xue, Yu
    Wang, Lei
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2018, 9 (02) : 199 - 215
  • [6] Improved bat algorithm with optimal forage strategy and random disturbance strategy
    Cai, Xingjuan
    Gao, Xiao-zhi
    Xue, Yu
    [J]. INTERNATIONAL JOURNAL OF BIO-INSPIRED COMPUTATION, 2016, 8 (04) : 205 - 214
  • [7] Cardellini V, 2016, 2016 INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING & SIMULATION (HPCS 2016), P583, DOI 10.1109/HPCSim.2016.7568388
  • [8] Data-intensive applications, challenges, techniques and technologies: A survey on Big Data
    Chen, C. L. Philip
    Zhang, Chun-Yang
    [J]. INFORMATION SCIENCES, 2014, 275 : 314 - 347
  • [9] Optimal LEACH protocol with modified bat algorithm for big data sensing systems in Internet of Things
    Cui, Zhihua
    Cao, Yang
    Cai, Xingjuan
    Cai, Jianghui
    Chen, Jinjun
    [J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2019, 132 : 217 - 229
  • [10] A novel oriented cuckoo search algorithm to improve DV-Hop performance for cyber-physical systems
    Cui, Zhihua
    Sun, Bin
    Wang, Gaige
    Xue, Yu
    Chen, Jinjun
    [J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2017, 103 : 42 - 52