A topology-based scaling mechanism for Apache Storm

被引:12
作者
Shieh, Ce-Kuen [1 ]
Huang, Sheng-Wei [1 ]
Sun, Li-Da [1 ]
Tsai, Ming-Fong [2 ]
Chilamkurti, Naveen [3 ]
机构
[1] Natl Cheng Kung Univ, Dept Elect Engn, Inst Comp & Commun Engn, Tainan, Taiwan
[2] Feng Chia Univ, Dept Informat Engn & Comp Sci, Taichung, Taiwan
[3] La Trobe Univ, Dept Comp Sci & Comp Engn, Melbourne, Vic, Australia
关键词
Communication systems; computer networks; telecommunications; heterogeneous systems; network management; network monitoring; network analysis; capacity planning; network security; quality of service; SNMP; RMON; research and development; journal; online journal; Wiley Online Library;
D O I
10.1002/nem.1933
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
As more and more well-known companies, such as Twitter, Yahoo, and Alibaba, start to focus on real-time big data applications, how to build a platform for processing real-time data becomes an important issue. Among all the real-time processing systems, Apache Storm is the most well-known and representative open-source, distributed, real-time computation system. In Storm, the computation is implemented by a topology such as a graph where nodes are operators and edges represent the data flows between operators. In big data processing and analysis systems, scalability is an important issue. Storm provides rebalance mechanism for its scalability property, which can adjust the parallelism of a running topology. However, there are some drawbacks in rebalance command, such as resource usage restriction and topology execution suspension. In this paper, we propose a topology-based scaling mechanism for Apache Storm. When a topology is overloaded, it scales by adjusting the number of the cloned topologies or replaced by another new topology with more tasks. When scaling by topology-based mechanism, it eliminates resource usage restriction and execution suspension in the topology, and the procedure is automatically launched. The experimental results show that our topology-based scaling mechanism can improve the scaling performance of Storm. Copyright (C) 2016 John Wiley & Sons, Ltd.
引用
收藏
页数:12
相关论文
共 8 条
[1]  
Aniello L., 2013, P 7 ACM INT C DISTR, P207
[2]  
Bedini Ivan, 2013, P 4 ACMSPEC INT C PE, P173
[3]  
Dean J, 2004, USENIX ASSOCIATION PROCEEDINGS OF THE SIXTH SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION (OSDE '04), P137
[4]  
Goetz P., 2014, Storm blueprints: Patterns for distributed real-time computation
[5]  
Gunelius S., 2014, DATA EXPLOSION 2014
[6]  
Kreps J., 2011, ACM SIGMOD WORKSH NE, P6
[7]  
O'Malley O., 2009, TECHNICAL REPORT
[8]   T-Storm: Traffic-aware Online Scheduling in Storm [J].
Xu, Jielong ;
Chen, Zhenhua ;
Tang, Jian ;
Su, Sen .
2014 IEEE 34TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2014), 2014, :535-544