TOSS: A Topology-based Scheduler for Storm Clusters

被引:2
作者
Zhou, Yi [1 ]
Liu, Yangyang [2 ]
Zhang, Chaowei [2 ]
Peng, Xiaopu [2 ]
Oin, Xiao [2 ]
机构
[1] Columbus State Univ, TSYS Sch Comp Sci, 4225 Univ Ave, Columbus, GA 31907 USA
[2] Auburn Univ, Dept Comp Sci & Sgftware Engn, Auburn, AL 36849 USA
来源
2020 IEEE 34TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW 2020) | 2020年
关键词
D O I
10.1109/IPDPSW50202.2020.00101
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Nowadays, our world is undergoing profound challenges in processing a massive amount of data. A handful of computation technologies emerge as a promising computation platform for data intensive processing. Apache Storm is an outstanding open-source platform for large-scale streaming computation, which is widely used in the industry (e.g., Twitter). Performance bottleneck problems encountered in streaming data applications motivate us to investigate scheduling issues in Storm. A key aspect of tuning Storm performance is to decide how to deploy components of a storm application among all available nodes in a cluster. Driven by our observations, we design and implement a new scheduling strategy called TOSS based on application structures. Compared to the existing round-robin scheduler, TOSS not only judiciously handles tight-bind components, but also balances workloads by introducing a self-tuning mechanism in the deployment stage. We conduct experiments by applying two popular and distinct topologies to evaluate the performance of TOSS. The experimental results suggest that TOSS significantly boost the performance of the round-robin scheduler. In particular, TOSS sub-stantially improves the system throughput of Storm while shortens latency of Storm applications.
引用
收藏
页码:587 / 596
页数:10
相关论文
共 29 条
[1]  
Akay B, 2009, LECT NOTES ARTIF INT, V5796, P608
[2]  
Aniello L., 2013, P 7 ACM INT C DISTR, P207, DOI [10.1145/2488222, DOI 10.1145/2488222.2488267]
[3]  
[Anonymous], 1996, APPL LINEAR STAT MOD
[4]  
[Anonymous], 2018, EXTENSIBLE ADAPTABLE, DOI DOI 10.1007/978-981-13-2348-5_8
[5]  
Apache, 2015, STORM GITH REP
[6]   APPLYING NEW SCHEDULING THEORY TO STATIC PRIORITY PREEMPTIVE SCHEDULING [J].
AUDSLEY, N ;
BURNS, A ;
RICHARDSON, M ;
TINDELL, K ;
WELLINGS, AJ .
SOFTWARE ENGINEERING JOURNAL, 1993, 8 (05) :284-292
[7]  
BRYANT RE, 1986, IEEE T COMPUT, V35, P677, DOI 10.1109/TC.1986.1676819
[8]  
Chellappan S., 2018, PRACTICAL APACHE SPA, P141
[9]   POS: A High-Level System to Simplify Real-Time Stream Application Development on Storm [J].
Cui, Bin ;
Jiang, Jie ;
Huang, Quanlong ;
Xu, Ying ;
Gui, Yanjun ;
Zhang, Wenyu .
DATA SCIENCE AND ENGINEERING, 2016, 1 (01) :41-50
[10]  
Morales GD, 2015, J MACH LEARN RES, V16, P149